The DHS Program User Forum
Discussions regarding The DHS Program data and results
Zambia 2018 -- duplicates in cluster number and household number [message #24978] Fri, 12 August 2022 11:47
nchesterman
Hi all,

I am attempting to merge variables from the male and female datasets onto the household dataset. The DHS help site for this indicates that v001 and v002 (cluster and household numbers) should be unique within each dataset. However, I am finding that these two variables do not uniquely identify all observations: many observations share the same pair of values.

I also checked my variables of interest and found that the values differ between the duplicated cases, so these are not true duplicate records.
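
For anyone who wants to reproduce the check described above, here is a minimal sketch in plain Python. The rows are made-up stand-ins for an individual-level file, not Zambia data, and the helper names `group_by_key` and `repeated_keys` are hypothetical. It also tries v003 (the respondent's line number in the individual recode files) as a third key, on the assumption that the repeats arise because those files hold one row per respondent rather than one row per household.

```python
from collections import defaultdict

# Made-up rows for illustration only: v001 = cluster, v002 = household,
# v003 = respondent's line number, v012 = respondent's age.
rows = [
    {"v001": 1, "v002": 5, "v003": 1, "v012": 34},
    {"v001": 1, "v002": 5, "v003": 2, "v012": 19},
    {"v001": 1, "v002": 7, "v003": 1, "v012": 27},
]


def group_by_key(records, key_fields):
    """Group records by the tuple of values in key_fields."""
    groups = defaultdict(list)
    for rec in records:
        groups[tuple(rec[f] for f in key_fields)].append(rec)
    return groups


def repeated_keys(records, key_fields):
    """Return only the key groups that contain more than one record."""
    grouped = group_by_key(records, key_fields)
    return {key: grp for key, grp in grouped.items() if len(grp) > 1}


# Keys repeated on cluster + household alone.
dups = repeated_keys(rows, ["v001", "v002"])
for key, group in dups.items():
    # True duplicates would match on every field, not just on the key.
    identical = all(rec == group[0] for rec in group)
    print(key, len(group), "identical" if identical else "differ")

# Assumption being tested: adding the line number makes the key unique.
still_repeated = repeated_keys(rows, ["v001", "v002", "v003"])
print("repeats with v003 added:", len(still_repeated))
```

If the second check reports no repeats on the real file, the rows sharing v001 and v002 are most likely different respondents in the same household rather than duplicated households.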

Has anyone else encountered this issue with the Zambia dataset, and does anyone have suggestions for how to merge successfully with the household dataset on cluster and household numbers?
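
To make the question concrete, one possible approach is sketched below in plain Python: collapse the individual-level rows to one summary per household first, then attach that summary to the household rows, so the merge is one-to-one on cluster and household. This is an assumption about what might work, not a confirmed DHS procedure. The data are made up, `attach_counts` and `n_women` are hypothetical names, and the household file is assumed to carry the same identifiers as hv001 and hv002.

```python
from collections import Counter

# Made-up household rows: hv001 = cluster, hv002 = household,
# hv009 = number of household members.
households = [
    {"hv001": 1, "hv002": 5, "hv009": 6},
    {"hv001": 1, "hv002": 7, "hv009": 3},
    {"hv001": 2, "hv002": 1, "hv009": 4},
]

# Made-up individual rows: several respondents can share one household.
women = [
    {"v001": 1, "v002": 5, "v003": 1},
    {"v001": 1, "v002": 5, "v003": 2},
    {"v001": 1, "v002": 7, "v003": 1},
]


def attach_counts(household_rows, individual_rows, new_field):
    """Count individuals per (cluster, household) and add the count
    to a copy of each household row; households with no match get 0."""
    counts = Counter((r["v001"], r["v002"]) for r in individual_rows)
    return [
        {**hh, new_field: counts.get((hh["hv001"], hh["hv002"]), 0)}
        for hh in household_rows
    ]


merged = attach_counts(households, women, "n_women")
for row in merged:
    print(row)
```

The household file keeps exactly one row per household, and the count could be swapped for any other household-level summary of the individual variables.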
