The DHS Program User Forum
Discussions regarding The DHS Program data and results
Home » Countries » Other countries » Zambia 2018 -- duplicates in cluster number and household number (Trouble merging datasets due to duplicates)
Zambia 2018 -- duplicates in cluster number and household number [message #24978] Fri, 12 August 2022 11:47 Go to previous message
nchesterman is currently offline  nchesterman
Messages: 1
Registered: August 2022
Member
Hi all,

I am attempting to merge variables from the male and female datasets onto the household dataset. The DHS help site for this (https://dhsprogram.com/data/Merging-Datasets.cfm) indicates that v001 and v002 (cluster and household numbers) should be unique for the datasets. However, I am finding that these two variables do not uniquely identify all observations; there are many duplicates of these two variables in the dataset.

I also checked my variables of interest, and found that the data differs between duplicated cases. So the households are not true duplicates.

Has anyone else encountered this issue for the Zambia dataset, and have suggestions for how to successfully merge on cluster and household numbers with the household dataset?

Thanks,
Nathan
 
Read Message
Read Message
Previous Topic: Analyse multiniveau
Next Topic: Values in contraceptive calendar - South Africa
Goto Forum:
  


Current Time: Fri Mar 29 01:53:25 Coordinated Universal Time 2024