The DHS Program User Forum
Discussions regarding The DHS Program data and results
Home » Data » Dataset use in Stata » Unique identifier in pooled DHS (countries and years)
Unique identifier in pooled DHS (countries and years) [message #971] Tue, 10 December 2013 08:40 Go to previous message
marion is currently offline  marion
Messages: 1
Registered: December 2013
Member
Dear all,

I pooled the birth history datasets for all phases and all countries. My total sample consists of roughly 6,000,000 observations. I created a unique identifier by combining v000+v001+v002+v003+bidx, as it is recommended in the DHS manuals. However this combination does not uniquely identify each line (=each child). There are more than 1,500,000 observations which are not uniquely identified by this combination of variable. My guess is that is problem could be connected to the presents of different waves within the phases (e.g. for Uganda phase 5 there are the datasets UGBR5HDT and UGBR52DT). Do I have to assign a coding for the different waves within the phases? I did not read anything about this requirement in the manuals so I am not sure how to proceed best. Do you have any advises?

Many thanks and warm regards,
Marion
 
Read Message
Read Message
Previous Topic: calculating median breastfeeding duration using current status data
Next Topic: Pooled Datasets - Use of Svyset & regional controls
Goto Forum:
  


Current Time: Sun Nov 24 20:20:03 Coordinated Universal Time 2024