The DHS Program User Forum
Discussions regarding The DHS Program data and results
Home » Countries » India » Having problem in merging household members data using birth data
Having problem in merging household members data using birth data [message #3213] Fri, 07 November 2014 19:51 Go to next message
ddd2332 is currently offline  ddd2332
Messages: 2
Registered: November 2014
Location: Canada
Member
Hello.

I am trying to merge the birth order and twin variables from the birth dataset to the householdmember dataset for my research analysis (for India dataset).
I know that the identifier variables are v001 and v002, but I can see that merging by using v001 and v002 will mess up since different individuals have same v001 and v002 within house and that makes the identifiers not unique..
I am trying to figure out how to make a unique identifier so that I can merge these two data.

I would really really appreciate if someone could help me out on this..

Thank you so much!


Jung.
Re: Having problem in merging household members data using birth data [message #3215 is a reply to message #3213] Fri, 07 November 2014 20:51 Go to previous messageGo to next message
Trevor-DHS is currently offline  Trevor-DHS
Messages: 789
Registered: January 2013
Senior Member
Unfortunately you can not get a complete match, even for children as the information about birth order and whether the child is a twin are only available if the mother of the child also lives in the household, is aged between 15 and 49 and is successfully interviewed. the following code will match as much as is possible:

* Use the births recode file
use "IABR52FL.dta" 
* Keep only the variables of interest 
* Cluster, household, birth order, whether twin, line number from household schedule
keep v001 v002 bord b0 b16
* rename the cluster, household and line number for merging
rename v001 hv001
rename v002 hv002
rename b16 hvidx
* drop cases that don't have a household line number - either dead or live elsewhere
drop if hvidx==. | hvidx==0
* sort the data on cluster, household and line number
sort hv001 hv002 hvidx
* save a temporary working file
save "iabr52_subset.dta", replace

* Use the household members recode file
use "IAPR52FL.dta", clear
* Sort on the cluster, household and line number
sort hv001 hv002 hvidx
* Merge the data to add the birth order and whether a twin
merge 1:1 hv001 hv002 hvidx using "iabr52_subset.dta"
* Tabulate by age to see the match
tab hv105 _merge, row


Tabulate _merge by age to see which cases do not match (where _merge==1) and we get about an 85% match for children up to about age 8, and then it drops off to only about 70% by age 15. It is not perfect, but hopefully will be useful for your needs.
Re: Having problem in merging household members data using birth data [message #3221 is a reply to message #3215] Fri, 07 November 2014 21:56 Go to previous message
ddd2332 is currently offline  ddd2332
Messages: 2
Registered: November 2014
Location: Canada
Member
Thank you so much for your help! I really appreciate it!
Previous Topic: Weighting and slums in NFHS-3
Next Topic: DLHS-3 and DHLS-4 India
Goto Forum:
  


Current Time: Sun Apr 28 08:19:45 Coordinated Universal Time 2024