The DHS Program User Forum
Discussions regarding The DHS Program data and results
Topic: Are the merging commands and results correct (Verifying the correctness of IR and PR Files merging command)
Re: Are the merging commands and results correct [message #18947 is a reply to message #18794] Tue, 24 March 2020 14:18
Bridgette-DHS

Following is a response from DHS Research & Data Analysis Director, Tom Pullum:

I recommend that when you rename the matching variables, you consistently use EITHER the PR variable names OR the IR variable names. For example: rename hv001 v001; rename hv002 v002; rename hvidx v003. Then sort both files on v001 v002 v003. The way you did it will work, but it is less systematic.
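The renaming and sorting steps above can be sketched as a short Stata do-file. This is only an illustration: the file names are hypothetical placeholders, the PR file is assumed to be the master file, and v001 v002 v003 are assumed to identify each woman uniquely in the survey at hand. Run it as one do-file, because the temporary file disappears when the do-file ends.

```stata
* Sketch only: "IR_file.dta" and "PR_file.dta" are hypothetical file names.

* Step 1: sort the IR file on its ID variables and keep a temporary copy.
use "IR_file.dta", clear
sort v001 v002 v003
tempfile ir_sorted
save "`ir_sorted'"

* Step 2: open the PR file and give the matching variables the IR names.
use "PR_file.dta", clear
rename hv001 v001      // cluster number
rename hv002 v002      // household number
rename hvidx v003      // line number of the household member
sort v001 v002 v003

* Step 3: one-to-one merge, with PR as master and IR as using.
merge 1:1 v001 v002 v003 using "`ir_sorted'"
tabulate _merge
```

If merge 1:1 stops with an error saying the key variables do not uniquely identify observations, the three IDs are not sufficient for that survey and the matching variables need to be revisited.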

I think the merge has worked OK. My rule of thumb is that about a quarter of the people in the PR file will be women age 15-49. That is, with the PR file as the master file, the number of cases with _merge==1 (in the PR file only) will be about 3 times the number of cases with _merge==3 (matched to the IR file). If that's what you got, you are OK.
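A quick way to check this rule of thumb is to compute the ratio directly. The lines below are a minimal sketch that assumes the merged data are in memory and that _merge was created with the PR file as the master file. The local macro names are arbitrary.

```stata
* Count PR-only records and matched records after the merge.
quietly count if _merge == 1
local pr_only = r(N)
quietly count if _merge == 3
local matched = r(N)

* Rule of thumb: the ratio is roughly 3 and the matched share roughly 0.25.
display "PR-only / matched       = " %5.2f `pr_only' / `matched'
display "Matched share of PR file = " %5.2f `matched' / (`pr_only' + `matched')
```

The figures are only approximate guides. The exact share depends on the age and sex structure of the surveyed population.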

You could consider reducing the PR file to women age 15-49 right after the "use ...PR..." command. This will give you both the de facto and de jure residents.
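A minimal sketch of that restriction follows. It assumes the standard DHS recode variables hv104 (sex, with 2 = female), hv105 (age in years), hv102 (de jure resident) and hv103 (de facto resident). The file name is a hypothetical placeholder.

```stata
* Sketch only: "PR_file.dta" is a hypothetical file name.
use "PR_file.dta", clear

* Keep women age 15-49; records with unknown age fall outside the range.
keep if hv104 == 2 & inrange(hv105, 15, 49)

* Cross-tabulate de jure against de facto status among the women kept.
tabulate hv102 hv103, missing
```

Note that once the PR file is restricted in this way, the 3-to-1 rule of thumb for _merge no longer applies. Most remaining _merge==1 cases would then be women listed in the household who do not appear in the IR file.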


 