The DHS Program User Forum
Discussions regarding The DHS Program data and results
Home » Data » Dataset use in Stata » Merging and appending Kenya DHS
Re: Merging and appending Kenya DHS [message #12914 is a reply to message #12907] Mon, 07 August 2017 14:06 Go to previous messageGo to previous message
Bridgette-DHS is currently offline  Bridgette-DHS
Messages: 3117
Registered: February 2013
Senior Member
Following is a response from Senior DHS Stata Specialist, Tom Pullum:


I prefer "renormalize" to "denormalize"....

When using a cross-sectional analysis (e.g. for 2014) you do not need to renormalize, but you do need to choose between weights. If any variables in a regression or tabulation, etc., come from the IR file I would use v005 rather than hv005. v005 is equal to hv005, except for a slight adjustment for loss of a few women respondents. Similarly, if you are using variables from the MR file, then use mv005 rather than hv005. If you were using the couples file, that includes both v005 and mv005 on the same record, then preference is given to mv005 because male nonresponse is higher than female nonresponse.

I went ahead and did the merge for 2014 with the following Stata lines:

use e:\DHS\DHS_data\MR_files\KEMR70FL.dta, clear
keep mv001 mv002 mv003 mv005
rename mv001 hv001
rename mv002 hv002
rename mv003 hvidx
gen sex=1
save e:\DHS\DHS_data\scratch\KE_temp.dta, replace

use e:\DHS\DHS_data\IR_files\KEIR70FL.dta, clear
keep v001 v002 v003 v005
rename v001 hv001
rename v002 hv002
rename v003 hvidx
gen sex=2

append using e:\DHS\DHS_data\scratch\KE_temp.dta
sort hv001 hv002 hvidx
save e:\DHS\DHS_data\scratch\KE_temp.dta, replace

use e:\DHS\DHS_data\PR_files\KEPR70FL.dta, clear
keep hv001 hv002 hvidx hv005
sort hv001 hv002 hvidx
merge hv001 hv002 hvidx using e:\DHS\DHS_data\scratch\KE_temp.dta
tab _merge
keep if _merge==3
drop _merge

summarize *v005
pwcorr *v005


Here are the results from the last two lines:

index.php?t=getfile&id=768&private=0

I see that this survey only had a subsample of men. That may be an issue for the kind of analysis you want to do.

I would not say that you have a panel study; you have repeated cross-sections. If you are looking at changes from one survey to the next, you do not need to alter the weights. Because you have completely different men and women in each cross-section, and you are not (I hope!) trying to combine successive surveys in an additive way, then you do not need to renormalize. You definitely do not need to add up the weights.

If you still have doubts, please let me know (with an example) the sort of thing you plan to do with the combined file....

  • Attachment: v005.jpg
    (Size: 31.66KB, Downloaded 1694 times)
 
Read Message
Read Message
Read Message
Read Message
Previous Topic: DHS 2010 Malawi Report Replication
Next Topic: Definition of ARI
Goto Forum:
  


Current Time: Sun Jul 28 04:18:04 Coordinated Universal Time 2024