The DHS Program User Forum
Discussions regarding The DHS Program data and results
Home » Data » Dataset use in Stata » Merging datasets from multiple countries
Re: Merging datasets from multiple countries [message #4022 is a reply to message #4018] Wed, 18 March 2015 08:25 Go to previous message
Bridgette-DHS is currently offline  Bridgette-DHS
Messages: 3117
Registered: February 2013
Senior Member
Following is a response from DHS Senior Stata Specialist, Tom Pullum:


You can combine several countries/surveys in the way you describe with "append". It's an easy but powerful command. You can get the syntax with "help append". Here are a few suggestions. First, be sure to have some variable (I call it "survey") to identify the different surveys. You cannot always rely on hv000 or v000 to do this. Second, understand that some variables, such as region, are country-specific and the codes will mean different things in different surveys. Third, the variable and value labels for the final survey in the append will be the only ones that are saved. Fourth, do not keep more variable than you need because the file can get very large. Fifth, if you use svyset, you need to re-define the cluster and stratum variables, for example with "egen cluster=group(survey v001)", and probably re-normalize the weights. I prefer to weight each survey equally, for example by forcing the total weight in each survey to be one billion (hv005 or v005 is constructed to have mean value of 1 million). The steps to do this sort of thing are described elsewhere on the forum.
 
Read Message
Read Message
Read Message
Previous Topic: Link the DHS Individuals (IR) with DHS HIV (AR)
Next Topic: Creating a panel dataset
Goto Forum:
  


Current Time: Fri Jul 26 01:28:33 Coordinated Universal Time 2024