Re: Merging datasets from multiple countries [message #4022 is a reply to message #4018] |
Wed, 18 March 2015 08:25 ![Go to previous message Go to previous message](/theme/default/images/up.png) |
Bridgette-DHS
Messages: 3230 Registered: February 2013
|
Senior Member |
|
|
Following is a response from DHS Senior Stata Specialist, Tom Pullum:
You can combine several countries/surveys in the way you describe with "append". It's an easy but powerful command. You can get the syntax with "help append". Here are a few suggestions. First, be sure to have some variable (I call it "survey") to identify the different surveys. You cannot always rely on hv000 or v000 to do this. Second, understand that some variables, such as region, are country-specific and the codes will mean different things in different surveys. Third, the variable and value labels for the final survey in the append will be the only ones that are saved. Fourth, do not keep more variable than you need because the file can get very large. Fifth, if you use svyset, you need to re-define the cluster and stratum variables, for example with "egen cluster=group(survey v001)", and probably re-normalize the weights. I prefer to weight each survey equally, for example by forcing the total weight in each survey to be one billion (hv005 or v005 is constructed to have mean value of 1 million). The steps to do this sort of thing are described elsewhere on the forum.
|
|
|