How to combine data from different countries for a period of years? [message #3894] |
Mon, 02 March 2015 10:50 |
kinsukmanisinha@gmail.com
Messages: 9 Registered: January 2015 Location: Milan
|
Member |
|
|
Dear all,
I am using the DHS database for the first time and also I have never worked with survey data before. My aim is a bit broad, it is to understand which variables have an impact on child mortality/under five mortality.
In order to achieve my aim I need to combine datasets from different countries for different year and build a pooled database.
And, this is the first problem I face -- I don't know how to combine databases from different countries for different year. Do I need to add a weight? If yes, how do I calculate this weight and how do I take it into account? Or take another factor into account.
I do understand that my question is a bit naive but if anyone could point me in the right direction, that would be simply great..!!!!
Many thanks..!!!
Regards
Kinsuk
|
|
|
|
Re: How to combine data from different countries for a period of years? [message #3953 is a reply to message #3916] |
Tue, 10 March 2015 08:52 |
kinsukmanisinha@gmail.com
Messages: 9 Registered: January 2015 Location: Milan
|
Member |
|
|
Many Thanks for the detailed reply.
I agree with you, I should start with one data set first.
I read your points and I have a question:-
Point 3 says:-
3) You should know that each individual data set includes its own weight so you should be weighting data even if you are only analyzing one survey.
So, for example if I use only Egypt for year xxxx, I don't think I need to use weights. But if I use Egypt and Sudan for the same year, same survey, then I need to worry about weight. Is this what you meant?
Thanks once again..!!!
|
|
|
Re: How to combine data from different countries for a period of years? [message #4002 is a reply to message #3894] |
Mon, 16 March 2015 19:46 |
Trevor-DHS
Messages: 803 Registered: January 2013
|
Senior Member |
|
|
If you are using only Egpyt xxxx survey you should be using weights. The reason for this is that the probability of selection of households as used in the sample design varies for different parts of the country. To correct for this it is necessary to use weights. You will find the relative weights in one of the following variables:
v005 - for women and children's datasets
hv005 - for household and household member datasets
mv005 - for men's datasets
If you are combining surveys for, say, Sudan and Egypt, then you have to not only use the weights, but also denormalize the weights (described in other posts on the forum) before pooling the data.
[Updated on: Mon, 16 March 2015 19:50] Report message to a moderator
|
|
|
|