Home » Data » Weighting data » Weighting and pooling multicountry datasets (is it enough to apply level 1 and level 2 weights at country level and used the poolet dataset?)
Weighting and pooling multicountry datasets [message #24571] |
Thu, 02 June 2022 07:32 |
Denise_kpebo
Messages: 2 Registered: June 2022
|
Member |
|
|
Hi,
I am working on a multilevel logistic regression for multi-country (n=15) study using the DHS guide (https://www.dhsprogram.com/pubs/pdf/MR27/MR27.pdf) for the calculation of level weight.
Firstly, I have conducted the analysis within each country separately to determine which level of alpha should be used (e.g., allocation of variation in weights to the level-1 and level-2 units).
Then based on alpha for each country, I've weighted each dataset separately before merging the 15 countries datasets
Is this sufficient? May I start my analysis at this point with the pooled database using the svyset command?
Or do I still need to apply additional weights considering the differences in population size, since the population are not the same? And if so, how can I do that?
Many thanks in anticipation of your help
[Updated on: Tue, 22 October 2024 15:18] by Moderator Report message to a moderator
|
|
|
Re: Weighting and pooling multicountry datasets [message #24581 is a reply to message #24571] |
Fri, 03 June 2022 08:45 |
Janet-DHS
Messages: 878 Registered: April 2022
|
Senior Member |
|
|
Following is response from DHS Research & Data Analysis Director, Tom Pullum:
There have many postings on how to weight pooled surveys. There are basically two options. The first is to re-scale the weight for each country so that the total weight for country X is proportional to the population of country X at the time of the survey. You can get the estimated population size from the UN Population Division website, World Population Prospects 2019. The second option is to rescale so that the total weight is the same for each country (or survey). That is, if you pool 10 countries, you re-scale so that each survey has 1/10 of the total weight. Specific steps for both options have been given on the forum.
The first option has the problem that typically one large country, such as India or Nigeria, will completely dominate the results.
At DHS we often leave the weights alone and pool surveys into a single file just to simplify the data processing. We give results separately for each survey, but do not give results for all the surveys combined. Pooled surveys, from different countries and different years, do not describe a well-defined population. It's very hard to interpret a mean or percentage or coefficient from a mix of different surveys.
[Updated on: Tue, 22 October 2024 15:19] by Moderator Report message to a moderator
|
|
|
|
|
|
Re: Weighting and pooling multicountry datasets [message #30246 is a reply to message #24581] |
Tue, 22 October 2024 07:07 |
ewa.b
Messages: 37 Registered: January 2014
|
Member |
|
|
Hello, I'd like to ask a question on the rescaling so that the total weight is the same for each country (or survey), so the second option mentioned by Janet above. I read a lot of posts on how to rescale the weights using UN estimates (first option) but I was not able to identify steps for the second. Could you send a link where it is described how to rescale the weights according to option 2? Many thanks.
[Updated on: Tue, 22 October 2024 15:20] by Moderator Report message to a moderator
|
|
|
Re: Weighting and pooling multicountry datasets [message #30249 is a reply to message #30246] |
Tue, 22 October 2024 15:24 |
Bridgette-DHS
Messages: 3189 Registered: February 2013
|
Senior Member |
|
|
Following is a response from Senior DHS staff member, Tom Pullum:
I have prepared a Stata program that illustrates how to pool and reweight three PR files, using the latest surveys from India, Nepal, and Bangladesh (see attached). For population sizes I just googled estimates for 2020, rounded to the nearest million. You can be more precise if you want. For different files you can get estimates of the relevant subpopulation from World Population Prospects (UN Population Division).
I often append surveys for computing convenience, but I avoid giving pooled estimates. If you do that, keep in mind that the actual surveys are from different dates. We rarely, if ever, cover an entire region or subregion. The largest country will dominate the results, sometimes overwhelmingly, as in this illustrative group of countries, which includes India. The label from the last file in the append will over-write other labels. Variables such as hv024 (region) are obviously different in different countries. Other variables, such as types of facilities, sources of drinking water, etc., often also vary across surveys and must be reconciled with recodes.
|
|
|
|
Re: Weighting and pooling multicountry datasets [message #30259 is a reply to message #30255] |
Wed, 23 October 2024 07:55 |
Bridgette-DHS
Messages: 3189 Registered: February 2013
|
Senior Member |
|
|
Following is a response from Senior DHS staff member, Tom Pullum:
To weight each survey equally, you would run the same program but replace all the "target" numbers with a fixed number such as 1. It should not matter what number you use. If you have ten surveys, say, then the procedure will add up the target values, get 10, and then alter hv005 so that each survey gets a proportion 1/10 of the total of hv005 in all ten surveys.
|
|
|
Goto Forum:
Current Time: Thu Nov 7 13:40:59 Coordinated Universal Time 2024
|