The DHS Program User Forum
Discussions regarding The DHS Program data and results
Home » Data » Weighting data » Weighting and pooling multicountry datasets (is it enough to apply level 1 and level 2 weights at country level and used the poolet dataset?)
Weighting and pooling multicountry datasets [message #24571] Thu, 02 June 2022 07:32 Go to next message
Denise_kpebo is currently offline  Denise_kpebo
Messages: 2
Registered: June 2022
Member
Hi,


I am working on a multilevel logistic regression for multi-country (n=15) study using the DHS guide (https://www.dhsprogram.com/pubs/pdf/MR27/MR27.pdf) for the calculation of level weight.

Firstly, I have conducted the analysis within each country separately to determine which level of alpha should be used (e.g., allocation of variation in weights to the level-1 and level-2 units).
Then based on alpha for each country, I've weighted each dataset separately before merging the 15 countries datasets

Is this sufficient? May I start my analysis at this point with the pooled database using the svyset command?
Or do I still need to apply additional weights considering the differences in population size, since the population are not the same? And if so, how can I do that?

Many thanks in anticipation of your help

[Updated on: Tue, 22 October 2024 15:18] by Moderator

Report message to a moderator

Re: Weighting and pooling multicountry datasets [message #24581 is a reply to message #24571] Fri, 03 June 2022 08:45 Go to previous messageGo to next message
Janet-DHS is currently offline  Janet-DHS
Messages: 878
Registered: April 2022
Senior Member
Following is response from DHS Research & Data Analysis Director, Tom Pullum:

There have many postings on how to weight pooled surveys. There are basically two options. The first is to re-scale the weight for each country so that the total weight for country X is proportional to the population of country X at the time of the survey. You can get the estimated population size from the UN Population Division website, World Population Prospects 2019. The second option is to rescale so that the total weight is the same for each country (or survey). That is, if you pool 10 countries, you re-scale so that each survey has 1/10 of the total weight. Specific steps for both options have been given on the forum.

The first option has the problem that typically one large country, such as India or Nigeria, will completely dominate the results.

At DHS we often leave the weights alone and pool surveys into a single file just to simplify the data processing. We give results separately for each survey, but do not give results for all the surveys combined. Pooled surveys, from different countries and different years, do not describe a well-defined population. It's very hard to interpret a mean or percentage or coefficient from a mix of different surveys.

[Updated on: Tue, 22 October 2024 15:19] by Moderator

Report message to a moderator

Re: Weighting and pooling multicountry datasets [message #24613 is a reply to message #24581] Tue, 07 June 2022 22:45 Go to previous messageGo to next message
Denise_kpebo is currently offline  Denise_kpebo
Messages: 2
Registered: June 2022
Member
Thanks so much for your help on this

I'm going to use the first option and I have the population size for each country but this is my first time to weight data based on country population size and I actually have no idea of the state command to be used, could you please help me with that one last time?


Best

[Updated on: Tue, 22 October 2024 15:19] by Moderator

Report message to a moderator

Re: Weighting and pooling multicountry datasets [message #24627 is a reply to message #24613] Thu, 09 June 2022 08:38 Go to previous messageGo to next message
Janet-DHS is currently offline  Janet-DHS
Messages: 878
Registered: April 2022
Senior Member
Following is a response from DHS Research & Data Analysis Director, Tom Pullum:

If you search through previous forum responses you will find examples of how to do this.

[Updated on: Tue, 22 October 2024 15:19] by Moderator

Report message to a moderator

Re: Weighting and pooling multicountry datasets [message #26872 is a reply to message #24627] Sat, 13 May 2023 12:01 Go to previous messageGo to next message
jamwiggly is currently offline  jamwiggly
Messages: 2
Registered: May 2023
Member
I was searching but maybe those posts have been deleted. :(

[Updated on: Tue, 22 October 2024 15:19] by Moderator

Report message to a moderator

Re: Weighting and pooling multicountry datasets [message #30246 is a reply to message #24581] Tue, 22 October 2024 07:07 Go to previous messageGo to next message
ewa.b is currently offline  ewa.b
Messages: 37
Registered: January 2014
Member
Hello, I'd like to ask a question on the rescaling so that the total weight is the same for each country (or survey), so the second option mentioned by Janet above. I read a lot of posts on how to rescale the weights using UN estimates (first option) but I was not able to identify steps for the second. Could you send a link where it is described how to rescale the weights according to option 2? Many thanks.

[Updated on: Tue, 22 October 2024 15:20] by Moderator

Report message to a moderator

Re: Weighting and pooling multicountry datasets [message #30249 is a reply to message #30246] Tue, 22 October 2024 15:24 Go to previous messageGo to next message
Bridgette-DHS is currently offline  Bridgette-DHS
Messages: 3189
Registered: February 2013
Senior Member
Following is a response from Senior DHS staff member, Tom Pullum:

I have prepared a Stata program that illustrates how to pool and reweight three PR files, using the latest surveys from India, Nepal, and Bangladesh (see attached). For population sizes I just googled estimates for 2020, rounded to the nearest million. You can be more precise if you want. For different files you can get estimates of the relevant subpopulation from World Population Prospects (UN Population Division).

I often append surveys for computing convenience, but I avoid giving pooled estimates. If you do that, keep in mind that the actual surveys are from different dates. We rarely, if ever, cover an entire region or subregion. The largest country will dominate the results, sometimes overwhelmingly, as in this illustrative group of countries, which includes India. The label from the last file in the append will over-write other labels. Variables such as hv024 (region) are obviously different in different countries. Other variables, such as types of facilities, sources of drinking water, etc., often also vary across surveys and must be reconciled with recodes.

Re: Weighting and pooling multicountry datasets [message #30255 is a reply to message #30249] Wed, 23 October 2024 06:16 Go to previous messageGo to next message
ewa.b is currently offline  ewa.b
Messages: 37
Registered: January 2014
Member
Thank you for this reponse, it is very helpful. I'd like to ask a clarifying question. If I understood correctly, the procedure described in the response and the file involves recalculating weights using the information about countries population size at the time of the survey. This way of rescaling means that the largest country will dominate the results.

The response of Janet-DHS (Fri, 03 June 2022 13:45) below suggested that this is one of the options to rescale weights, with the second being "rescale so that the total weight is the same for each country (or survey). That is, if you pool 10 countries, you re-scale so that each survey has 1/10 of the total weight". Could you provide description on how to do that second type of rescaling?

Thank you again.

Re: Weighting and pooling multicountry datasets [message #30259 is a reply to message #30255] Wed, 23 October 2024 07:55 Go to previous message
Bridgette-DHS is currently offline  Bridgette-DHS
Messages: 3189
Registered: February 2013
Senior Member

Following is a response from Senior DHS staff member, Tom Pullum:

To weight each survey equally, you would run the same program but replace all the "target" numbers with a fixed number such as 1. It should not matter what number you use. If you have ten surveys, say, then the procedure will add up the target values, get 10, and then alter hv005 so that each survey gets a proportion 1/10 of the total of hv005 in all ten surveys.

Previous Topic: Strata in surveys where v022-v025 does not seem to work
Next Topic: Sampling weights with value zero
Goto Forum:
  


Current Time: Thu Nov 7 13:40:59 Coordinated Universal Time 2024