The DHS Program User Forum
Discussions regarding The DHS Program data and results
Re: Revisiting the topic of weighting data [message #30698 is a reply to message #30695] Tue, 21 January 2025 14:14
Bridgette-DHS

Following is a response from Senior DHS staff member, Tom Pullum:

I am not enthusiastic about pooling surveys to get some kind of average over a long period of time. However, if you do that, a major issue is that the different surveys will have different sample sizes, and if you don't adjust for that, your results will be most influenced by the largest survey and therefore the conditions at the time of that survey.

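Say that n1, n2, n3, n4 are the four sample sizes and their total is N = n1+n2+n3+n4. Say that the weight variables in the samples are w1, w2, w3, w4. You can construct w1'=w1*N/(4n1), w2'=w2*N/(4n2), etc. Then, provided each original weight has a mean of 1 within its own survey (so that it sums to that survey's sample size), the sum of the rescaled weights will be the same, N/4, in each survey.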
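
As a rough illustration of the rescaling described above, here is a minimal Python sketch. The function name, the survey labels, and the toy weights are all hypothetical, and the example assumes each survey's weights already have a mean of 1 within that survey. It is not DHS or Stata code, only a check on the arithmetic.

```python
def rescale_pooled_weights(weights_by_survey):
    """Rescale each survey's weights by N / (k * n_i).

    weights_by_survey: dict mapping a survey label to a list of case weights.
    k is the number of surveys, n_i the number of cases in survey i,
    and N the total number of cases across all surveys.
    """
    k = len(weights_by_survey)
    total_n = sum(len(w) for w in weights_by_survey.values())
    rescaled = {}
    for survey, weights in weights_by_survey.items():
        factor = total_n / (k * len(weights))
        rescaled[survey] = [w * factor for w in weights]
    return rescaled


# Toy data (assumption): four surveys of unequal size, each with mean weight 1.
surveys = {
    "s1": [0.5, 1.5],
    "s2": [1.0, 1.0, 1.0, 1.0],
    "s3": [0.8, 1.2, 1.0],
    "s4": [2.0, 0.5, 0.5, 1.0, 1.0, 1.0, 1.0],
}

rescaled = rescale_pooled_weights(surveys)
sums = {survey: sum(w) for survey, w in rescaled.items()}
print(sums)  # each survey's rescaled weights sum to N/4
```

With these toy inputs N is 16, so every survey ends up with the same total weight of N/4 and contributes equally to a pooled estimate, regardless of its sample size.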

You can define the population of interest however you want but it sounds like you want the children under 5 to be the cases, and you would pool the KR files.

Most analysis uses pweights, and they are the weights in svyset. Pweights are automatically normalized in Stata to have a mean of 1 in the separate files and in the pooled file, so it doesn't really matter if you have a factor of 1000000 or something else.
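
The point about the scale factor can be checked with a small Python sketch. The values and weights below are made up for illustration, and the sketch only shows that a weighted mean is unchanged when every weight is multiplied by the same constant. It does not reproduce how Stata handles pweights internally.

```python
def weighted_mean(values, weights):
    """Weighted mean: sum(v * w) / sum(w)."""
    return sum(v * w for v, w in zip(values, weights)) / sum(weights)


# Toy data (assumption): a binary outcome and weights stored with a factor of 1000000.
values = [1.0, 0.0, 1.0, 1.0]
raw_weights = [1500000, 500000, 1000000, 1000000]
scaled_weights = [w / 1000000 for w in raw_weights]

mean_raw = weighted_mean(values, raw_weights)
mean_scaled = weighted_mean(values, scaled_weights)
print(mean_raw, mean_scaled)  # the two point estimates agree
```

Note that this invariance applies to a single constant applied to all cases. Survey-specific factors such as N/(4n1) differ between surveys, so they do change the pooled estimate, which is the purpose of the rescaling.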