The DHS Program User Forum
Discussions regarding The DHS Program data and results
Home » Data » Weighting data » Weighted data and population size
Weighted data and population size [message #17720] Mon, 13 May 2019 20:29 Go to next message
shopnobaz is currently offline  shopnobaz
Messages: 4
Registered: March 2015
Location: Dhaka
Hello DHS experts,

I want to do a pooled analysis of BDHS 2007, 2011 and 2014 (KR file for children). As per forum discussion, during regression analysis of pooled data, I need to de-nomalize the sampling weight. I did this using

gen wgt = weight_all *** weight_all = v005/1000000(Total number of households during each survey year/sample households in each survey) and append three surveys data 
gen psu = cluster  *** each survey clusters are unique eg. 2007_1, 2011_1 and so on
svyset psu, weight(wgt) strata(strat) , singleunit(centered) || _n  *** each survey strata are unique

When I fitted weighted logistic regression, after adjusting weight this way, I found:
svy: logit y x

Number of strata   =        63                 Number of obs     =      19,896
Number of PSUs     =     1,561                 Population size   =  44,882,311

Could anyone please suggest that the process is correct? Is the population size reliable or not?

Thank you very much.
Re: Weighted data and population size [message #17738 is a reply to message #17720] Fri, 17 May 2019 12:59 Go to previous message
Bridgette-DHS is currently offline  Bridgette-DHS
Messages: 1906
Registered: February 2013
Senior Member

Following is a response from our Research & Data Analysis Director, Tom Pullum:

Is "weight_all = v005/1000000(Total number of households during each survey year/sample households in each survey)" a verbal description of a Stata command? If so, what command? It's not clear what you are doing to the weights, other than dividing by 1000000, which will not affect the results at all.

The "Population size" in the results has been distorted by svyset, particularly by the components other than the weight. You can ignore it.
Previous Topic: using SPSS Complex Samples
Next Topic: Weighting Namibia 2013 Data (PR)
Goto Forum:

Current Time: Fri Sep 18 07:29:38 Eastern Daylight Time 2020