Weighted data and population size [message #17720] |
Mon, 13 May 2019 20:29 |
shopnobaz
Messages: 4 Registered: March 2015 Location: Dhaka
|
Member |
|
|
Hello DHS experts,
I want to do a pooled analysis of BDHS 2007, 2011 and 2014 (KR file for children). As per forum discussion, during regression analysis of pooled data, I need to de-nomalize the sampling weight. I did this using
gen wgt = weight_all *** weight_all = v005/1000000(Total number of households during each survey year/sample households in each survey) and append three surveys data
gen psu = cluster *** each survey clusters are unique eg. 2007_1, 2011_1 and so on
svyset psu, weight(wgt) strata(strat) , singleunit(centered) || _n *** each survey strata are unique
When I fitted weighted logistic regression, after adjusting weight this way, I found:
svy: logit y x
Number of strata = 63 Number of obs = 19,896
Number of PSUs = 1,561 Population size = 44,882,311
Could anyone please suggest that the process is correct? Is the population size reliable or not?
Thank you very much.
|
|
|
Re: Weighted data and population size [message #17738 is a reply to message #17720] |
Fri, 17 May 2019 12:59 |
Bridgette-DHS
Messages: 3214 Registered: February 2013
|
Senior Member |
|
|
Following is a response from our Research & Data Analysis Director, Tom Pullum:
Is "weight_all = v005/1000000(Total number of households during each survey year/sample households in each survey)" a verbal description of a Stata command? If so, what command? It's not clear what you are doing to the weights, other than dividing by 1000000, which will not affect the results at all.
The "Population size" in the results has been distorted by svyset, particularly by the components other than the weight. You can ignore it.
|
|
|