 The DHS Program User Forum
Discussions regarding The DHS Program data and results
Home » Topics » General » Survey Sampling weights (Weights)
Survey Sampling weights Wed, 08 April 2020 13:43  rwmungai Messages: 1Registered: April 2020 Member

I understand it is important to post questions in the Forum but I am unable to figure this out. That is why I sent an email.

gen wt=hv005/1000000
ta shstate [iw=wt]
This gives me a population of 189.7K. I expect to see a population in millions. So I decided as below
gen wt=hv005/1000
ta shstate [iw=wt]
This gives me a population of 189.7 million.

I want to get the "N" in terms of the total population and not the sample "n".

Thanks,
Rose
Re: Survey Sampling weights [message #19051 is a reply to message #19050] Wed, 08 April 2020 14:45 Liz-DHS Messages: 1516Registered: February 2013 Senior Member
A response from Dr. Tom Pullum:
Quote:

The weight variable in DHS surveys is normalized, which means that mean weight is 1, or, equivalently, the weighted total is equal to the unweighted total. HOWEVER, in order to get rid of a decimal point, the calculated weight is multiplied by one million and then rounded to the nearest integer.

When you use iweights you need to divide by one million. When you use pweights, as in a statistical model, you CAN divide by one million but you don't need to, because Stata automatically normalizes with pweights.

If you want inflation weights, which will give estimates of the population counts, you need to define new weights, and to do that you have to provide the population total. Say that the total number of households or persons in households or women 15-49 (depending on whether you are using the HR or PR or IR file) is N. You then multiply the weight by N/n, where n is the sample size. You also divide by one million and round to the nearest integer and then use fweight. Note that you cannot use standard errors, confidence intervals, or tests with fweight, only with pweight.

DHS advises against inflation weights. It can be difficult to find what is the population total on the date of the survey (the median date of interview, say). The sample includes only the household population and census estimates may be larger than the household population. It's unusual to see any use of inflation weights, but if you really want them, that's how you calculate them.

 Previous Topic: Using other country DHS data Next Topic: Calculating CI for median survival time
Goto Forum:

Current Time: Thu Dec 3 08:49:26 Coordinated Universal Time 2020