The DHS Program User Forum: Weighting data » Weighting district-level data

Home » Data » Weighting data » Weighting district-level data

Show: Today's Messages :: Show Polls :: Message Navigator

Re: Weighting district-level data [message #8793 is a reply to message #8791]

Fri, 18 December 2015 15:37

Reduced-For(u)m
Messages: 292
Registered: March 2013

Senior Member

Interesting. Quick replies:

1 - cool. My guess is that since it is usually 1 PSU going to 1 district, and all observations in some PSU have the same weight, that it is basically the numerator/denominator canceling each other out. But while we are on that - since it is a fraction, and since that fraction is an estimate of the population proportion, fewer observations means a less good estimate of the actual proportion. Clustering might not work here at all since you would lose the uncertainty generated in your first stage - you might have to bootstrap both stages. I'll leave that to you to decide how far you want to go down that rabbit-hole, but you may want to account for the uncertainty in your "observations" because those are estimates themselves.

2 - it is nice to help (smiley face)

3 - diff district_indicator, t (treated) p (t), cluster(District) ... I think the problem is the extra "," after p(t). Delete that, and I think it will run.

4 -I see what you are doing. That is interesting, and I could see how it would work. But you should also know that you are not necessarily comparing "apples to apples" anymore. Suppose PSU 1 is in District 1 in round 1. Then, in Round 2, District 1 contains PSU 397 (which wasn't sampled in round 1). Then you are using different people from different towns/areas to define the same "district" variable. How many PSUs per district do you have, on average? I ask because with many, I'd think a law of large numbers might apply and you'd be fine. But with only 1 or 2 PSUs per round, much of the difference across time within district is going to be do to sampling variation and not do to real changes. Again, in theory, with many N(obs), G(groups) and T(periods) you are OK, but in finite numbers you are asking a whole lot of the data.

Report message to a moderator

[Message index]

		Weighting district-level data By: amira.elshal.1@city.ac.uk on Thu, 17 December 2015 04:35
		Re: Weighting district-level data By: Reduced-For(u)m on Thu, 17 December 2015 17:32
		Re: Weighting district-level data By: amira.elshal.1@city.ac.uk on Fri, 18 December 2015 11:55
		Re: Weighting district-level data By: Reduced-For(u)m on Fri, 18 December 2015 12:47
		Re: Weighting district-level data By: amira.elshal.1@city.ac.uk on Fri, 18 December 2015 14:43
		Re: Weighting district-level data By: Reduced-For(u)m on Fri, 18 December 2015 15:37
		Re: Weighting district-level data By: amira.elshal.1@city.ac.uk on Sat, 19 December 2015 06:11
		Re: Weighting district-level data By: Reduced-For(u)m on Mon, 21 December 2015 16:08
		Re: Weighting district-level data By: amira.elshal.1@city.ac.uk on Thu, 24 December 2015 04:00
		Re: Weighting district-level data By: Reduced-For(u)m on Mon, 28 December 2015 17:53
		Re: Weighting district-level data By: amira.elshal.1@city.ac.uk on Mon, 21 December 2015 05:48
		Re: Weighting district-level data By: amira.elshal.1@city.ac.uk on Mon, 21 December 2015 16:51

Previous Topic:	Collection of Strata Information
Next Topic:	Descriptives and chisq

Goto Forum:

-=] Back to Top [=-

[ Syndicate this forum (XML) ] [

] [

]

Current Time: Tue Dec 16 23:04:43 Coordinated Universal Time 2025