Re: Do I need to weight data for cohort analysis? [message #9506 is a reply to message #9505] |
Wed, 06 April 2016 20:24 |
Reduced-For(u)m
Messages: 292 Registered: March 2013
|
Senior Member |
|
|
The weights are not just for differing sample sizes and population (sorta) but for probability of being sampled - some groups of people are over/under sampled by design. So if you want to actually estimate levels (such as, the average schooling of a group of people) you do in fact need the weights. The clustering is also necessary to get correct standard error estimates, so you have to do that too. Neither of these should be difficult.
The time-series/cohort dimension might require you to do more with your standard errors (in the sense of clustering at a higher level) but it would depend on what exactly you are estimating and how. But the way you describe it - which is just to calculate average schooling by cohort - there is no real time dimension to your analysis, you are just picking your sub-groups based on cohort.
Now - if you wanted to estimate the determinants of schooling using variation in cohort exposure to some variable X, that would be a different story about the adjustments you want to make. But otherwise, you can just grab each cohort you want from the data, calculate the means using weights and the CI/p-val using clustered standard errors, and you are fine.
For Stata - to do that in a regression context, just "svy: reg Schooling if cohort==c" for each cohort c and the constant will give you the mean and you can use the confidence intervals Stata spits out. Before that you have to do the "svyset" command that is described in many other posts and on the DHS website, but the exact right specification of the "svyset" command depends on the particular survey you are using.
|
|
|