The DHS Program User Forum - RDF feed
https://userforum.dhsprogram.com/index.php
Comparing variables across different years
https://userforum.dhsprogram.com/index.phpindex.php?t=rview&goto=4142&th=2034#msg_4142
For my project I am looking at husbands' migration across the 2006 and 2011 Nepal DHS. Is it possible to compare variables across years and test whether they are significantly different? I know the PSUs and study design differ between years, so I am assuming that it's not correct to do something as simple as a chi-square test on a 2x2 table of year and another variable (example: Year x Sex).

How do you test if variables differ across years in a univariate analysis?

Thank you very much
Derek

UAB_user, 2015-04-07T20:18:35-00:00

Re: Comparing variables across different years
https://userforum.dhsprogram.com/index.phpindex.php?t=rview&goto=4144&th=2034#msg_4144
One way to think about it: you could pool the datasets (append them together) and regress the outcome on a constant and a dummy variable for "surveyed in 2011". The coefficient on the survey-year dummy is the difference between the two means, and a t-test of whether that coefficient equals 0 tests whether the means differ. (You could also run the regression with no constant and a dummy for each survey year; the coefficients are then the means for the two survey years, and you can test their equality with an F-test.) That would let you account for survey design in both years, provided you make your PSUs survey-specific and de-normalize your weights appropriately, though the de-normalizing may be less important for two surveys from the same country if they have similarly sized samples and similar designs.
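A minimal Stata sketch of the pooled approach described above. The file names and the outcome `y` are hypothetical. `v005` (sample weight), `v021` (PSU), and `v022` (strata) are the usual DHS recode variables, but the stratum variable in particular should be checked against each survey's documentation. De-normalizing the weights needs external population totals, so it is not shown here.

```stata
* Hypothetical file names; y is a placeholder 0/1 outcome
use "nepal2006.dta", clear
gen survey = 2006
append using "nepal2011.dta"
replace survey = 2011 if missing(survey)

* DHS weights are stored multiplied by 1,000,000
gen wt = v005/1000000

* Make PSUs and strata survey-specific so the two designs are not mixed
egen psu_id    = group(survey v021)
egen strata_id = group(survey v022)
svyset psu_id [pweight = wt], strata(strata_id)

* Constant plus a 2011 dummy: the coefficient on y2011 is the difference in means
gen byte y2011 = (survey == 2011)
svy: regress y y2011

* No constant, one indicator per survey: coefficients are the two means
svy: regress y ibn.survey, noconstant
test 2006.survey = 2011.survey
```

The two regressions answer the same question; the t-test on `y2011` and the `test` after the second model should lead to the same conclusion.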

You could do this separately for men and women, or you could make a dummy for each group (gender X year), put them all in one regression, and then test the coefficients of interest with post-estimation commands (you would use "test" in Stata).

Reduced-For(u)m, 2015-04-07T22:38:58-00:00

Re: Comparing variables across different years
https://userforum.dhsprogram.com/index.phpindex.php?t=rview&goto=4145&th=2034#msg_4145
Do you think it would be possible to fit multivariate models? Specifically, logistic regression models? I was thinking of using "year" as a variable to adjust for in the models, but I run into the same problems with PSUs as before.

Thanks for your help!
UAB_user, 2015-04-07T23:20:03-00:00

Re: Comparing variables across different years
https://userforum.dhsprogram.com/index.phpindex.php?t=rview&goto=4146&th=2034#msg_4146
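You can add covariates to the model in exactly the same manner. Instead of running a least-squares regression, you run a logit:

* list your covariates here
local X "list variables here"

* ibn.year keeps an indicator for every survey year; plain i.year would still drop the base year even with noconstant
svy: logit Y `X' ibn.year, noconstant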

Something like that. Or, if you wanted to compare changes in the X's over time, you could interact the variable with the survey year dummies, and run tests of statistical significance on the coefficient from the interaction term.
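As a rough sketch of the interaction approach, assuming the pooled data are already `svyset` and that `y`, `x`, and `survey` are placeholder names for a binary outcome, a continuous covariate, and the survey-year variable:

```stata
* Assumes svyset has already been declared on the pooled data
* y, x, and survey are hypothetical variable names
svy: logit y c.x##i.survey

* Wald test of whether the association between x and y differs across surveys
testparm i.survey#c.x
```

For a categorical covariate, `i.x##i.survey` would be the analogous specification, with `testparm i.x#i.survey` testing all interaction terms jointly.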

For your PSUs, just generate a new PSU variable that is PSU*10000+SurveyYear, and in your svyset command, set your cluster to that new PSU variable.
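A sketch of that PSU construction, assuming the pooled file has the DHS PSU variable `v021`, a hypothetical `survey` variable holding 2006 or 2011, and a weight `wt` already rescaled from `v005`. Storing the new ID as `long` avoids the rounding that the default `float` type introduces for integers above 16,777,216.

```stata
* survey and wt are hypothetical names; v021 is the DHS PSU variable
gen long psu_id = v021*10000 + survey

* Sanity check: every new PSU ID should belong to exactly one survey
bysort psu_id (survey): assert survey[1] == survey[_N]

* Strata should be made survey-specific in the same way before adding strata()
svyset psu_id [pweight = wt]
```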