The DHS Program User Forum: Domestic Violence

Home » Topics » Domestic Violence » Compare two surveys (Methodology to compare two surveys)

Show: Today's Messages :: Show Polls :: Message Navigator

Compare two surveys [message #24742]

Mon, 04 July 2022 01:53

kmdshoyaib
Messages: 13
Registered: May 2022

Member

Hi all, I am planning to compare two Individual datasets from 2015 survey and 2019 survey. My dependent variable would be from Domestic Violence module and independent variables would be other factors such as occupation literacy etc.

I would like to know how to compare the variables affecting the domestic module.

Should i do the regression analysis on each dataset and then compare the odds ratio from one dataset to another dataset or is there any other way.

Any leads are highly appreciated.

Report message to a moderator

Re: Compare two surveys [message #24771 is a reply to message #24742]

Thu, 07 July 2022 12:34

Janet-DHS
Messages: 938
Registered: April 2022

Senior Member

Following is a response from DHS Research & Data Analysis Director, Tom Pullum:

You can pool the IR files from the two surveys into a single file. In Stata, this is done with the "append" command. Define a variable "survey" that is 0 for the cases in the 2015 survey and 1 for the cases in the 2019 survey. Then you can do the same kind of regression that you would do for a single survey, but in the pooled file you include "survey" as a covariate or predictor. If the coefficient for "survey" is statistically significant, then the relationship of the outcome to the other covariates has changed. This is a simple description of a strategy we often use at DHS.

Report message to a moderator

Re: Compare two surveys [message #24889 is a reply to message #24771]

Thu, 28 July 2022 22:01

kmdshoyaib
Messages: 13
Registered: May 2022

Member

Thank you for the reply. When comparing two different surveys, should i use complex analysis plan file for survey 1 or complex analysis plan file for survey 2 ?

Report message to a moderator

Re: Compare two surveys [message #24890 is a reply to message #24771]

Thu, 28 July 2022 22:03

kmdshoyaib
Messages: 13
Registered: May 2022

Member

Thank you for the response. When comparing, should I use the analysis plan file of survey 1 or analysis plan file of survey 2 ?

Report message to a moderator

Re: Compare two surveys [message #24912 is a reply to message #24889]

Mon, 01 August 2022 12:34

Janet-DHS
Messages: 938
Registered: April 2022

Senior Member

Following is a response from DHS Research & Data Analysis Director, Tom Pullum:

You need to construct new variables for the clusters and strata, like this:

egen cluster_ID=group(survey v001)
egen stratum_ID=group(survey v023)

and use these new variables in svyset. Note that even if v023 is defined the same way in both surveys, you still need to distinguish between the two surveys. You do not need to alter the weights. Good luck.

Report message to a moderator

Re: Compare two surveys [message #25717 is a reply to message #24771]

Wed, 30 November 2022 12:08

Nikee
Messages: 1
Registered: November 2022

Member

Greetings Janet,

I want to ask what kind of regression models and techniques could be used while comparing the two datasets from these two surveys. I am assessing the impact of different socioeconomic development factors on domestic violence.

Report message to a moderator

Re: Compare two surveys [message #25763 is a reply to message #25717]

Tue, 06 December 2022 09:19

Janet-DHS
Messages: 938
Registered: April 2022

Senior Member

Following is a response from DHS staff member Tom Pullum:

As suggested on July 7, you would construct a pooled file with a binary variable that is coded S=0 for the first survey and S=1 for the second survey. Say that V is a scale constructed from the DV module. In Stata, enter "regress V S". Look at the coefficient for S. If it is positive and statistically significant, then the mean of V is greater in survey 1 than in survey 0. If V is also a 0/1 (binary) variable, then you enter "logit V S" and look at the coefficient for S. You can have more elaborate models that include interaction terms, controls, etc.

It sounds like you want to include macro-level indicators of development. There are limitations to this. Suppose you hypothesized that domestic violence declines as women's education improves. To show this, you add to your regression the national percentage of women who have achieved some level of education at the time of the first survey and the time of the second survey--that is, two numbers, one for each survey. You re-run the regression above, including those numbers. This will not work, because those national two percentages will be confounded with S. (There are other ways to describe this issue.) But suppose instead that you used a variable in the data files, such as E=0 if the woman had a "lower" level of education and E=1 if she had a "higher" level. You could include E in your regression, AND you could include the interaction between E and S, to get at the effect of E on differences in V and differences in the trend. You could include cluster-level covariates from another source, such as the DHS spatial covariates files.

The models should include svyset and svy as described in earlier posts.

Report message to a moderator

Previous Topic:	Domestic violence and pregnancy outcomes
Next Topic:	Azerbaijan D103 "not at all"

Goto Forum:

-=] Back to Top [=-

[ Syndicate this forum (XML) ] [

] [

]

Current Time: Sun Aug 24 01:06:14 Coordinated Universal Time 2025