The DHS Program User Forum
Discussions regarding The DHS Program data and results
Home » Data » Weighting data » Re-Numbering Survey_Year and Survey_Rounds (Creating Unique survey_year and survey_round variables)
Re-Numbering Survey_Year and Survey_Rounds [message #17629] Tue, 30 April 2019 10:44 Go to next message
Yawo is currently offline  Yawo
Messages: 34
Registered: February 2019

I have pooled data from about 10 countries, and ready to generate new strata/cluster variables. Consistent with advice from this board, I created a new variable, "survey_year" which takes the value of 1, 2, 3, 4 (to represent each of the survey rounds within a country.

Here are the two commands I intend to use:

egen cluster=group(survey_year v001)
egen stratum=group(survey_year v024 v0025)

But I am not sure if the survey_year variable is uniquely representing the survey rounds. For example, I have the following rounds of surveys, and their corresponding "survey_year" values:

Congo2007: survey_year=1
Congo2013: survey_year=2

Gabon2012: survey_year=1

Ghana2003: survey_year=1
Ghana2014: survey_year=2

So, survey_year=1 can refer to different years. Does this matter, for generating new strata and cluster variables?

What if I create two variables: "survey_round" (with values: 1, 2, 3, 4) - to capture the different rounds of survey, and a second variable: "survey_year" (to consecutively number the survey years: 2003, 2004, 2005, 2006, .......2017).

The two new strata variables could be:

egen cluster=group(survey_year survey_round v001)
egen stratum=group(survey_year survey_round v024 v0025)

Would these new approach be ok?

Thanks - Yy
Re: Re-Numbering Survey_Year and Survey_Rounds [message #17718 is a reply to message #17629] Mon, 13 May 2019 12:21 Go to previous messageGo to next message
Bridgette-DHS is currently offline  Bridgette-DHS
Messages: 2537
Registered: February 2013
Senior Member
Please reference the response to another post: 16966.
Re: Re-Numbering Survey_Year and Survey_Rounds [message #17842 is a reply to message #17629] Mon, 24 June 2019 12:02 Go to previous message
boyle014 is currently offline  boyle014
Messages: 78
Registered: December 2015
Location: Minneapolis
Senior Member
Dear Yawo,

This kind of pooling is easier if you use IPUMS DHS. The various surveys will be appended in the data file you download, and all of the variables will be harmonized.

IPUMS DHS also includes unique ids for surveys (country, year, country-year), and unique strata and cluster variables created for comparative analyses. It also includes harmonized geographic regions for countries over time.

As of this writing, IPUMS DHS includes 156 standard DHS surveys--more are being added all the time.

Professor Elizabeth Boyle
Sociology & Law, University of Minnesota, USA
Principal Investigator, IPUMS-DHS
Previous Topic: Weighting Namibia 2013 Data (PR)
Next Topic: using sample weight
Goto Forum:

Current Time: Thu Jun 30 20:55:32 Coordinated Universal Time 2022