The DHS Program User Forum: Weighting data » Pooled data analysis from 25 countries

Home » Data » Weighting data » Pooled data analysis from 25 countries

Show: Today's Messages :: Show Polls :: Message Navigator

Re: Pooled data analysis from 25 countries [message #16616 is a reply to message #16587]

Wed, 06 February 2019 10:50

Bridgette-DHS
Messages: 3230
Registered: February 2013

Senior Member

Following is a response from Senior DHS Stata Specialist, Tom Pullum and Senior DHS Sampling Specialist, Mahmoud Elkasabi:

1. For certain countries cluster and PSU variables are not the same. In such cases which variable shall I use to create the otherwise PSU level variables.
In most of the countries, variables for cluster and PSU are the same. In cases where the two are different, you can use the cluster.

2. Do I need to specify the strata variable while using svyset command in Stata. If yes, how do i deal with missing strata information.
Yes, you can use HV022. In cases where HV022 is not consistent with the stratification described in sampling design in Appendix A of the final report, you can recode a new stratification variable. If you are using HV022, you should not expect to find missing strata information.

3. I have already de-normalized the weights as suggested in earlier posts on this forum. Do I need to re-normalize the weights before using them? If yes, how shall I do it?
I do not see a reason to re-normalize the de-normalized weights. This should not affect your regression.

4. In one post on this forum I read that in multi country analysis data must be clustered at country level. Do I need to do that for this analysis? If yes, how do I cluster data at two different levels i.e., country level and then individual PSU level?
I would recommend fixed effects for country or survey, rather than random effects. Just assign a survey code such as survey=1, 2, 3, etc. to each survey and include a term such as "i.survey" in the model specification.

Someone else could prefer random effects for surveys, especially if there are MANY surveys in the analysis. That would require a 3-level hierarchical model (for respondents / clusters / surveys).

When combining surveys, the strata and cluster codes but be unique. For example, you do not want cluster 1 in survey 1 to be confused with cluster 1 in survey 2. For example, you could have "egen cluster_id=group(survey hv021)" and "egen stratum_id=group(survey hv022)."

Report message to a moderator

[Message index]

		Pooled data analysis from 25 countries By: Mayank_Ag on Mon, 04 February 2019 07:10
		Re: Pooled data analysis from 25 countries By: Bridgette-DHS on Wed, 06 February 2019 10:50
		Re: Pooled data analysis from 25 countries By: boyle014 on Wed, 06 February 2019 12:51
		Re: Pooled data analysis from 25 countries By: Mayank_Ag on Wed, 06 February 2019 14:19
		Re: Pooled data analysis from 25 countries By: boyle014 on Wed, 27 February 2019 09:06

Previous Topic:	Pooled data: which weight to use
Next Topic:	weighting tabulations for 2 way tables

Goto Forum:

-=] Back to Top [=-

[ Syndicate this forum (XML) ] [

] [

]

Current Time: Thu Jan 1 14:59:09 Coordinated Universal Time 2026