The DHS Program User Forum: Ethiopia » using design elements correctly

Home » Countries » Ethiopia » using design elements correctly

Show: Today's Messages :: Show Polls :: Message Navigator

using design elements correctly [message #25832]

Fri, 16 December 2022 03:42

gebretsh@gmail.com
Messages: 17
Registered: June 2022

Member

Dear Dr Tom and other DHS experts,

I'd like to thank you for your usual invaluable firsthand assistance with DHS data analysis.
I would like now to ask questions on the specifications of the three design elements of DHS data: cluster, weight and strata

1) I have a habit of using these three elements whenever I do analysis using DHS data via the svyset function in Stata.
svyset psu [pw=weight], strata(strata var) singleunit(centered)
Now, I read today the Stata's survey data reference manual and recommends the specification of a secondary sampling unit (ssu), which is the household ID in DHS, as follows:

svyset psu [pw=weight], strata(strata var) || household Id (v002)

I have already analyzed my data using the first command and sent it to a journal for publication. Should I re-analyze the data using the second code?

2) I want to use a Stata command that does not support "svy". The Stata command that I want to use is "mvdcmp", a tool used to do decomposition analysis between two groups. Now, In place of the svy command, I just opt to use another way of supplying the design elements into my syntax, as follows:
mvdcmp place of residence: logit skilled_onc_2days wealth_early1 wealth_early2 [pw=w1], robust cluster(id)
To add to the problem, this "mvdcmp" command does not support/accept strata, and supports only weight and cluster, as indicated above. Is there a severe problem If ignore the strata variable from being taken into account in my analysis.

Thanks so much for your advice.

Regards,

Report message to a moderator

Re: using design elements correctly [message #25836 is a reply to message #25832]

Fri, 16 December 2022 11:04

Bridgette-DHS
Messages: 3230
Registered: February 2013

Senior Member

Following is a response from Senior DHS staff member, Tom Pullum:

We recommend the version of svyset that you are currently using. I just ran the lines below on the Philippines 2017 DHS, for an example. #1 includes only the weights, v005. #2 adds the usual adjustments for clustering and strata with svyset. #3 is your proposed modification of svyset, with subsampling of households.

All three models give exactly the same estimates of coefficients. #2 and #3 give estimates of standard errors, test statistics, and confidence intervals that are different from #1. However, the estimates of standard errors, etc. are exactly the same in #2 and #3. That is, you can use #3 if you want but it appears from this simple check that the results will be the same as with #2.

Note: I am not proposing that you would analyze CEB with linear regression! This is just an example of a statistical model.

* Estimation #1
regress v201 i.v013 i.v190 [pw=v005]

* Estimation #1
svyset v001 [pw=v005], strata(v022) singleunit(centered)
svy: regress v201 i.v013 i.v190

* Estimation #1
svyset v001 [pw=v005], strata(v022) || v002
svy: regress v201 i.v013 i.v190

Report message to a moderator

Re: using design elements correctly [message #25837 is a reply to message #25836]

Fri, 16 December 2022 13:09

Bridgette-DHS
Messages: 3230
Registered: February 2013

Senior Member

Here's a response to your second question, from Tom Pullum:

mvdcmp is just one example of an estimation command that does not allow svyset. There are also some commands that allow svyset now, but not in earlier versions of Stata. When this happens, your only option is to make as many of the adjustments for weights, clustering, and stratification as possible. Sometimes, if svyset is not accepted, you can still include [pweight=v005] before the comma and cluster(v001) as an option after the comma. The adjustment for stratification is the only one of the three adjustments that can only be done with svyset and svy. If the estimation command does not accept these adjustments, all you can do is to put a comment in your paper or report saying which adjustments were not possible.

Report message to a moderator

Re: using design elements correctly [message #31262 is a reply to message #25837]

Wed, 06 August 2025 06:01

Chris78
Messages: 1
Registered: August 2025

Member

DHS surveys are designed and stratified at the PSU level, and households within PSUs are not independently sampled in a way that would necessitate treating them as a second-level sampling unit in most practical analyses.

Report message to a moderator

Previous Topic:	childhood acute respiratoty tract infection and associated factors by usingEDHS2016
Next Topic:	preference of data file and merging data files

Goto Forum:

-=] Back to Top [=-

[ Syndicate this forum (XML) ] [

] [

]

Current Time: Sun Dec 7 23:37:56 Coordinated Universal Time 2025