Re: Issues with Honduran DHS dataset 2011-12 [message #18943 is a reply to message #18843]
Mon, 23 March 2020 13:21
Mariela Contreras
Messages: 7 Registered: September 2019
Member
Thank you for your replies, Tom and Bridgette.
I am analyzing the association between intimate partner violence (IPV) and child stunting in Honduras. I am using the 2011 dataset.
Why do I get a population size (weighted observations) that is smaller than the number of observations (unweighted) during analysis? To document what I have done so far, my procedure in Stata is below.
Thank you for your support,
Mariela
====================================================================
PROCEDURES
====================================================================
I downloaded the dataset HNIR62FL.DTA and followed these steps:
1. Setting the sampling design features as:
*********************************************************
*WEIGHT VARIABLE
gen weight = d005/1000000
*SURVEY SET
gen psu = v021
gen strata = v022
svyset psu [pw = weight], strata(strata)
*********************************************************
2. Keeping only the records of women who answered the domestic violence questions (keep if v044==1)
3. Generating the variable "**Women age 15-49 who have experienced physical violence since age 15".
**********************************************************
gen everpsyvio=1 if (d105a>=1 & d105a<=4)|(d105b>=1 & d105b<=4)|(d105c>=1 & d105c<=4)|(d105d>=1 & d105d<=4)|(d105e>=1 & d105e<=4)|(d105f>=1 & d105f<=4)|(d105g>=1 & d105g<=4)|(d105j>=1 & d105j<=4)|(d130a>=1 & d130a<=4)
replace everpsyvio=1 if d115y==0
replace everpsyvio=1 if d118y==0
replace everpsyvio=0 if everpsyvio==.
**********************************************************
4. Generating the variable corresponding to "*Persons Committing Physical Violence //Current husband/partner"
**********************************************************
gen current=0 if everpsyvio==1
replace current=1 if v502==1 & ((d105a>=1 & d105a<=4)|(d105b>=1 & d105b<=4)|(d105c>=1 & d105c<=4)|(d105d>=1 & d105d<=4)|(d105e>=1 & d105e<=4)|(d105f>=1 & d105f<=4)|(d105j>=1 & d105j<=4))
replace current=1 if v502==1 & d118a==1
**********************************************************
5. Generating the child stunting variable. I use the youngest living child with a height measurement.
**********************************************************
**the youngest child's height
keep if hw70_1 < 9996
**computing height for age < -2
gen haz=hw70_1/100 //converting to standard deviation units (hw70 stores the z-score multiplied by 100)
gen pstunted=0
replace pstunted=1 if haz<-2
replace pstunted=. if haz==.
tab pstunted
**********************************************************
6. Cross-tabulation of physical violence by current husband/partner and child stunting
tab pstunted current //unweighted
           |        current
  pstunted |         0          1 |     Total
-----------+----------------------+----------
         0 |       800        611 |     1,411
         1 |       221        245 |       466
-----------+----------------------+----------
     Total |     1,021        856 |     1,877
svy: tab pstunted current, col //weighted
(running tabulate on estimation sample)
Number of strata   =        38          Number of obs     =      1,877
Number of PSUs     =       892          Population size   = 1,635.6582
                                        Design df         =        854
-------------------------------
          |       current
 pstunted |     0      1  Total
----------+--------------------
        0 | .8063  .7385  .7765
        1 | .1937  .2615  .2235
          |
    Total |     1      1      1
-------------------------------
  Key:  column proportion
  Pearson:
    Uncorrected   chi2(1)    =  12.2383
    Design-based  F(1, 854)  =   8.7656    P = 0.0032
.
end of do-file
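One way to see where the reported population size comes from is to sum the weights of the records that remain after the keep statements in steps 2 and 5. The lines below are only a sketch, not a definitive diagnosis. They assume that d005 is normalized over the full domestic violence sample, so that its sum over a subsample need not equal the number of records in that subsample, and they reuse the variable weight created in step 1.

```stata
* Sketch only: run in a do-file after the keep statements in steps 2 and 5.
* Assumption: d005 is normalized over all women selected for the
* domestic violence module, not over the records kept here.
count                 // unweighted number of records kept
summarize weight      // mean, min and max of the weights in the kept records
display r(sum)        // sum of the weights, i.e. the "Population size" shown by svy
```

If the displayed sum matches the population size of 1,635.6582, the difference from the 1,877 unweighted observations simply means that the average weight among the kept records is below 1.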