The DHS Program User Forum
Discussions regarding The DHS Program data and results
Home » Data » Dataset use in Stata » Understanding variables in PR & IR files
Understanding variables in PR & IR files [message #3546] Mon, 05 January 2015 06:31 Go to next message
Mercysh is currently offline  Mercysh
Messages: 35
Registered: April 2014
Member
I am doing data quality checks before analysis and checking consistency between hv112, child reported mother's line number (hv112) and b16 children's line number and hc60 (line number of mother from women's questioonaire).
I merged the Lesotho PR & IR files for 2009 using the following code:
use LSIR60FL
sort v001 v002 v003
save LIR_merge.dta
use LSPR60FL.DTA
gen v001 = hv001
gen v002 = hv002
gen v003 = hv112
save LPR_merge.dta
merge m:1 v001 v002 v003 using LIR_merge.dta
*drop if IR & PR matched
drop if _merge==3
tab b16_* if b16_* !=0
*only for complete women questioonaire
tab hv112 hc60 if hc60 !=994

Please help me understand how the codes 993 & 995 are derived when hv112 is valid line number (1 to 14).
I also don't know if I am interpreting correctly that there are women who reported individuals as their children but the children did not report them as their mothers.

Lastly:
using Lesotho PR file for 2009
I weight the data:
generate weight= hv005/1000000
tab hv105 [iweight= weight]
I get total of 43 162, which is 10 000 more than the 33 719 total reported in the report.

Please help


Mercy

[Updated on: Tue, 06 January 2015 06:51]

Report message to a moderator

Re: Understanding variables in PR & IR files [message #3684 is a reply to message #3546] Mon, 26 January 2015 10:04 Go to previous message
Bridgette-DHS is currently offline  Bridgette-DHS
Messages: 3199
Registered: February 2013
Senior Member
Following is a response from Senior DHS Stata specialist, Tom Pullum
First, hc60, like all of the hc variables, is only coded for young children. In LSPR60FL.dta, hc60 is missing (".") if hc1 (months of age) is missing, i.e. if hc1>59. If you do "label list hc60" you get this:

. label list hc60
hc60:
993 mother not de facto
994 incomplete woman's interview
995 not in household

If you then do "tab hc60" in LSPR60FL.dta, there are 2,584 non-missing values, one for every child with hc1<=59. Of these, 29 have code 993; 233 have code 994, and 591 have code 995.

Code 993 would typically mean that the mother was in the household but she was a "visitor" who did not sleep there the previous night, so she was not eligible for the woman's interview; code 994 means "incomplete woman's interview", i.e. she is a de facto member in the household listing but is not in the IR file, even though she should have been; code 995 means that the mother was not in the household at all, in which case the child should have hv112=0. However, I see that of the 591 children with hc1<=59 and hc60=995, there are 125 who do not have hv112=0. There are 470 children with hc1<=59 and hv112=0, and 4 of them do NOT have hc60=995.

I doubt that these 125+4=129 apparent discrepancies between hc60=995 and hv112==0 are data quality problems, because codes such as these are usually forced to be consistent. If I had more time, I would look into that.

However, I hope this answers your question about the coding of hc60.

Previous Topic: Merging male and female datasets
Next Topic: Brant test
Goto Forum:
  


Current Time: Sun Nov 24 22:58:55 Coordinated Universal Time 2024