Re: Cambodia mortality/morbidity rates due to accidents: calculation issues [message #9884 is a reply to message #9830] |
Thu, 02 June 2016 11:27 |
Bridgette-DHS
Messages: 3230 Registered: February 2013
|
Senior Member |
|
|
Following is a response from Senior DHS Stata Specialist, Tom Pullum:
You were right to expect the codes in the HR file to appear in the PR file too. It occasionally happens that they are not transferred. This is a data processing error and usually only happens, as in this case, with country-specific questions.
Note that if the accident resulted in a death, then the person who died will not be included in the household listing, and could not be in the PR file. Other accidents, that do NOT result in a death, COULD be associated with an individual in the PR file.
There appears to have been a coding omission. I am looking just at the 2014 survey, and at the files KHPR71FL.dta and KHHR72FL.data. I see that the HR file includes sh50 and sh53_1/2/3 through sh57_1/2/3, all related to accidents. The PR file includes sh50 and sh53 through sh57, without subscripts. The omission is that the line number of the person referred to was not coded. In the HR file there should have been, say, sh52_1/2/3, which would have been the line numbers of the three people (max) in the household who had accidents (but survived).
The line numbers are not given explicitly in the HR file, but apparently DP had access to them, because the PR file DOES identify the (surviving) individuals. You say that the variables are all missing in the PR file, but I do see them in KHPR71FL.dta. (I don't have time to look at the other Cambodia surveys.)
If you look at the KHPR71 file carefully (for example with "list hv001 hv002 hvidx sh50 sh53 sh54 sh55 sh56 sh57 if sh50==3, table clean") you will see that everyone in the household has the same value of sh50, and the people who had non-missing values of sh53-sh57 will have codes for those variables. For example, if there were 3 people who had accidents, those people will be identified. The value of sh54 (on survival) is superfluous because it is always yes. (People who died are identified in the HR versions of these variables, but you have no other information about them.)
I hope this will point you in the right direction. Let me know if you have other questions about how to use these variables.
|
|
|