NGHR7BFL [message #24711]
Mon, 27 June 2022 12:33
Mayo
Messages: 12 Registered: February 2019
Member
Hello,
I am using NGHR7BFL to do an analysis and in the data file, I noticed that there are multiple variables for "Highest educational level attained" (HV106$01 onward). Which one should I use?
Re: NGHR7BFL [message #24967 is a reply to message #24721]
Wed, 10 August 2022 16:51
Mayo
Messages: 12 Registered: February 2019
Member
Hi everyone,
I have another question. I am aware that different weights apply to different sample selections and units of analysis. Households will be my unit of analysis, and I would like to know how to weight this survey data properly in SPSS.
Thanks,
Dr. M.
Re: NGHR7BFL [message #24993 is a reply to message #24711]
Tue, 16 August 2022 09:32
Mayo
Messages: 12 Registered: February 2019
Member
Thanks again.
Another question: how do I know whether data are missing at random (MAR) or missing completely at random (MCAR)?
Dr. M.
Re: NGHR7BFL [message #25127 is a reply to message #24993]
Thu, 01 September 2022 16:34
Janet-DHS
Messages: 888 Registered: April 2022
Senior Member
Following is a response from DHS Research & Data Analysis Director, Tom Pullum:
You are asking questions that go beyond DHS data and therefore beyond the scope of the forum.
DHS has very low levels of "missing" data. A blank or dot in a DHS data file should be interpreted as Not Applicable (NA). If you are thinking of "missing" as "don't know" or "refused" or something like that, we use special codes such as 8, 9, 9994, etc., depending on the variable. The frequencies of those codes are usually very low.
In general, to test whether "missing" is random with respect to some potential covariate, you construct a binary variable that is 1 if "missing" and 0 if "not missing" and do a logit regression of that variable on the covariate, to see whether there is a statistically significant relationship.
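The test described above can be sketched in Python. This is a minimal illustration, not DHS code: the records are invented, the covariate is a hypothetical urban/rural flag, codes 8 and 9 are assumed to be the special "missing" codes for the variable in question, and the logit model is fitted with a small hand-written Newton-Raphson routine so that the example needs no external libraries. It ignores sampling weights and the survey design.

```python
import math

# Hypothetical records: (urban flag, response code). Codes 8 and 9 are
# ASSUMED to be the special "don't know"/"refused" codes for this variable.
SPECIAL_CODES = {8, 9}
records = [(0, 1)] * 30 + [(0, 8)] * 10 + [(1, 2)] * 38 + [(1, 8)] * 2

# Binary outcome: 1 if "missing", 0 if "not missing".
covariate = [urban for urban, _ in records]
missing = [1 if code in SPECIAL_CODES else 0 for _, code in records]


def fit_logit(x, y, iterations=25):
    """Fit logit(P(y=1)) = b0 + b1*x by Newton-Raphson.

    Returns (b0, b1, se_b1); the standard error comes from the
    information matrix evaluated at the last iterate before the final step.
    """
    b0, b1 = 0.0, 0.0
    for _ in range(iterations):
        g0 = g1 = h00 = h01 = h11 = 0.0
        for xi, yi in zip(x, y):
            p = 1.0 / (1.0 + math.exp(-(b0 + b1 * xi)))
            w = p * (1.0 - p)
            g0 += yi - p            # score for the intercept
            g1 += (yi - p) * xi     # score for the slope
            h00 += w                # information matrix entries
            h01 += w * xi
            h11 += w * xi * xi
        det = h00 * h11 - h01 * h01
        # Newton step: beta <- beta + I^{-1} * score
        b0 += (h11 * g0 - h01 * g1) / det
        b1 += (h00 * g1 - h01 * g0) / det
    se_b1 = math.sqrt(h00 / det)
    return b0, b1, se_b1


b0, b1, se_b1 = fit_logit(covariate, missing)
z = b1 / se_b1  # Wald statistic for the covariate
print(f"slope={b1:.3f}  se={se_b1:.3f}  z={z:.2f}")
```

A Wald statistic well beyond roughly ±1.96 suggests that missingness is related to the covariate. In practice a statistical package with survey-design support would be the more natural tool.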
Re: NGHR7BFL [message #25360 is a reply to message #25350]
Wed, 12 October 2022 09:37
Janet-DHS
Messages: 888 Registered: April 2022
Senior Member
Following is a response from DHS staff member Tom Pullum:
Beginning with DHS-7, most surveys include hv270a in the PR file, v190a in the IR, KR, and BR files, and mv190a in the MR file. The "a" indicates that the wealth quintiles are residence-adjusted, i.e. calculated separately for urban and rural areas. A problem with the original, unadjusted wealth quintiles is that, in most surveys, there are very few households in the top quintile in rural areas and very few households in the bottom quintile in urban areas. If you use the unadjusted wealth quintiles in a model, much of the information is actually an urban/rural distinction. If you use the unadjusted wealth quintiles in a model, AND include urban/rural (hv025, etc.) then you have a better separation of wealth and residence, but the model may run into estimation issues because there are (typically) so few cases in the two combinations I mentioned.
Bottom line: if your model includes urban/rural, which it probably should, then you may want to use the adjusted wealth quintiles rather than the unadjusted. But there's no law saying you have to do that. It would be good to tell the reader which version you are using.
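To see the sparse-cell problem in a given file, one can cross-tabulate the unadjusted quintile against residence. The sketch below uses invented counts, not figures from any survey. It assumes hv025 is coded 1 = urban, 2 = rural and hv270 is coded 1 (lowest) to 5 (highest); the threshold `min_count` is an arbitrary illustrative choice.

```python
from collections import Counter

# Hypothetical (hv025, hv270) pairs: residence (1 = urban, 2 = rural)
# and unadjusted wealth quintile (1 = lowest ... 5 = highest).
urban_counts = {1: 1, 2: 6, 3: 10, 4: 20, 5: 30}
rural_counts = {1: 30, 2: 25, 3: 15, 4: 8, 5: 2}
records = []
for quintile, n in urban_counts.items():
    records += [(1, quintile)] * n
for quintile, n in rural_counts.items():
    records += [(2, quintile)] * n

# Cross-tabulation: count of households in each residence-by-quintile cell.
table = Counter(records)


def sparse_cells(table, min_count):
    """Return the (residence, quintile) cells with fewer than min_count cases."""
    cells = [(res, q) for res in (1, 2) for q in range(1, 6)]
    return [cell for cell in cells if table[cell] < min_count]


for res, label in ((1, "urban"), (2, "rural")):
    print(label, [table[(res, q)] for q in range(1, 6)])
print("sparse cells:", sparse_cells(table, min_count=5))
```

With these invented counts, the two thin cells are the urban bottom quintile and the rural top quintile, which is the pattern described above.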
Re: NGHR7BFL [message #25602 is a reply to message #25585]
Wed, 16 November 2022 09:10
Janet-DHS
Messages: 888 Registered: April 2022
Senior Member
Following is a response from DHS staff member Tom Pullum:
The best variable for this purpose is hv000 (hv000 in Stata, HV000 in SPSS). That variable takes only one value, the string "NG7". If you tabulate that variable without weights, you get 188,010 cases in the PR file, i.e. individuals in the household survey. If you do the same thing in the HR file, which has households as units, you get 40,427 households.
Re: NGHR7BFL [message #26359 is a reply to message #26328]
Mon, 13 March 2023 14:00
Mayo
Messages: 12 Registered: February 2019
Member
Hi Janet,
Regarding the wealth quintile: is it calculated after data collection or beforehand?
Thanks in advance,
Dr. M.
Re: NGHR7BFL [message #26388 is a reply to message #26359]
Wed, 15 March 2023 10:31
Janet-DHS
Messages: 888 Registered: April 2022
Senior Member
Following is a response from DHS staff member, Tom Pullum:
The wealth quintiles are constructed from the raw data files after the data have been collected. Using household level characteristics (or "assets"), a principal components analysis is done with households as units. The first principal component is interpreted as the continuous wealth index. The quintiles (hv270 in the PR file) are constructed from the continuous index (hv271). The DHS website provides more details.
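The general idea (assets, then a first principal component, then quintiles) can be illustrated with a toy example. This is only a sketch under strong assumptions and is not the DHS procedure: the ten households and three binary asset indicators are invented, the first component is found by simple power iteration on the correlation matrix, and the quintiles are unweighted equal-sized groups. The actual construction involves further steps that are documented on the DHS website.

```python
import math

# Hypothetical households with three binary assets:
# (electricity, radio, improved floor). Invented data for illustration only.
households = [
    (0, 0, 0), (0, 0, 0), (1, 0, 0), (0, 1, 0), (1, 1, 0),
    (1, 1, 0), (1, 0, 1), (0, 1, 1), (1, 1, 1), (1, 1, 1),
]


def standardize(rows):
    """Convert each column to z-scores (mean 0, standard deviation 1)."""
    n, k = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(k)]
    sds = [math.sqrt(sum((r[j] - means[j]) ** 2 for r in rows) / n)
           for j in range(k)]
    return [[(r[j] - means[j]) / sds[j] for j in range(k)] for r in rows]


def first_component(z, iterations=200):
    """Leading eigenvector of the correlation matrix, by power iteration."""
    n, k = len(z), len(z[0])
    corr = [[sum(r[a] * r[b] for r in z) / n for b in range(k)]
            for a in range(k)]
    v = [1.0] * k
    for _ in range(iterations):
        w = [sum(corr[a][b] * v[b] for b in range(k)) for a in range(k)]
        norm = math.sqrt(sum(x * x for x in w))
        v = [x / norm for x in w]
    # Orient the component so that more assets means a higher score.
    return v if sum(v) >= 0 else [-x for x in v]


def to_quintiles(scores):
    """Assign 1..5 by rank; ties are broken by original order."""
    order = sorted(range(len(scores)), key=lambda i: scores[i])
    result = [0] * len(scores)
    for rank, i in enumerate(order):
        result[i] = rank * 5 // len(scores) + 1
    return result


z = standardize(households)
loadings = first_component(z)
# Continuous index (analogous in role to hv271): projection on component 1.
scores = [sum(l * x for l, x in zip(loadings, row)) for row in z]
# Quintile groups (analogous in role to hv270).
wealth_quintile = to_quintiles(scores)
print(wealth_quintile)
```

Because the index is computed from the collected household data, it can only be built after fieldwork, as the reply above states.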