Home » Data » Weighting data » Response rate and weights (Effect on child health using information available for both parents)
Response rate and weights [message #26534] |
Thu, 30 March 2023 03:30 |
shreyaj7
Messages: 7 Registered: March 2023
|
Member |
|
|
It is mentioned in "Demonstration of How to Weight DHS Data in Stata" that when using couples' information one should use Men's weight as they have higher nonresponse rates. I am studying the effect on child health and want to use the sample in which the information for both the parents (like education, age, employment, etc.) are available (which is approximately 15% of the total sample in the KR file).
My concerns are:
1. Can we use this sample and still get results that are representative?
2. The weights that need to be used is men's weights or women's weight as mentioned earlier they have low response rates than women's.
Kindly help me out.
|
|
|
Re: Response rate and weights [message #26540 is a reply to message #26534] |
Thu, 30 March 2023 11:06 |
Bridgette-DHS
Messages: 3208 Registered: February 2013
|
Senior Member |
|
|
Following is a response from Senior DHS staff member, Tom Pullum:
The first issue I see is that children are linked with their mothers, and you cannot be sure that the mother's current partner is the child's father. To confirm that, you have to work with hv113-hv114 in the PR file, which identifies the father if he is alive and in the same household. Second, often there is subsampling of men for the interview with men. For example, in the NFHS-4 and -5, only 1/6 of men were interviewed. For these reasons, a study that includes the effect of the father's characteristics on the child's health and welfare can be challenging. (Important, but challenging.)
It is recommended that if you have a table, regression, etc., that includes variables from the survey of men, even if it also includes variables from the survey of women, you should use mv005 for the weight. The reason is that nonresponse is higher for men than for women. This would apply even if you are not using the CR file. It applies if you do any merge with the MR file and are including in your command any variables from the MR file.
At the same time it should be said that the estimates will not be very sensitive to which weight you use. Your conclusions will probably be robust with respect to the choice of weight. It's just considered to be "best practice."
|
|
|
|
Re: Response rate and weights [message #26555 is a reply to message #26549] |
Fri, 31 March 2023 12:01 |
Bridgette-DHS
Messages: 3208 Registered: February 2013
|
Senior Member |
|
|
Following is a response from Senior DHS staff member, Tom Pullum:
Recently I posted a program to merge all the data for children, mothers, and fathers. It may go beyond what you need, but it combines the data in the BR ((and KR), PR, IR, and MR files with children age 0-17 as cases. For the NFHS-4 and -5 surveys, which had only a 1/6 subsample of men, you will lose a lot of children and mothers in order to get the fathers, but yes, you will still have a representative sample. The estimates will be unbiased. And because these surveys were so large, you will still have a large sample.
If you want to compare the significance of effects for the mothers and fathers, you need to be careful. To take a simple example, say you wanted to look at the effects of maternal and paternal education on child survival. If you use the full sample to estimate the maternal effect, and the 1/6 sample to estimate the paternal effect, both coefficients will be unbiased, and comparable. However, even if the effects were equal the t or z score for the mothers would be about sqrt(6)=2.4 times as large as the one for men, with much more potential to be statistically significant. You'd have to take that difference in statistical power into account if you inferred that the mother's education was significant, but the father's education was NOT.
|
|
|
Re: Response rate and weights [message #26565 is a reply to message #26555] |
Sat, 01 April 2023 04:52 |
shreyaj7
Messages: 7 Registered: March 2023
|
Member |
|
|
Thank you for sharing the do file. It's very helpful. If at all possible for you can you share the merge_children_mothers_fathers.dta file. I want to compare my merged data file with it if I have got it right or made some mistakes in the process.
Please clarify one thing for me. In your example "effect of maternal and paternal education on child survival" you are looking at the effects of mother and father in separate regressions, right? Not taking them together in one regression? because if we do then we will be left with near about those children observations only for whom we have both parent's info who are alive and live with them. so roughly 35-40k observations.
Also, I had one more query. I just need father's education and mother's education to create my variable of interest and other controls could be mother, child, HH characteristics. So can I use the info about the child's father line no. in PR file and from there merge into KR file?
I tried this as well and now I have father's education variable for approximately 1,73, 000 children (in KR file out of 2,32,920).
Can I now use these 1,73,000 observations as my sample size and apply women's weights and do my analysis? Will that be correct?
|
|
|
|
|
|
|
|
Re: Response rate and weights [message #26635 is a reply to message #26633] |
Wed, 12 April 2023 07:48 |
Bridgette-DHS
Messages: 3208 Registered: February 2013
|
Senior Member |
|
|
Following is a response from Senior DHS staff member, Tom Pullum:
This looks good to me. The state weights are equal to the national weights multiplied by a constant (for each state) so the means should come out the same using state weights or the national weight. The national weight takes into account the different sampling fractions in different states and is definitely what you want to use for national estimates.
The only thing I might do differently would be to use the CR file rather than the IR file. For couples in the MR file, the woman and man have to name each other as partners, leading to better matching. The level of education for the man is reported by the man himself, rather than from the wife, who may introduce some bias, especially if there is a large difference in their levels of education. If you repeat the analysis using the CR file, the man's weight would be preferable because there is a higher level of nonresponse for men.
I would also include parallel analyses of hypergamy (marrying up) and homogamy (the same level). I think what you want to get at is the balance between hypogamy and hypergamy, and that's going to be affected by the amount of detail in the education distribution. For example, if the distribution is very coarse, such as no/any education, then that alone will lead to more homogamy and less of the other two.
|
|
|
|
Re: Response rate and weights [message #26706 is a reply to message #26702] |
Thu, 20 April 2023 11:21 |
fred.arnold@icf.com
Messages: 84 Registered: May 2021
|
Senior Member |
|
|
Women and men who are eligible for the individual questionnaire are each asked what their caste/tribe is and whether they belong to a scheduled caste, a scheduled tribe, an other backward class, or none of these. However, although the first question specifies the caste/tribe, there are more than 1,000 castes/tribes and there is no variable for those castes/tribes. Also, a much smaller percentage of men than women are eligible for an individual interview.
|
|
|
Goto Forum:
Current Time: Thu Dec 12 23:43:18 Coordinated Universal Time 2024
|