The DHS Program User Forum - RDF feed
https://userforum.dhsprogram.com/index.php
Mismatch between National report and NFHS4 raw data analysis
https://userforum.dhsprogram.com/index.phpindex.php?t=rview&goto=16227&th=7505#msg_16227
I am working with NFHS4 raw data. Many times my analysis is not matching with National Report. Tried to sort out this issue, discussed with various people, but couldn't succeed. For example the number given in the national report of children born in the two years preceding the survey who ever breasted itself is greater than the total number of children which includes both who breastfed and not. How come that number will be greater than this total number. This is only one example, there are many other issues as well. I have used weights and all, followed every statistical procedure explained by USAID people.]]>Muneer Kalliyil2018-11-29T02:40:37-00:00Re: Mismatch between National report and NFHS4 raw data analysis
https://userforum.dhsprogram.com/index.phpindex.php?t=rview&goto=16237&th=7505#msg_16237
fredarnold2018-11-30T16:48:55-00:00Re: Mismatch between National report and NFHS4 raw data analysis
https://userforum.dhsprogram.com/index.phpindex.php?t=rview&goto=16239&th=7505#msg_16239
I am very gratitude andThanks for your response
If you are saying this number is (97,989) is the total number and the denominator, again it is not matching with our analysis, I have attached the stata code for the same, let me know whether I have made any mistakes.
These are my stata command used for
***********************************************************
clear all
use "/Users/muneerkalliyil/Desktop/UNICEF/NFHS4/IAKR74FL.DTA"
***APPLYING WEIGHTS************************
*Survey Weights
des v005
gen survweight = v005/1000000
gen stratum = v023
svyset v021 [iweight=survweight], strata(stratum)
****AFTER APPLYING WEIGHTS, JUST WE ARE LOOKING HOW MANY CHILDRENS ARE THERE BELOW 24 MONTHS,
**total number of children under 24 months
svy: tab hw1 if hw1 < 24
*******************************************************
The result says no of obs as 96782;
. svy: tab hw1 if hw1 < 24
(running tabulate on estimation sample)
Number of strata = 2513 Number of obs = 96782
Number of PSUs = 26472 Population size = 93053.216
Design df = 23959
NFHS4 National report gives the total number of children below 24 months as (97989), but our analysis shows as (96782 )
Please let me know, waiting for your reply
Regards
Muneer]]>Muneer Kalliyil2018-12-01T02:37:55-00:00Re: Mismatch between National report and NFHS4 raw data analysis
https://userforum.dhsprogram.com/index.phpindex.php?t=rview&goto=16246&th=7505#msg_16246
fredarnold2018-12-02T22:42:38-00:00Re: Mismatch between National report and NFHS4 raw data analysis
https://userforum.dhsprogram.com/index.phpindex.php?t=rview&goto=16247&th=7505#msg_16247
Thanks for your reply
I will be very grateful if you could tell me how to include both living and dead children. As per my understanding, the variable hw1 (age in months) includes both, I have gone through all other variables in Children file, but couldn't get a variable differentiating between dead and alive children. Please let me know if you know how to include.
Looking forward to hearing from you
Thanks in advance]]>Muneer Kalliyil2018-12-03T03:53:52-00:00Re: Mismatch between National report and NFHS4 raw data analysis
https://userforum.dhsprogram.com/index.phpindex.php?t=rview&goto=16251&th=7505#msg_16251
fredarnold2018-12-03T21:25:04-00:00Re: Mismatch between National report and NFHS4 raw data analysis
https://userforum.dhsprogram.com/index.phpindex.php?t=rview&goto=16254&th=7505#msg_16254
very thankful to your replies
I tried using V008 - B3 < 24 for age, then I got the number of children below 24 as 101955. However, the national report says 95% of the total below 24 has been breastfed, if we take the 95% of this number (96857), again it is not matching with national report number (97989). Could you please let me know if I am making any mistakes
Looking forward to hearing from you
Regards]]>Muneer Kalliyil2018-12-04T03:36:02-00:00Re: Mismatch between National report and NFHS4 raw data analysis
https://userforum.dhsprogram.com/index.phpindex.php?t=rview&goto=16257&th=7505#msg_16257
fredarnold2018-12-04T18:53:50-00:00Re: Mismatch between National report and NFHS4 raw data analysis
https://userforum.dhsprogram.com/index.phpindex.php?t=rview&goto=16259&th=7505#msg_16259
I have already applied survweight, tried both pweight and iweight. The number which I told earlier is coming after applying the weight. Secondly, If the number in the second column is denominator, then our estimated denominator is definitely coming larger than the given denominator. As I told before, in our estimation, the total number of children below 24 months is coming around 101955. However, the given number in the report is only around 97989.
Please let me know if I am making any mistakes.
Looking forward to hearing from you
Thanks]]>Muneer Kalliyil2018-12-05T04:28:46-00:00Re: Mismatch between National report and NFHS4 raw data analysis
https://userforum.dhsprogram.com/index.phpindex.php?t=rview&goto=16262&th=7505#msg_16262
gen age = v008 b3
gen xweight = v005/1000000
tabulate age if age < 24 [iw=xweight]
Regards
Muneer]]>Muneer Kalliyil2018-12-06T01:57:59-00:00Re: Mismatch between National report and NFHS4 raw data analysis
https://userforum.dhsprogram.com/index.phpindex.php?t=rview&goto=16343&th=7505#msg_16343
First of all thank you so much for earlier clarification regarding mismatch in NFHS national report, Your solution was working perfectly.
However, now I have come across state-level reports. For example, I have taken the Jharkhand report. Table 61 of initial breastfeeding has been attached for your reference.
The table says that Number of last born children below 24 months is given as 4723.
However my estimation is not matching with state report, Stata coding is given below;
************************************************************ *******
****Jharkhand****
clear all
use "/Users/muneerkalliyil/Desktop/UNICEF/NFHS4/IAKR74FL.DTA"
gen survweight = v005/1000000
gen stratum = v023
svyset v021 [iw=survweight], strata(stratum)
gen age = v008 - b3
numlabel, add
****keeping only Jharkhand****
tab v024
keep if v024 == 15
****giving weight****
tabulate age if age < 24 & midx==1 [iw=survweight]
****without weight*****
tab age if age < 24 & midx==1
************************************************************ ********
I have estimated number of last born children below 24 months, giving weight and without giving as well.
Result says the number of children with weight = 2864, without weight = 4705.
Therefore, in both cases, the result is not matching with state report. It says the number is 4723. And the gap becomes large when we use weight.
My queries are;
Is it correct to use weight in the state level, if yes, is it different from national weight?
Why the gap is large with and without weight
Could you please sort out my problem as soon as possible
Looking forward to hearing from you
Thanks in advance
]]>Muneer Kalliyil2018-12-18T05:46:12-00:00Re: Mismatch between National report and NFHS4 raw data analysis
https://userforum.dhsprogram.com/index.phpindex.php?t=rview&goto=16346&th=7505#msg_16346
fredarnold2018-12-18T16:44:18-00:00Re: Mismatch between National report and NFHS4 raw data analysis
https://userforum.dhsprogram.com/index.phpindex.php?t=rview&goto=16348&th=7505#msg_16348
It is working perfectly]]>Muneer Kalliyil2018-12-19T01:21:47-00:00Re: Mismatch between National report and NFHS4 raw data analysis
https://userforum.dhsprogram.com/index.phpindex.php?t=rview&goto=16350&th=7505#msg_16350
Sorry for spamming your inbox again and again.
This time again with earlier same table, Table 61 Initial breastfeeding (attached below).
I am looking to the second column, Percentage who started breastfeeding within one hour of birth. The table says that 29.9% of Urban and 33.8% of Rural children have been put into breast within one hour of birth.
myStata code;
recode m34 (0/100 = 1) (101/236 = 0), gen(bf1hour)
svy: tab bf1hour if age < 24 & midx==1 & v102 == 1
svy: tab bf1hour if age < 24 & midx==1 & v102 == 2
and our results say
. svy: tab bf1hour if age < 24 & midx==1 & v102 == 1
(running tabulate on estimation sample)
Number of strata = 24 Number of obs = 828
Number of PSUs = 282 Population size = 858.55407
Design df = 258
RECODE of
m34 (when
child put
to
breast) proportions
0 .6929
1 .3071
Total 1
Key: proportions = cell proportions
. svy: tab bf1hour if age < 24 & midx==1 & v102 == 2
(running tabulate on estimation sample)
Number of strata = 72 Number of obs = 3735
Number of PSUs = 917 Population size = 3725.5765
Design df = 845
RECODE of
m34 (when
child put
to
breast) proportions
0 .6518
1 .3482
I agree it is a small difference, 30.71 in place of 29.9% and 34.82 in place of 33.8%, still, I am asking you because I got an exact matching solution from you for earlier problems.
Looking forward to hearing from you
Thanks in advance
]]>Muneer Kalliyil2018-12-19T16:09:51-00:00Re: Mismatch between National report and NFHS4 raw data analysis
https://userforum.dhsprogram.com/index.phpindex.php?t=rview&goto=16363&th=7505#msg_16363
. svy: tab bf1hour if age < 24 & midx == 1 & v102 == 1 & v024 == 15
(running tabulate on estimation sample)
Number of strata = 24 Number of obs = 852
Number of PSUs = 286 Population size = 881.91211
Design df = 262
-----------------------
RECODE of |
m34 (when |
child put |
to |
breast) | proportions
----------+------------
0 | .7011
1 | .2989
|
Total | 1
-----------------------
Key: proportions = cell proportions
]]>fredarnold2018-12-26T17:15:50-00:00Re: Mismatch between National report and NFHS4 raw data analysis
https://userforum.dhsprogram.com/index.phpindex.php?t=rview&goto=16364&th=7505#msg_16364
The problem was in recoding, now it is matching perfectly
Thank
keep in touch with]]>Muneer Kalliyil2018-12-27T14:07:11-00:00Re: Mismatch between National report and NFHS4 raw data analysis
https://userforum.dhsprogram.com/index.phpindex.php?t=rview&goto=16686&th=7505#msg_16686
When I use svyset command with state level weights for doing state level analysis for India, I use the weight variable sv005. In that, does my strata remain the same? That is v022?
Thank you,
Pooja
fredarnold wrote on Wed, 26 December 2018 12:15
It looks like you are trying to match the results for Jharkhand. In that case, you need to change the weight from v005 to sv005, which is necessary every time you are analyzing an individual state. As you can see below, the urban percentage for Jharkhand (29.9 percent) comes out exactly the same as the estimate in the Jharkhand state report.
. svy: tab bf1hour if age < 24 & midx == 1 & v102 == 1 & v024 == 15
(running tabulate on estimation sample)
Number of strata = 24 Number of obs = 852
Number of PSUs = 286 Population size = 881.91211
Design df = 262
-----------------------
RECODE of |
m34 (when |
child put |
to |
breast) | proportions
----------+------------
0 | .7011
1 | .2989
|
Total | 1
-----------------------
Key: proportions = cell proportions
]]>Pooja Arora2019-02-17T15:34:38-00:00Re: Mismatch between National report and NFHS4 raw data analysis
https://userforum.dhsprogram.com/index.phpindex.php?t=rview&goto=16692&th=7505#msg_16692
Muneer Kalliyil2019-02-19T13:58:04-00:00Re: Mismatch between National report and NFHS4 raw data analysis
https://userforum.dhsprogram.com/index.phpindex.php?t=rview&goto=16694&th=7505#msg_16694
Following is a response from Senior DHS Stata Specialist, Tom Pullum:
Yes, v022 is the stratum id at the state level as well as at the national level. The strata that are outside the state will be ignored. ]]>Bridgette-DHS2019-02-19T16:44:22-00:00