The DHS Program User Forum
Discussions regarding The DHS Program data and results
Mismatch in educational attainment (mv/v149) and education in single years (mv/v133) in the 2019-21 recode file for India.
Mismatch in educational attainment and education in single years [message #28449] Tue, 09 January 2024 07:08
Navya
Dear all,

I am currently working with the men's and individual's recode files for the 2019-21 survey wave for India, and I noticed that there may be some inconsistency between the educational attainment variable (mv/v149) and education in single years variable (mv/v133) for both files. From what I understand, (m)v133 gives the single years of education for the individual, going from 0 years to 20 years for both files, and (m)v149 categorizes these years into "no education", "incomplete primary", "incomplete secondary", "complete secondary", and "higher". According to this, individuals who completed 11 years of education should be categorized into "incomplete secondary" and those with 12 years should be categorized into "complete secondary". However, this result seems to be flipped (individuals with 11 years are being categorized into "complete secondary" and those with 12 years are under "incomplete secondary" for both files). I have attached a screenshot of the tabulate result I get in Stata for the individual's recode file. I only see this result in the 2019-21 survey datasets for India, and not in the older datasets. Am I missing something here?

Thank you for your help.
Re: Mismatch in educational attainment and education in single years [message #28452 is a reply to message #28449] Tue, 09 January 2024 09:20
Bridgette-DHS
Following is a response from Senior DHS staff member, Tom Pullum:


I'll describe the variables for men in the MR file, with the "mv" prefix. The same applies for women in the IR file, with the "v" prefix.

There are four relevant variables for education:

. describe mv106 mv107 mv133 mv149

              storage   display    value
variable name   type    format     label      variable label
-----------------------------------------------------------------------------------------
mv106           byte    %8.0g      MV106      educational level
mv107           byte    %8.0g      MV107      highest year of education (at level in mv106)
mv133           byte    %8.0g      MV133      total number of years of education
mv149           byte    %8.0g      MV149      educational attainment

mv149 and mv106 are almost exactly the same. The only difference is that mv149 distinguishes incomplete from complete within a level; in the table below, secondary is split into incomplete and complete:

. tab mv149 mv106

         educational |              educational level
          attainment | no educat    primary  secondary     higher |     Total
---------------------+--------------------------------------------+----------
        no education |    12,269          0          0          0 |    12,269 
  incomplete primary |         0     11,710          0          0 |    11,710 
incomplete secondary |         0          0     55,807          0 |    55,807 
  complete secondary |         0          0      4,211          0 |     4,211 
              higher |         0          0          0     17,842 |    17,842 
---------------------+--------------------------------------------+----------
               Total |    12,269     11,710     60,018     17,842 |   101,839

mv133 is a recode of mv106 and mv107. The following table gives the mean of mv133 for each combination of mv106 and mv107. (Ignore the "Total" column and row.) 

. tab mv107 mv106, summarize(mv133) means

                Means of total number of years of education

   highest |
   year of |
education |
(at level |       educational level
in mv106) |   primary  secondary     higher |     Total
-----------+---------------------------------+----------
no years  |         0          .          . |         0
         1 |         1          6         13 | 8.5814143
         2 |         2          7         14 | 8.1771985
         3 |         3          8         15 |  10.65447
         4 |         4          9         16 | 8.6599261
         5 |         5         10         17 | 9.5647399
         6 |         .         11         18 | 11.566783
         7 |         .         12         19 | 12.099424
         8 |         .          .         20 |        20
-----------+---------------------------------+----------
     Total | 3.9836038  9.5509347  15.147237 | 9.9378475
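
For readers who want to check this recode themselves, here is a minimal Stata sketch on a few made-up rows. The offsets 5 and 12 are simply read off the table of means above (secondary year k gives 5+k, higher year k gives 12+k); they are an assumption about this file, not documented recode logic, and yrs_check is a hypothetical variable name.

```stata
* Sketch only: rebuild total years of education from level and year at level.
* The offsets 5 and 12 are inferred from the table of means, not from DHS documentation.
clear
input byte mv106 byte mv107
0 0
1 3
2 6
2 7
3 1
end

* yrs_check is a hypothetical name for the reconstructed variable
generate byte yrs_check = .
* no education
replace yrs_check = 0 if mv106 == 0
* primary: years 0 to 5
replace yrs_check = mv107 if mv106 == 1
* secondary: years 6 to 12
replace yrs_check = 5 + mv107 if mv106 == 2
* higher: years 13 to 20
replace yrs_check = 12 + mv107 if mv106 == 3

list, noobs
```

On the real MR file, skipping the clear/input step and running "assert yrs_check == mv133" afterwards would show whether mv133 follows this rule for every case.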

Hope this answers your question. Please let us know if it does not.

Re: Mismatch in educational attainment and education in single years [message #28520 is a reply to message #28452] Wed, 24 January 2024 01:20
Navya
Hi,

Thank you so much for your quick response. I am still having some trouble reconciling the values for (m)v149 (educational attainment) with (m)v107 (highest year of education (at level in (m)v106)) and (m)v133 (total number of years of education).

. describe mv106 mv107 mv133 mv149

Variable      Storage   Display    Value
    name         type    format    label      Variable label
---------------------------------------------------------------------------------------------------------------------------------------------------
mv106           byte    %8.0g      MV106      educational level
mv107           byte    %8.0g      MV107      highest year of education (at level in mv106)
mv133           byte    %8.0g      MV133      total number of years of education
mv149           byte    %8.0g      MV149      educational attainment

. tab mv107 mv149

      highest year of |
  education (at level |           educational attainment
            in mv106) | incomplet  incomplet  complete      higher |     Total
----------------------+--------------------------------------------+----------
no years completed at |       239          0          0          0 |       239 
                    1 |       366      2,470          0      2,071 |     4,907 
                    2 |     1,172      4,915          0      2,237 |     8,324 
                    3 |     1,636     10,124          0      9,066 |    20,826 
                    4 |     2,455     11,970          0      1,004 |    15,429 
                    5 |     5,842     13,488          0      2,797 |    22,127 
                    6 |         0          0      4,211        371 |     4,582 
                    7 |         0     12,840          0        185 |    13,025 
                    8 |         0          0          0        111 |       111 
----------------------+--------------------------------------------+----------
                Total |    11,710     55,807      4,211     17,842 |    89,570 

It seems to me that in the above table, the figure 4,211 should come under "incomplete secondary" and the figure 12,840 should come under "complete secondary". I see a similar result with mv133 and mv149 as well.

. tab mv133 mv149

total number |
 of years of |                 educational attainment
   education | no educat  incomplet  incomplet  complete      higher |     Total
-------------+-------------------------------------------------------+----------
           0 |    12,269        239          0          0          0 |    12,508 
           1 |         0        366          0          0          0 |       366 
           2 |         0      1,172          0          0          0 |     1,172 
           3 |         0      1,636          0          0          0 |     1,636 
           4 |         0      2,455          0          0          0 |     2,455 
           5 |         0      5,842          0          0          0 |     5,842 
           6 |         0          0      2,470          0          0 |     2,470 
           7 |         0          0      4,915          0          0 |     4,915 
           8 |         0          0     10,124          0          0 |    10,124 
           9 |         0          0     11,970          0          0 |    11,970 
          10 |         0          0     13,488          0          0 |    13,488 
          11 |         0          0          0      4,211          0 |     4,211 
          12 |         0          0     12,840          0          0 |    12,840 
          13 |         0          0          0          0      2,071 |     2,071 
          14 |         0          0          0          0      2,237 |     2,237 
          15 |         0          0          0          0      9,066 |     9,066 
          16 |         0          0          0          0      1,004 |     1,004 
          17 |         0          0          0          0      2,797 |     2,797 
          18 |         0          0          0          0        371 |       371 
          19 |         0          0          0          0        185 |       185 
          20 |         0          0          0          0        111 |       111 
-------------+-------------------------------------------------------+----------
       Total |    12,269     11,710     55,807      4,211     17,842 |   101,839 
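
The pattern in the table above can also be flagged case by case. This is a rough sketch on made-up rows. It assumes the standard DHS coding of mv149 (3 = incomplete secondary, 4 = complete secondary) and the reading that 6 to 11 years should be incomplete secondary and 12 years complete secondary. The variables expected and mismatch are hypothetical names.

```stata
* Sketch only: flag cases where mv149 disagrees with what mv133 would imply.
* Assumes mv149 codes 3 = incomplete secondary and 4 = complete secondary.
clear
input byte mv133 byte mv149
10 3
11 4
12 3
13 5
end

* expected category under the assumed rule, for the secondary range only
generate byte expected = .
replace expected = 3 if inrange(mv133, 6, 11)
replace expected = 4 if mv133 == 12

* mismatch is 1 when the stored category differs from the expected one
generate byte mismatch = expected != mv149 if !missing(expected)
tabulate mv133 mv149 if mismatch == 1
```

With the real file, dropping the clear/input step should list only the 11-year and 12-year rows if the swap is the only inconsistency in the secondary range.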


I am seeing similar results for the individual recode file as well. Please let me know if I'm missing something.

Thank you for your help!
Re: Mismatch in educational attainment and education in single years [message #28524 is a reply to message #28520] Wed, 24 January 2024 13:31
Bridgette-DHS
Following is a response from Senior DHS staff member, Tom Pullum:

You are asking about the Kenya 2022 survey. An erratum is being issued on the coding of education in this survey.

I suggest that you look at the relationship among v106, v107, v133, and v149 in the previous survey, the Kenya 2014 survey, because the relationship was ok in that survey. In the 2014 survey, you will see how v133 is supposed to be constructed as a recode of v106 and v107. In the IR file, do this:

. tab v107 v106, summarize(v133) means

                    Means of education in single years

   highest |
   year of |   highest educational level
education |   primary  secondary     higher |     Total
-----------+---------------------------------+----------
no years  |         0          8         12 | 8.7214076
         1 |         1          9         13 | 9.2506473
         2 |         2         10         14 | 10.554584
         3 |         3         11         15 | 9.9238281
         4 |         4         12         16 | 11.115502
         5 |         5          .         17 | 5.3082022
         6 |         6         14         18 | 6.2527945
         7 |         7          .         19 | 7.0332318
         8 |         8          .         20 | 8.0017804
-----------+---------------------------------+----------
     Total | 6.6739256  10.794648  14.333705 | 8.7562835

Ignore the "Total" row and column. You will see that each combination of v106 and v107 gives a value of v133. (Some values arise from more than one combination: in this table v133=8, v133=12, and v133=14 each come from two.) That is, v133 is a recode of v106 and v107.

If you enter "tab v106 v149" you will see that v149 is a recode of v106 that additionally distinguishes whether the person completed the primary or secondary level. That distinction is based on the value of v107 within each level of v106.

That's all there is to it. v106 and v107 are the key variables; v133 and v149 are constructed variables and they were constructed incorrectly in the Kenya 2022 survey. You can either reconstruct them yourself, for your use, or wait for the next version of the Kenya 2022 data files.
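
A possible way to reconstruct the attainment variable yourself is sketched below on made-up rows. The level lengths (5 years of primary, 7 years of secondary) are placeholders taken from the range of mv107 in the India tabulation earlier in this thread and must be set for the survey at hand. The category codes follow the standard DHS coding of v149 (0 to 5), and attain is a hypothetical variable name.

```stata
* Sketch only: rebuild educational attainment from v106 and v107.
* prim_len and sec_len are assumed level lengths; adjust them per survey.
clear
input byte v106 byte v107
0 0
1 3
1 5
2 6
2 7
3 2
end

local prim_len 5
local sec_len 7

* attain is a hypothetical name; codes mirror the standard v149 categories
generate byte attain = .
replace attain = 0 if v106 == 0
replace attain = 1 if v106 == 1 & v107 < `prim_len'
replace attain = 2 if v106 == 1 & v107 >= `prim_len' & !missing(v107)
replace attain = 3 if v106 == 2 & v107 < `sec_len'
replace attain = 4 if v106 == 2 & v107 >= `sec_len' & !missing(v107)
replace attain = 5 if v106 == 3

label define attain_lbl 0 "no education" 1 "incomplete primary" 2 "complete primary"
label define attain_lbl 3 "incomplete secondary" 4 "complete secondary" 5 "higher", add
label values attain attain_lbl
tabulate attain v106
```

Any special codes in v107 (for example a "don't know" code, if the file has one) would need to be set to missing first, since the sketch treats every non-missing value at or above the level length as "complete".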

Re: Mismatch in educational attainment and education in single years [message #28525 is a reply to message #28524] Wed, 24 January 2024 13:50
Navya
Hi,

Thank you for your quick response and all your help. I will construct the educational attainment indicator based on the key variables (m)v106 and (m)v107.
Re: Mismatch in educational attainment and education in single years [message #28974 is a reply to message #28449] Sat, 06 April 2024 15:06
fred.arnold@icf.com
I think that the problem may be that the NFHS education questions do not follow the DHS standard, in that there are no questions about the level of education in NFHS surveys. The only education question is "What is the highest grade you completed?" (not the highest grade completed at that level, since there are no levels). I'm not sure how the NFHS-5 variables are created from the available education questions, but they were probably based on the highest grade completed, with decisions being made about what level that grade might reflect.
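
To illustrate what such a decision rule could look like, here is a small Stata sketch on made-up rows that splits a single "highest grade completed" into a level and a year within that level. The cut-points (grades 1 to 5 primary, 6 to 12 secondary, 13 and above higher) are an assumption inferred from the tabulations earlier in this thread, not the documented NFHS-5 procedure. The names grade, level, and year_at_level are hypothetical.

```stata
* Sketch only: derive level and year-at-level from one highest-grade variable.
* Cut-points 5 and 12 are assumptions inferred from the tables in this thread.
clear
input byte grade
0
5
6
11
12
13
end

* level uses the same 0-3 coding as (m)v106
generate byte level = .
replace level = 0 if grade == 0
replace level = 1 if inrange(grade, 1, 5)
replace level = 2 if inrange(grade, 6, 12)
replace level = 3 if grade >= 13 & !missing(grade)

* year within the level: subtract the grades belonging to the lower levels
generate byte year_at_level = grade - cond(level == 3, 12, cond(level == 2, 5, 0))
list, noobs
```

Under these assumed cut-points, grade 11 becomes year 6 of secondary and grade 12 becomes year 7 of secondary, which matches the mv107 values seen in the tables above.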