Home » Topics » General » Regional variable v024
Regional variable v024 [message #28999] |
Tue, 09 April 2024 07:47  |
eugenio.cassese@uniboccon
Messages: 2 Registered: April 2024
|
Member |
|
|
I am working with dhs data of several country (the individual surveys) and I often find that the regional variable v024 (and even other variables such as religion) sometimes present values that are not codified. E.g., in Bangladesh the v024 variable is supposed to go from 1 to 6, but tabulating the variable this is the result I get:
region | Freq. Percent Cum.
------------+-----------------------------------
barishal | 12,084 11.23 11.23
chittagong | 17,964 16.70 27.93
dhaka | 21,921 20.38 48.31
khulna | 15,446 14.36 62.67
rajshani | 18,963 17.63 80.29
6 | 12,253 11.39 91.68
7 | 6,719 6.25 97.93
8 | 2,229 2.07 100.00
------------+-----------------------------------
Total | 107,579 100.00
Are these human errors when compiling datasets or are there perhaps other regions that are not indicated in the dictionary? If so, where can I find such a list?
P.s.: the question extends to all other variables for which this scenario might occur.
Many thanks in advance!
|
|
|
Re: Regional variable v024 [message #29009 is a reply to message #28999] |
Wed, 10 April 2024 09:03   |
Janet-DHS
Messages: 938 Registered: April 2022
|
Senior Member |
|
|
Following is a response from DHS staff member, Tom Pullum:
There have been several previous posts on what happens when you append files from different surveys, even within the same country. I believe the problem is that you have combined (appended) files with different labels for v024. When you do this, the last label will over-write the earlier labels. In your example, it appears that one of the surveys included regions numbered 6, 7, and 8, but the label for the last file in the append DID NOT include those regions. As a result, categories 6, 7, and 8 have numerical codes but no labels.
One way to handle this is to give a new name to v024 in each survey, such as v024_BD7R, and a new name to the label. After you have your final file, you can do a recode which synthesizes the different v024_* variables into a single variable. But sometimes a synthesis is impossible.
Many other variables, such as source of drinking water, types of health facilities for treating child illness, foods, etc., are coded differently in different surveys, and must be recoded. You have to check the actual labels in each survey, for example with "label list V024". Sometimes the codes will shift even if the names of the regions are stable.
|
|
|
|
|
Goto Forum:
Current Time: Fri Apr 4 01:30:11 Coordinated Universal Time 2025
|