Regional variable v024 [message #28999]
Tue, 09 April 2024 07:47
eugenio.cassese@uniboccon
Messages: 2 Registered: April 2024
Member
I am working with DHS data from several countries (the individual surveys), and I often find that the regional variable v024 (and other variables, such as religion) contains values that have no label. For example, in Bangladesh v024 is supposed to run from 1 to 6, but tabulating the variable gives this result:
     region |      Freq.     Percent        Cum.
------------+-----------------------------------
   barishal |     12,084       11.23       11.23
 chittagong |     17,964       16.70       27.93
      dhaka |     21,921       20.38       48.31
     khulna |     15,446       14.36       62.67
   rajshani |     18,963       17.63       80.29
          6 |     12,253       11.39       91.68
          7 |      6,719        6.25       97.93
          8 |      2,229        2.07      100.00
------------+-----------------------------------
      Total |    107,579      100.00
Are these human errors made when compiling the datasets, or are there other regions that are not listed in the dictionary? If so, where can I find such a list?
P.S.: The question extends to any other variable for which this scenario might occur.
Many thanks in advance!
Re: Regional variable v024 [message #29009 is a reply to message #28999]
Wed, 10 April 2024 09:03
Janet-DHS
Messages: 891 Registered: April 2022
Senior Member
Following is a response from DHS staff member, Tom Pullum:
There have been several previous posts on what happens when you append files from different surveys, even within the same country. I believe the problem is that you have combined (appended) files with different labels for v024. When you do this, the last label will overwrite the earlier labels. In your example, it appears that one of the surveys included regions numbered 6, 7, and 8, but the label for the last file in the append DID NOT include those regions. As a result, categories 6, 7, and 8 have numerical codes but no labels.
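The effect described above can be illustrated with a minimal sketch. It is written in plain Python purely to show the logic (the thread itself concerns Stata), and the region names, codes, and the `tabulate` helper are all hypothetical, not taken from any actual DHS file:

```python
from collections import Counter

# Hypothetical value labels for v024 in two surveys of the same country.
# Survey A defines eight regions; survey B defines only five.
labels_survey_a = {1: "region_1", 2: "region_2", 3: "region_3", 4: "region_4",
                   5: "region_5", 6: "region_6", 7: "region_7", 8: "region_8"}
labels_survey_b = {1: "region_1", 2: "region_2", 3: "region_3", 4: "region_4",
                   5: "region_5"}

# Hypothetical v024 codes after appending the two surveys.
codes_a = [1, 6, 7, 8, 6]
codes_b = [1, 2, 5]
appended_codes = codes_a + codes_b


def tabulate(codes, labels):
    """Count each code, showing its label when one exists and the bare code otherwise."""
    counts = Counter(codes)
    return {labels.get(code, str(code)): n for code, n in sorted(counts.items())}


# Assumption for this sketch: only ONE label definition survives the append
# (here survey B's), so codes 6-8 from survey A are left without labels.
surviving_labels = labels_survey_b
table = tabulate(appended_codes, surviving_labels)
print(table)
```

Codes 6, 7, and 8 appear as bare numbers in the resulting table because the surviving label definition has no entry for them, which is the same pattern as in the tabulation in the original question.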
One way to handle this is to give a new name to v024 in each survey, such as v024_BD7R, and a new name to the label. After you have your final file, you can do a recode which synthesizes the different v024_* variables into a single variable. But sometimes a synthesis is impossible.
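A rough sketch of that approach follows, again in plain Python only to show the logic. The survey identifiers, the crosswalk contents, and the `harmonize` function are hypothetical; a real crosswalk has to be built from the labels actually found in each survey:

```python
# Hypothetical crosswalks: for each survey, map its own v024 code to a
# harmonized region name. A code maps to None where no synthesis is possible.
CROSSWALKS = {
    "survey_a": {1: "north", 2: "south", 3: "east"},
    "survey_b": {1: "north", 2: "east", 3: "south", 4: None},
}


def harmonize(records):
    """Keep the survey-specific code under its own name and add a harmonized region."""
    result = []
    for rec in records:
        survey = rec["survey"]
        code = rec["v024"]
        result.append({
            "survey": survey,
            # Survey-specific copy, analogous to naming the variable v024_<survey>.
            f"v024_{survey}": code,
            # Harmonized value; None when the code is unknown or cannot be mapped.
            "region_harmonized": CROSSWALKS[survey].get(code),
        })
    return result


records = [
    {"survey": "survey_a", "v024": 2},
    {"survey": "survey_b", "v024": 2},
    {"survey": "survey_b", "v024": 4},
]
harmonized = harmonize(records)
for row in harmonized:
    print(row)
```

Keeping the survey-specific code alongside the harmonized value makes it possible to trace any recoded region back to its original category, and rows left as `None` mark the cases where a synthesis was not possible.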
Many other variables, such as source of drinking water, types of health facilities for treating child illness, foods, etc., are coded differently in different surveys, and must be recoded. You have to check the actual labels in each survey, for example with "label list V024". Sometimes the codes will shift even if the names of the regions are stable.
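Once the labels from each survey have been listed, the comparison can be automated. The sketch below is plain Python with hypothetical label sets and a hypothetical `compare_labels` helper; it only shows one way to flag codes whose meaning differs between surveys, including the case where region names are stable but the codes shift:

```python
def compare_labels(label_sets):
    """Return {code: {survey: label_or_None}} for codes not labeled identically everywhere.

    label_sets maps a survey identifier to that survey's {code: label} dictionary.
    """
    all_codes = sorted(set().union(*[set(labels) for labels in label_sets.values()]))
    differences = {}
    for code in all_codes:
        per_survey = {survey: labels.get(code) for survey, labels in label_sets.items()}
        # A code is flagged when surveys disagree on its label or one lacks it.
        if len(set(per_survey.values())) > 1:
            differences[code] = per_survey
    return differences


# Hypothetical label sets: the region names are stable, but survey_b inserts
# a new region at code 2, which shifts "south" from code 2 to code 3.
label_sets = {
    "survey_a": {1: "north", 2: "south"},
    "survey_b": {1: "north", 2: "east", 3: "south"},
}
differences = compare_labels(label_sets)
print(differences)
```

Any code that appears in the result needs a manual decision before the surveys are pooled.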
Re: Regional variable v024 [message #29073 is a reply to message #29030]
Fri, 19 April 2024 16:58
Janet-DHS
Messages: 891 Registered: April 2022
Senior Member
Following is a response from DHS staff member, Tom Pullum:
The v variables (variables with prefix v, hv, or mv) are fixed (so far as I know). They will mean the same thing in every survey. However, the specific codes can be different. Source of drinking water would be an example. It has the same variable name in every survey but the codes and categories can be different. This generalization would apply to almost all variables except the survey-specific variables that begin with "s".
As a general rule, the more recent the surveys, the greater the standardization. A good thing about the IPUMS files is that their staff have checked very carefully for potential inconsistencies and have developed even more consistent labels and codes than in the DHS files.