Home » Topics » General » Regional variable v024 
	
		
		
			| Regional variable v024 [message #28999] | 
			Tue, 09 April 2024 07:47   | 
		 
		
			
				
				
				
					
						  
						eugenio.cassese@uniboccon
						 Messages: 2 Registered: April 2024 
						
					 | 
					Member  | 
					 | 
		 
		 
	 | 
 
	
		I am working with dhs data of several country (the individual surveys) and I often find that the regional variable v024 (and even other variables such as religion) sometimes present values that are not codified. E.g., in Bangladesh the v024 variable is supposed to go from 1 to 6, but tabulating the variable this is the result I get: 
 
     region |      Freq.     Percent        Cum. 
------------+----------------------------------- 
   barishal |     12,084       11.23       11.23 
 chittagong |     17,964       16.70       27.93 
      dhaka |     21,921       20.38       48.31 
     khulna |     15,446       14.36       62.67 
   rajshani |     18,963       17.63       80.29 
          6 |     12,253       11.39       91.68 
          7 |      6,719        6.25       97.93 
          8 |      2,229        2.07      100.00 
------------+----------------------------------- 
      Total |    107,579      100.00 
 
 
Are these human errors when compiling datasets or are there perhaps other regions that are not indicated in the dictionary? If so, where can I find such a list?  
P.s.: the question extends to all other variables for which this scenario might occur. 
 
Many thanks in advance! 
		
		
		
 |  
	| 
		
	 | 
 
 
 |  
	
		
		
			| Re: Regional variable v024 [message #29009 is a reply to message #28999] | 
			Wed, 10 April 2024 09:03    | 
		 
		
			
				
				
				
					
						  
						Janet-DHS
						 Messages: 938 Registered: April 2022 
						
					 | 
					Senior Member  | 
					 | 
		 
		 
	 | 
 
	
		Following is a response from DHS staff member, Tom Pullum: 
 
There have been several previous posts on what happens when you append files from different surveys, even within the same country.  I believe the problem is that you have combined (appended) files with different labels for v024.  When you do this, the last label will over-write the earlier labels. In your example, it appears that one of the surveys included regions numbered 6, 7, and 8, but the label for the last file in the append DID NOT include those regions.  As a result, categories 6, 7, and 8 have numerical codes but no labels. 
 
One way to handle this is to give a new name to v024 in each survey, such as v024_BD7R, and a new name to the label. After you have your final file, you can do a recode which synthesizes the different v024_* variables into a single variable.  But sometimes a synthesis is impossible. 
 
Many other variables, such as source of drinking water, types of health facilities for treating child illness, foods, etc., are coded differently in different surveys, and must be recoded. You have to check the actual labels in each survey, for example with "label list V024". Sometimes the codes will shift even if the names of the regions are stable.   
		
		
		
 |  
	| 
		
	 | 
 
 
 |  
	| 
		
 |  
	
		
		
			| Re: Regional variable v024 [message #29073 is a reply to message #29030] | 
			Fri, 19 April 2024 16:58   | 
		 
		
			
				
				
				
					
						  
						Janet-DHS
						 Messages: 938 Registered: April 2022 
						
					 | 
					Senior Member  | 
					 | 
		 
		 
	 | 
 
	
		Following is a response from DHS staff member, Tom Pullum: 
 
The v variables (variables with prefix v, hv, or mv) are fixed (so far as I know).  They will mean the same thing in every survey.  However, the specific codes can be different.  Source of drinking water would be an example.  It has the same variable name in every survey but the codes and categories can be different.  This generalization would apply to almost all variables except the survey-specific variables that begin with "s". 
 
As a general rule, the more recent the surveys, the greater the standardization.  A good thing about the IPUMS files is that that their staff have checked very carefully for potential inconsistencies and have developed even more consistent labels and codes than in the DHS files.
		
		
		
 |  
	| 
		
	 | 
 
 
 |   
Goto Forum:
 
 Current Time: Tue Nov 4 07:17:41 Coordinated Universal Time 2025 
 |