The DHS Program User Forum
Discussions regarding The DHS Program data and results
Home » Topics » Wealth Index » How to handle continuous variable in constructing the index
How to handle continuous variable in constructing the index [message #8673] Sun, 29 November 2015 09:56 Go to next message
gyamada is currently offline  gyamada
Messages: 7
Registered: November 2014
Location: Baltimore
Member
Hi,

I am reviewing the document by Dr. Rutstein and SPSS syntax files, and I have questions about how to handle continuous variables in constructing wealth index.

My first question is about how to handle missing values of continuous variables. May I know why we use mean instead of median? I feel that land area and number of household per sleeping room are usually skewed and thought that median might be another option.

My next question is about number of household per sleeping room. May I know why the number is obtained by rounding down to the nearest integer instead of leaving the decimals?

Faithfully,

Goro
Re: How to handle continuous variable in constructing the index [message #8707 is a reply to message #8673] Thu, 03 December 2015 14:45 Go to previous messageGo to next message
Liz-DHS
Messages: 1381
Registered: February 2013
Senior Member
Dear User,
Here is a response from Dr. Shea Rutstein:
Quote:
1. Use of mean substitution instead of median:
a. For land size, most households have no land so the median may end up as zero. However, that variable is the result of two questions, whether the household owns agricultural land and if so, what the size of the land is. The missing and unknown responses for land size are for those with land from the first question and so the median should not be zero.
b. For number of household members per sleeping room, the distribution is not very skewed so the mean and median should produce similar results. In any case, the missing data would come from number of sleeping rooms, not number of household members, and almost all households have less than 10 sleeping rooms.
2. The use of integers for number of members per sleeping room is for convenience when tabulating the frequency distribution and means of wealth by number of members per sleeping room to check on the results. Alternatively, a continuous variable can be and has been used in a few surveys, but hardly any difference in results has been found.
3. One should remember that the wealth index is not an absolute measure of net worth, and is derived from a sample of households, and as such is subject to sampling variance and non-sampling error. Additionally, the wealth index also depends to a small extent on the assets, dwelling materials, amenities and other indicators chosen for the survey. Small differences in household wealth score due to handling of continuous variables will not invalidate the relative position of households. However, some non-linearity is taken into account of these variables for number of farm animals by type, and will be tried for land size as well.
Re: How to handle continuous variable in constructing the index [message #8749 is a reply to message #8707] Sat, 12 December 2015 03:46 Go to previous messageGo to next message
gyamada is currently offline  gyamada
Messages: 7
Registered: November 2014
Location: Baltimore
Member
Thank you, Dr. Rutstein, for your information!

Following up one of your comments on skewed data (number of farm animals and land area), I would like to know what type of consideration could be more commonly thought. Could log transformation or truncation at certain value/percentile could be an option?

Sincerely,

Goro
Re: How to handle continuous variable in constructing the index [message #14818 is a reply to message #8749] Mon, 07 May 2018 11:09 Go to previous message
sadya2018@gmail.com
Messages: 97
Registered: April 2018
Location: Ethiopia, in Africa
Senior Member
Thanks all!!

Hassen Ali Hamza (BSc in Public Health,Master of Public Health Candidate)
Previous Topic: Use of wealth score for regression
Next Topic: Merge DHS 1993 wealth index with women and household files
Goto Forum:
  


Current Time: Tue Nov 13 13:57:14 Eastern Standard Time 2018