The DHS Program User Forum
Discussions regarding The DHS Program data and results
Home » Data » Weighting data » Selecting sample within one standard deviation in R (Selecting sample within one standard deviation (help with R))
Re: Selecting sample within one standard deviation in R [message #24066 is a reply to message #24064] Tue, 15 February 2022 16:52 Go to previous message
Bridgette-DHS is currently offline  Bridgette-DHS
Messages: 3036
Registered: February 2013
Senior Member

Following is a response from DHS Research & Data Analysis Director, Tom Pullum:

You have an extremely skewed distribution. The "68%" rule works for normally distributed variables, and the normal approximation doesn't work for your variable. I can think of two options. One would be to take the log of the frequency, which will have a distribution that is more nearly normal, but there's the problem that you can't take the log of 0. Another option would be to calculate the percentiles of the distribution. If you identify the 25th and 75th percentiles, then you have the boundaries for the middle 50%. Or identify the 16th and 84th percentiles, which enclose the middle 68%.

 
Read Message
Read Message
Previous Topic: Seeking help in level1 and level2 weight generation
Next Topic: Calculating level weight for multicounty data - determining level of alpha to use
Goto Forum:
  


Current Time: Tue Apr 23 03:54:51 Coordinated Universal Time 2024