The DHS Program User Forum
Discussions regarding The DHS Program data and results
Home » Data » Dataset use in Stata » Imputation of missing data
Re: Imputation of missing data [message #29921 is a reply to message #29911] Thu, 22 August 2024 12:28 Go to previous message
Bridgette-DHS is currently offline  Bridgette-DHS
Messages: 3190
Registered: February 2013
Senior Member

Following is a response from Senior DHS staff member, Tom Pullum:

In this kind of a situation the researcher (you) must make a "judgment call" about what to do. I can suggest potential strategies. One possibility is to drop those cases. Naturally, we don't like to do that. Another would be to assign these men the mean or median or modal value for all men or for all men in some subpopulation that includes these men, such as their district. I would not recommend anything more complicated. Elaborate methods, such as multiple imputation, do exist, but with only 41 cases that would be a waste of effort.

Whatever you do, it would be good to include a comment or footnote describing it, so someone could potentially match your results. If you look at the tables in DHS final reports, you will sometimes find a footnote that says what was done with missing (distinct from Not Applicable) values.

You can also look at whether any estimates appear to change or differ, depending on how you handled such cases. With only 41 questionable cases, you will probably find that the results are not sensitive to whatever option you choose.


 
Read Message
Read Message
Previous Topic: Poolled logistic regression
Next Topic: Issues with Pooled Multi-Country DHS Data
Goto Forum:
  


Current Time: Sat Nov 9 13:51:45 Coordinated Universal Time 2024