The DHS Program User Forum
Discussions regarding The DHS Program data and results
Home » Topics » Water and Sanitation » District-Level WaSH Indicators (District-level Indicators, SEs, and Bootstrapping)
District-Level WaSH Indicators [message #28738] Fri, 01 March 2024 08:45 Go to previous message
Mikaela22 is currently offline  Mikaela22
Messages: 1
Registered: March 2024
Member
Hello!

Project: I am combining DHS WaSH indicators from the 2011 Mozambique DHS with data from a cluster-randomised trial in Mozambique assessing the performance of various treatment strategies on Schistosomiasis prevalence. I am attempting to model individual-level infection status after 5-years of mass-drug administration to see if there is any effect modification of the treatment strategy (villages were randomised to different treatment strategies) by different WaSH indicators at the district-level, specifically using an improved water / sanitation source.

I will be using multi-level logistic regression to capture the clustering of the data i.e., (1) individuals in (2) villages (the treatment-level) in (3) districts.

The cluster-randomised trial was conducted in one province in Mozambique, so I am only working with 8 districts and attempting to calculate a district-level indicator e.g., percentage of households in that district using an improved water source. I have used GPS data to locate the clusters in corresponding districts and have followed the suggested methodology (the complex sample design weighting) to generate estimates. However, as has been extensively discussed previously, the SEs are too large to be usable.

I propose the following methodology to resolve this and would appreciate some input:
- Use a bootstrap (I saw a link to a wild bootstrap mentioned in a previous post?) to calculate more precise standard errors - how would I go about using the sampling weights here?
- Use weights within the multi-level logistic regression model to account for the uncertainty around the district-level estimates.

I understand that using DHS data in this way to generate district-level indicators is not ideal, however, this project is more for hypothesis generation and identifying areas for future research.

Do you have any comments on what I have proposed, or is there anything else I should be thinking about in terms of using this data and conducting this analysis in the best way?

I appreciate any feedback!

Kind regards!
 
Read Message
Read Message
Previous Topic: Are the area clusters for one country the same across years?
Next Topic: Improved water and sanitation variable definitions
Goto Forum:
  


Current Time: Fri Nov 29 06:54:29 Coordinated Universal Time 2024