Appending Multi-phase Nigerian DHS Surveys [message #29961]
Fri, 30 August 2024 03:41
Oby
Hello,
I'm working with multiple NDHS datasets from different survey years (2003, 2008, 2013, and 2018) and have encountered some challenges. I'd appreciate your guidance on the following issues:
1. Inconsistent Value Labels Across Years:
When attempting to append these datasets using bind_rows(), I receive warnings about conflicting value labels for certain variables.
Question: Should I convert these labelled variables to factors using a function like as_factor() before appending, or is there a better approach to standardize value labels across these different datasets? What is the best practice for ensuring that the labels are consistent before appending?
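One way to picture the as_factor() route is the sketch below. It is only an illustration under assumptions: the two toy data frames stand in for the real IR files, the label texts are made up, and it presumes that categories which mean the same thing can be matched on their (case-normalised) label text. Where label wording genuinely differs between years, a manual recode would still be needed.

```r
library(haven)
library(dplyr)

# Toy stand-ins for two survey years; codes and label texts are invented for illustration
ir_2003 <- tibble(v025 = labelled(c(1, 2, 1), labels = c(Urban = 1, Rural = 2)))
ir_2018 <- tibble(v025 = labelled(c(2, 1),    labels = c(urban = 1, rural = 2)))

# Hypothetical helper: replace codes by label text, normalise case, tag the survey year
to_factor <- function(df, year) {
  df %>%
    mutate(across(where(is.labelled), ~ as_factor(.x, levels = "labels"))) %>%
    mutate(across(where(is.factor), ~ factor(tolower(as.character(.x))))) %>%
    mutate(survey = year)
}

# After conversion the columns are plain factors, so there are no value labels left to conflict
pooled <- bind_rows(to_factor(ir_2003, 2003), to_factor(ir_2018, 2018))
table(pooled$v025, pooled$survey)
```

Keeping a `survey` column at this stage also makes it possible to tell the phases apart later when building cluster and strata identifiers.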
2. Creating a Survey Design with Different Sampling Designs:
Each of the four surveys has a different sampling design. After appending the datasets, I need to create a combined survey design object for analysis.
Questions: a.) How should I go about creating a survey design object that appropriately accounts for the different sampling designs across the four surveys?
b.) Are there other specific adjustments or considerations I need to make when combining these datasets for analysis?
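For concreteness, a pooled design object could look roughly like the sketch below, using the survey package. Everything here is an assumption for illustration: the data are toy values, `stratum` is a placeholder for whichever stratification variable applies in each survey, and the cluster and stratum identifiers are simply made unique per survey by crossing them with the survey year. It does not settle whether the weights should be rescaled across surveys.

```r
library(survey)

# Toy pooled data: two surveys whose cluster (v021) and stratum numbering both restart at 1
pooled <- data.frame(
  survey  = rep(c(2013, 2018), each = 8),
  v021    = rep(rep(1:4, each = 2), times = 2),  # cluster number within each survey
  stratum = rep(rep(1:2, each = 4), times = 2),  # placeholder for the survey's stratum variable
  v005    = rep(c(900000, 1100000), times = 8),  # DHS sample weight (six implied decimals)
  y       = c(1, 0, 1, 1, 0, 0, 1, 0, 1, 1, 0, 1, 0, 1, 1, 0)
)

pooled$wt        <- pooled$v005 / 1e6
# Cross with survey year so that cluster 1 in 2013 is not confused with cluster 1 in 2018
pooled$psu_id    <- interaction(pooled$survey, pooled$v021, drop = TRUE)
pooled$strata_id <- interaction(pooled$survey, pooled$stratum, drop = TRUE)

des <- svydesign(ids = ~psu_id, strata = ~strata_id, weights = ~wt, data = pooled)

# Estimates by survey phase
svyby(~y, ~survey, des, svymean)
```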
Thank you for your assistance! I look forward to your advice on these issues.
Regards,
Oby
Re: Appending Multi-phase Nigerian DHS Surveys [message #30150 is a reply to message #30135]
Fri, 04 October 2024 12:52
Oby
Hi Bridgette,
I have a follow-up question about the survey design (please note that I am working with the IR files of the Nigerian DHS for 2003, 2008, 2013 and 2018). I would like clarification on the correct approach for creating the strata_id for my pooled dataset. Based on my understanding of the survey reports, the stratification in all four surveys seems to be based on urban and rural areas within each state. However, v023 for 2003 is based on region and urban/rural residence, and for 2008 it is based on states only. Should I use v023 as it is?
I am considering constructing the strata_id in my pooled dataset by combining state (sstate) and urban/rural residence (v025) for all four surveys and then grouping that with the survey variable to create a unique strata_id for each survey phase.
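A minimal sketch of that construction, in base R, is shown below. The data frame is a toy stand-in for the pooled IR file, and the sketch only demonstrates how the identifier would be built and counted; it does not confirm that state by residence is the right stratification for every survey.

```r
# Toy stand-in for the pooled dataset (values invented for illustration)
pooled <- data.frame(
  survey = c(2003, 2003, 2008, 2008, 2018, 2018),
  sstate = c(1, 1, 1, 2, 1, 2),
  v025   = c(1, 2, 1, 1, 1, 2)   # 1 = urban, 2 = rural
)

# One stratum per observed combination of survey, state and urban/rural residence
pooled$strata_id <- as.integer(
  interaction(pooled$survey, pooled$sstate, pooled$v025, drop = TRUE)
)

# Number of distinct strata within each survey, useful for checking against the survey reports
strata_per_survey <- tapply(pooled$strata_id, pooled$survey,
                            function(x) length(unique(x)))
strata_per_survey
```

Because the survey year is part of the identifier, the same state and residence combination gets a different strata_id in each phase.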
Could you please confirm if this is the correct approach to creating the strata_id for the pooled dataset, or if there is another method?
Thank you for your assistance.
[Updated on: Fri, 04 October 2024 12:55]
Re: Appending Multi-phase Nigerian DHS Surveys [message #30159 is a reply to message #30156]
Mon, 07 October 2024 12:10
Oby
Thank you for your response, Bridgette. The do file you provided says I should use v023 for 2003 and v022 for 2008, 2013 and 2018. However, for 2013 and 2018, v022 is not the same as combining v024 and v025 (i.e., region and rural/urban residence); rather, it is the combination of v024, sstate and v025 (i.e., region, state and rural/urban residence).
Also, for 2013 I noticed that there are 73 strata instead of 74. Nigeria has 37 states (including the capital territory), so crossing state with v025 should ordinarily result in 37 × 2 = 74 strata, as in 2018.
So I'm a bit confused as to whether I should use v023 (for 2003) and v022 (for 2008, 2013 and 2018) as they are.