The DHS Program User Forum
Discussions regarding The DHS Program data and results
Combining individual DHS datasets in Python [message #17951] Thu, 25 July 2019 08:22
Triphon
Messages: 1
Registered: July 2019

I am merging the MR and IR datasets with HIV status (AR) for multiple countries. From what I've read, the weight is hiv05 (preferred over v005/mv005), the cluster ID is v021/mv021, and the stratum variable depends on the survey, but let's say v023/mv023.
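To make the setup concrete, here is a minimal pandas sketch of the merge. It assumes the usual DHS linking keys (v001/v002/v003 in IR, mv001/mv002/mv003 in MR, and hivclust/hivnumb/hivline in AR) and that hiv05 is stored with six implied decimals, like the other DHS weights. The function name `merge_with_hiv` and the output columns `weight`, `psu` and `stratum` are hypothetical names chosen for illustration.

```python
import pandas as pd


def merge_with_hiv(recode: pd.DataFrame, ar: pd.DataFrame, prefix: str = "v") -> pd.DataFrame:
    """Attach HIV test records (AR) to an individual recode.

    prefix is "v" for the women's file (IR) and "mv" for the men's file (MR).
    Assumes the standard DHS variable names; check each survey's recode manual.
    """
    recode_keys = [f"{prefix}001", f"{prefix}002", f"{prefix}003"]  # cluster, household, line
    ar_keys = ["hivclust", "hivnumb", "hivline"]

    # Keep only respondents with an HIV test record; fail loudly on duplicate keys.
    merged = recode.merge(
        ar, left_on=recode_keys, right_on=ar_keys, how="inner", validate="one_to_one"
    )

    # Harmonised design columns, so IR and MR results can later be stacked.
    merged["weight"] = merged["hiv05"] / 1_000_000  # assumed six implied decimals
    merged["psu"] = merged[f"{prefix}021"]
    merged["stratum"] = merged[f"{prefix}023"]
    return merged
```

Calling it once with `prefix="v"` on IR and once with `prefix="mv"` on MR gives two frames that share the `weight`, `psu` and `stratum` columns and can be combined with `pd.concat`. When several countries are pooled, the cluster and stratum codes repeat across surveys, so they would need to be made unique first, for example by combining them with a survey identifier.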
I am coding in Python (for various reasons), and as far as I know there is no package or library equivalent to svydesign in R. I could do this step in R and then go back to Python, but converting a pandas categorical to an R factor is not straightforward (the data type is not string). In Python I can handle the weights but, to my knowledge, not the cluster and the stratum. Does anyone know what the best option would be here? For prediction and variable selection purposes, would ignoring the cluster and stratum be problematic?
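For a sense of what accounting for cluster and stratum involves, below is a rough sketch of a design-based standard error for a weighted mean, written with only numpy and pandas. It uses Taylor linearization with the common "PSUs sampled with replacement within strata" approximation and no finite population correction. This is one standard technique, not necessarily what svydesign does internally for every design. The function name `design_based_mean` and its argument names are hypothetical.

```python
import numpy as np
import pandas as pd


def design_based_mean(df: pd.DataFrame, y: str, weight: str, stratum: str, psu: str):
    """Weighted mean of column y and its linearized standard error.

    Variance is computed between PSU totals within each stratum
    (with-replacement approximation, no finite population correction).
    """
    d = df[[y, weight, stratum, psu]].dropna()
    w = d[weight].to_numpy(dtype=float)
    values = d[y].to_numpy(dtype=float)

    total_w = w.sum()
    mean = float((w * values).sum() / total_w)

    # Linearized score of the ratio estimator for each respondent.
    d = d.assign(_z=w * (values - mean) / total_w)

    # Sum the scores within each PSU; observed=True avoids empty
    # groups when the design columns are pandas categoricals.
    psu_totals = d.groupby([stratum, psu], observed=True)["_z"].sum()

    variance = 0.0
    for _, z in psu_totals.groupby(level=0, observed=True):
        n_h = len(z)  # number of PSUs in this stratum
        if n_h < 2:
            raise ValueError("each stratum needs at least two PSUs")
        variance += n_h / (n_h - 1) * float(((z - z.mean()) ** 2).sum())

    return mean, float(np.sqrt(variance))
```

With a data frame `df` holding a 0/1 HIV indicator in a column such as `hiv_positive` (a hypothetical name), the call would look like `design_based_mean(df, "hiv_positive", "weight", "stratum", "psu")`. Ignoring the cluster and stratum leaves the weighted point estimate unchanged; what changes is the standard error, which is why the design matters most for inference and for selection procedures that rely on p-values.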