The DHS Program User Forum
Discussions regarding The DHS Program data and results
Home » Data » Sampling » Combining individuals DHS datasets in Python (MR or IR with AR: cluster, stratum and weights in Python)
Combining individuals DHS datasets in Python [message #17951] Thu, 25 July 2019 08:22
Triphon is currently offline  Triphon
Messages: 1
Registered: July 2019
Member
Hi,

I am merging MR and IR datasets with HIV status (AR) for multiple countries. From what I've read, weights are hiv05 (preferred over v005/mv005), cluster (id) is v021/mv021 and stratum depends but let's say v023/mv023.
I am coding in Python (for various reasons) and there is no package/library such as svydesign in R. I could use R and then back to Python but switching from pandas categorical to R factor is not straightforward (data type not string). In Python, I can handle the weights, but not the cluster and the stratum (to my knowledge). Does someone knows what would be the best option here? For prediction and variables selection purposes, not taking into account cluster and stratum would be problematic?
Previous Topic: Enumerations Areas in Armenia DHS 2005 and 2010
Goto Forum:
  


Current Time: Sun Aug 25 14:37:41 Eastern Daylight Time 2019