The DHS Program User Forum
Discussions regarding The DHS Program data and results
Home » Topics » Biomarkers » Slight difference in HIV numbers in South Africa 2016 dataset (IR, MR) (Slight difference in HIV numbers within subgroups age, region, education, wealth)
Slight difference in HIV numbers in South Africa 2016 dataset (IR, MR) [message #21634] Tue, 01 December 2020 16:16 Go to previous message
evavanempel is currently offline  evavanempel
Messages: 1
Registered: December 2020
Dear DHS team,

I am using the South Africa 2016 men and women dataset, and merged the HIV biomarker dataset into them. I would like to ask for some help as the HIV numbers are slightly different from the SA 2016 report. The numbers are almost the same as in the report but within the age/region/education/wealth groups there sometimes is one (or a few) extra or missing person(s). I have attached my findings of the women dataset (+HIV biomarker) vs. the report for you to see where the numbers differ.

The steps I have taken:
- Rename variables and sort ascending by cluster (V001), number (V002) and line (V003)
- Merge HIV biomarker dataset into women's dataset
- Compute weight variables (V005 and HIV05) and divide by one million
- Turn on weight for HIV (HIV05). SPSS code:
o COMPUTE WGT_HIV= HIV05/1000000.
o Weight by WGT_HIV.

I am aware that I have to include 'inconclusive (HIV03=9)' in the HIV negative group. However, this does not seem to solve this issue.
I also tried these same steps with another dataset (Zambia 2018 men, women and HIV biomarker dataset) and there are always one or a few persons extra or missing within the age/region/education/wealth groups.

There seems to be a step that I am missing, would you be able to help me with this problem?

Thank you in advance!
Read Message
Read Message
Previous Topic: Locating glucose variables - South Africa 2016 DHS
Next Topic: BMI, Hypertension, Diabetes and Anemia Code for Ghana
Goto Forum:

Current Time: Mon Mar 1 06:08:56 Coordinated Universal Time 2021