The DHS Program User Forum
Discussions regarding The DHS Program data and results
Home » Topics » General » DHS-MIS (Inquiries Regarding Dataset)
DHS-MIS [message #28736] Fri, 01 March 2024 07:59 Go to next message
Sancho is currently offline  Sancho
Messages: 5
Registered: February 2024
Member
Hello everyone,

I hope you're all doing well.

I'm reaching out to request some support. I'm working with a database that has 40,394 observations. However, upon examining the report, I noticed that the total number of households is 15,705, from which information was obtained for 15,254 women of reproductive age, as mentioned in the report. Thus, I've come to understand that there are duplications in the database I possess and realized the need to remove them to reach the expected 15,254 observations.

However, my concern arose when I noticed that, upon attempting to remove the duplications to reach the expected number of surveyed women of reproductive age (15,254), the total observations drastically decreased to 257 when using the V002 variable, while the V003 variable showed 30 observations. When employing the CASEID variable, I obtained 11,116 observations.

I would like to better understand how I can overcome this issue and obtain an accurate count of the surveyed women of reproductive age.

Thank you in advance for your attention, and I hope to receive guidance on how to proceed regarding this matter.

Best regards,

Sancho Xavier
Re: DHS-MIS [message #28743 is a reply to message #28736] Fri, 01 March 2024 11:12 Go to previous messageGo to next message
Bridgette-DHS is currently offline  Bridgette-DHS
Messages: 3199
Registered: February 2013
Senior Member

Following is a response from Senior DHS Stata Specialist, Tom Pullum:

First, you did not identify the survey you are working with, but I believe it is the Tanzania 2022 DHS (which is not an MIS). 15,705 households were selected. These are the cases in the HR file. The number of people listed in the households, as either de jure or de facto residents, is 40,394. These are the cases in the PR file. They include men, women, children. Eligible women for the interviews are women age 15-49 who are de facto residents. Some households have no such women, some have 1, some have 2, etc. The total number of women who were eligible AND were interviewed is 15,254. These are the cases in the IR file. There is an average of about one woman per household, but the ratio of women to households does not have to be this close to 1 and it certainly does not need to be exactly 1. The cluster ID is v001. Households are numbered within clusters, with v002 as the household ID. Individuals within the household are given a line number, and that's v003. "caseid" is a string variable that includes columns for v001, v002, and v003. It takes 15,254 unique values.

If you open the IR file in Stata and enter "codebook v001 v002 v003 caseid" and "list v001 v002 v003 caseid if _n<=100, table clean" you will get a list of the ID variables for the first 100 women in the file. It may help you to see how the data are structured. As I said, there is no problem.

Re: DHS-MIS [message #28744 is a reply to message #28743] Fri, 01 March 2024 11:39 Go to previous messageGo to next message
Sancho is currently offline  Sancho
Messages: 5
Registered: February 2024
Member
I'm working with the TZBR82FL database from Tanzania (2022), and I'm analyzing the data using R. I would like to know what is necessary to obtain the mentioned number of 15,254 women. Should I request another file, or can I continue with the 40,394? This is my first experience working with DHS databases. I already have the TZBR82FL database for SPSS, which contains 40,394 observations.
  • Attachment: Capturar.PNG
    (Size: 41.53KB, Downloaded 83 times)
  • Attachment: Capturar1.PNG
    (Size: 29.00KB, Downloaded 110 times)

[Updated on: Fri, 01 March 2024 11:40]

Report message to a moderator

Re: DHS-MIS [message #28745 is a reply to message #28744] Fri, 01 March 2024 13:12 Go to previous messageGo to next message
Bridgette-DHS is currently offline  Bridgette-DHS
Messages: 3199
Registered: February 2013
Senior Member

Following is a response from Senior DHS Stata Specialist, Tom Pullum:

There are different files for different units of analysis. The BR file ("BR" is in "TZBR82FL") is a file of births (the "B" is for "births"). Each case is a birth in the birth histories of women in the IR file (the "I" is for "individuals"). That file has 15,254 cases in it. I suggest that you switch to that file.

Please read more about dataset types here: https://dhsprogram.com/data/Dataset-Types.cfm
Re: DHS-MIS [message #28746 is a reply to message #28745] Fri, 01 March 2024 18:27 Go to previous messageGo to next message
Sancho is currently offline  Sancho
Messages: 5
Registered: February 2024
Member
I am writing to seek your guidance on which database you would recommend for studying anemia in children aged 6 to 59 months. I am aware that this outcome of interest is present in the TZBR82FL database. Considering my specific interest, I would like to know which database you suggest I use for my analysis.

It seems that I can use the PR file to obtain the prevalence of anemia in children aged 6-59 months and its associated factors, according to this guideline I read at this link: https://dhsprogram.com/data/Guide-to-DHS-Statistics/Anemia_S tatus.htm.

[Updated on: Sun, 03 March 2024 07:13]

Report message to a moderator

Re: DHS-MIS [message #28755 is a reply to message #28746] Mon, 04 March 2024 11:03 Go to previous messageGo to next message
Bridgette-DHS is currently offline  Bridgette-DHS
Messages: 3199
Registered: February 2013
Senior Member

Following is a response from Senior DHS Stata Specialist, Tom Pullum:

You have a couple of earlier postings but this is the first time you have actually said what you are studying. The anemia data for children are in the BR and KR files (hw57) and in the PR file (hc57). You probably want to use the KR file. It includes other information about the child and the mother, as well as household-level variables. DHS Methodological Report 18 (https://www.dhsprogram.com/pubs/pdf/MR18/MR18.pdf) may be helpful.



Re: DHS-MIS [message #28756 is a reply to message #28755] Mon, 04 March 2024 11:25 Go to previous messageGo to next message
Sancho is currently offline  Sancho
Messages: 5
Registered: February 2024
Member
After analyzing the data available in the BR file, I was able to identify the association I was investigating, specifically the relationship between maternal anemia and the likelihood of a child also developing anemia. Based on these findings, I would like to confirm if I can proceed with just this file (BR), considering that the relevant data is contained within it and that I have already been able to answer my research question using only the data from the BR file. My main objective is to understand the association between maternal anemia and the likelihood of anemia in children.
Re: DHS-MIS [message #28757 is a reply to message #28756] Mon, 04 March 2024 15:30 Go to previous message
Bridgette-DHS is currently offline  Bridgette-DHS
Messages: 3199
Registered: February 2013
Senior Member

Following is a response from Senior DHS Stata Specialist, Tom Pullum:

Yes, you can stay with the BR file, or you can switch to the KR file without losing anything. The KR file is like the BR file, with children as cases. But it is reduced to children born in the past 5 years, who are the only children who have anemia data. You would look at the relationship between v457 (for the mother) and hw57 (for the child).
Previous Topic: About extracting information on household characteristics
Next Topic: Review of DHS Questionnaires
Goto Forum:
  


Current Time: Sat Nov 30 18:14:21 Coordinated Universal Time 2024