The DHS Program User Forum
Discussions regarding The DHS Program data and results
Home » Topics » Child Health » duplicate caseid
Re: duplicate caseid [message #24424 is a reply to message #24397] Fri, 13 May 2022 10:59 Go to previous messageGo to previous message
Janet-DHS is currently offline  Janet-DHS
Messages: 891
Registered: April 2022
Senior Member
Following is response from DHS Research & Data Analysis Director, Tom Pullum:

What survey and what file are you using? In the KR file, for example, there is a record for every child born in the past five years. caseid is the mother's ID code. Because many women had more than one child in the past five years, there will be several records with the same value of caseid, but children of the same mother will have different values of bidx (1, 2, etc.). In the IR file, there should never be a repeat of the same caseid, although very rarely we will find a duplicate. To check for duplicates in the IR file, in Stata, enter "gen ncases=1", then "collapse (sum) ncases, by(caseid)", then "tab ncases" and "list if ncases>1, table clean".
 
Read Message
Read Message
Read Message
Read Message
Read Message
Read Message
Read Message
Previous Topic: How to do logistic regression for infant mortality
Next Topic: Tuberculosis and Childhood Tuberculosis
Goto Forum:
  


Current Time: Wed Nov 27 06:45:20 Coordinated Universal Time 2024