The DHS Program User Forum
Discussions regarding The DHS Program data and results
Home » Countries » India » Are the caseid for IR file unique across rounds for DHS India? (Are the caseid for IR file unique across rounds for DHS India?)
Are the caseid for IR file unique across rounds for DHS India? [message #24915] Tue, 02 August 2022 10:13 Go to next message
teeb is currently offline  teeb
Messages: 11
Registered: July 2022
Member
I appended IR files from round 1 through 5 and found duplicates in terms of caseid when I checked for duplicates.

Is a caseid unique to the particular round? Or two women from different rounds can have same caseid? Were strictly different women surveyed in different rounds?

I also created a womanid by adding a round prefix to the existing caseid and the womanid did not have duplicates.

I'm not sure if this indicates an error on my part while cleaning data or the structure of the data is such.

Would appreciate any help.
Thanks

PS: in fact I went through caseids for the different rounds and I can see the caseids for IR file are not unique to a round. A woman in DHS 1992-93 has same caseid as that in DHS 1998-99. So the follow up question is are these different women or the same?

[Updated on: Tue, 02 August 2022 10:19]

Report message to a moderator

Re: Are the caseid for IR file unique across rounds for DHS India? [message #24917 is a reply to message #24915] Tue, 02 August 2022 15:47 Go to previous messageGo to next message
Bridgette-DHS is currently offline  Bridgette-DHS
Messages: 3199
Registered: February 2013
Senior Member
Following is a response from DHS Research & Data Analysis Director, Tom Pullum:

No women appear in more than one round. The samples in successive surveys are statistically independent. Even the sample clusters are different.

As shown below, there are no duplicates in the IR files for any of the India surveys. Duplicates would be indicated by any values greater than 1 when the number of cases with each ID are added up. Earlier posts have stated that v024 (region) must be included in the unique identifiers. I collapsed over "v024 v001 v002 v003". In another run I collapsed over "caseid" and got exactly the same result. There are no duplicates in the India IR files.

. local lpvs 23 42 52 74 7B

. 
. foreach lpv of local lpvs {
  2. 
. use caseid v024 v001 v002 v003 using "C:\Users\26216\ICF\Analysis - Shared Resources\Data\DHSdata\IAIR`lpv'FL.DTA", clear 
  3. 
. gen dups=1
  4. 
. collapse (sum) dups,by(v024 v001 v002 v003)
  5. 
. tab dups
  6. 
. }

/index.php?t=getfile&id=1892&private=0
  • Attachment: collapse.jpg
    (Size: 66.73KB, Downloaded 451 times)
Re: Are the caseid for IR file unique across rounds for DHS India? [message #24920 is a reply to message #24917] Wed, 03 August 2022 00:53 Go to previous messageGo to next message
teeb is currently offline  teeb
Messages: 11
Registered: July 2022
Member
Thanks for the prompt response and for offering clarification on the sampling strategy.

If I understood the code correctly, this looks for duplicates in terms of caseid within each round.

However, my confusion arises from the fact that a caseid which appears in IR file of DHS 1992-93 also appears in IR file of DHS 1998-99. For eg, please see the attachments.

Would be grateful if you could offer some explanation.

Thanks again /index.php?t=getfile&id=1894&private=0/index.php?t=getfile&id=1895&private=0
Re: Are the caseid for IR file unique across rounds for DHS India? [message #24924 is a reply to message #24920] Wed, 03 August 2022 15:38 Go to previous messageGo to next message
Bridgette-DHS is currently offline  Bridgette-DHS
Messages: 3199
Registered: February 2013
Senior Member

Following is a response from DHS Research & Data Analysis Director, Tom Pullum:

As I said, each survey has different clusters, households, and individuals. However, v024 (region) has codes 1, 2, 3, etc. Within each value of v024, v001 (cluster) has codes 1, 2, 3, etc. Within each value of v001, v002 (household) is numbered 1, 2, 3, etc. Within each value of v002, v003 (line number) is 1, 2, 3, etc. (I am giving variable names in the IR file, they will be different in the PR and MR files.)

I think you are making this more complicated than necessary. The numbering system will lead to people in different surveys (in the same country or different countries) having the same ID code. The ID codes are not like passport numbers that are connected with specific individuals. They are simply a device for distinguishing different cases within the same survey.

Re: Are the caseid for IR file unique across rounds for DHS India? [message #24950 is a reply to message #24924] Mon, 08 August 2022 05:55 Go to previous message
teeb is currently offline  teeb
Messages: 11
Registered: July 2022
Member
Thanks a lot! This cleared up my confusion
Previous Topic: Where can I find the data for Sibling composition in India
Next Topic: Replication of table 2.26 School attendance by state/union territory in the India Country Report
Goto Forum:
  


Current Time: Sun Nov 24 20:17:32 Coordinated Universal Time 2024