The DHS Program User Forum
Discussions regarding The DHS Program data and results
Home » Data » Merging data files » merge IR and KR: duplicate line number in IR
Re: merge IR and KR: duplicate line number in IR [message #23360 is a reply to message #23359] Fri, 27 August 2021 22:45 Go to previous message
bun_2019fall is currently offline  bun_2019fall
Messages: 9
Registered: August 2021
Member
Hi Bruno,

Thank you for much for looking into the file, and I really appreciated it!

The "caseid" solution is almost perfect, except for one file "DRIR21FL":

use "V:\DHS raw survey/DRIR21DT/DRIR21FL.DTA", clear
*duplicates by caseid
duplicates tag caseid, generate(dup)
sort caseid
tab dup /*4 duplicates*/
*complete duplicates
duplicates tag, gen(dup1)
tab dup1 /*no duplicates*/


The "caseid" itself has 4 duplicates in this dataset, out of all DHS IR files that I have downloaded as of July 2021. I further checked, these observations are not complete duplicates. That said, to proceed with data analysis, I wonder if I should just drop the 4 observations? I wonder if DHS has any best practice regarding this issue? Thank you, again!
 
Read Message
Read Message
Read Message
Read Message
Read Message
Read Message
Read Message
Previous Topic: Merging data files
Next Topic: I share you summary of GPS & survey & year for bulk dowonloaders
Goto Forum:
  


Current Time: Sun Dec 22 15:16:39 Coordinated Universal Time 2024