The DHS Program User Forum: Dataset use in Stata » Keeping caseid in and keeping missing observations out when using Stata "collapse"

Home » Data » Dataset use in Stata » Keeping caseid in and keeping missing observations out when using Stata "collapse"

Show: Today's Messages :: Show Polls :: Message Navigator

Re: Keeping caseid in and keeping missing observations out when using Stata "collapse" [message #5617 is a reply to message #5611]

Wed, 17 June 2015 09:26

Bridgette-DHS
Messages: 3039
Registered: February 2013

Senior Member

Following is a response from DHS Senior Stata Specialist, Tom Pullum:

If you collapse by v001, you cannot include caseid in the "by" part of the collapse command. You should replace "by(v001 caseid)" with "by(v001)". The collapsed file will have one record per cluster.

caseid is a combination of v001, v002, and v003. They are numeric variables but caseid is a string with embedded blanks.

To merge the collapsed data back onto the individual records in the IR files, you only need to sort both files on v001. However, when I do this I sort the IR file on v001 v002 v003, even though it's not really required. Since your cluster-level file does not contain v002 and v003, they are irrelevant for the merge.

So I recommend lines such as the following:

[your sort command]
sort v001
save temp.dta, replace
use IRdata.dta, clear
sort v001 v002 v003
merge v001 using temp.dta
keep if _merge==3

Like many Stata users, I prefer the old version of the merge command, but the newer one will also work.

Report message to a moderator

[Message index]

		Keeping caseid in and keeping missing observations out when using Stata "collapse" By: Lizzynaija on Tue, 16 June 2015 10:54
		Re: Keeping caseid in and keeping missing observations out when using Stata "collapse" By: Bridgette-DHS on Wed, 17 June 2015 09:26
		Re: Keeping caseid in and keeping missing observations out when using Stata "collapse" By: Lizzynaija on Thu, 18 June 2015 22:15
		Re: Keeping caseid in and keeping missing observations out when using Stata "collapse" By: Lizzynaija on Tue, 23 June 2015 20:46
		Re: Keeping caseid in and keeping missing observations out when using Stata "collapse" By: Bridgette-DHS on Thu, 25 June 2015 11:53
		Re: Keeping caseid in and keeping missing observations out when using Stata "collapse" By: Lizzynaija on Thu, 16 July 2015 10:16
		Re: Keeping caseid in and keeping missing observations out when using Stata "collapse" By: Krishna on Mon, 06 February 2017 02:38
		Re: Keeping caseid in and keeping missing observations out when using Stata "collapse" By: Bridgette-DHS on Mon, 06 February 2017 09:13
		Re: Keeping caseid in and keeping missing observations out when using Stata "collapse" By: Hassen on Sat, 05 May 2018 21:30
		Re: Keeping caseid in and keeping missing observations out when using Stata "collapse" By: Bridgette-DHS on Mon, 07 May 2018 08:47

Previous Topic:	merging wealth index from Ethiopia DHS 2000 to women's file
Next Topic:	What is the difference between hw70_1 hw70_2 hw70_3 etc?

Goto Forum:

-=] Back to Top [=-

[ Syndicate this forum (XML) ] [

] [

]

Current Time: Thu Apr 25 06:50:54 Coordinated Universal Time 2024