Pakistan DHS 2012 district codes [message #26906] |
Tue, 23 May 2023 10:33 |
Magda
Messages: 2 Registered: May 2023
|
Member |
|
|
Hello,
I am interested in using Pakistan DHS 2012 and DHS 2017. In the DHS2017 individual recode dataset PKIR71FL we have a variable "sdist" with district codes and district names reported in the do file. The DHS 2012 dataset PKIR61FL has the same variable "sdist" with what look to be largely the same codes but we are not given the names corresponding to each code in the do file. Is it safe to assume the codes in 2012 are the same as the codes in 2017? Alternatively, is there other way to tell location in the 2012 data that is more specific than just regions?
Thank you for any tips you may have.
Best,
Magda
|
|
|
|
|
|
|
|
Re: Pakistan DHS 2012 district codes [message #26960 is a reply to message #26951] |
Fri, 02 June 2023 16:43 |
Janet-DHS
Messages: 895 Registered: April 2022
|
Senior Member |
|
|
Following is a response from DHS staff member, Tom Pullum:
District-level estimates are no more representative in the second survey than in the first survey. There was no relevant difference in the survey designs. The 2012 survey had about 94,000 household members and the 2017-18 had about 101,000, including some geographic areas that were not in the 2012 survey, so the number of cases per district was about the same.
The estimates are unbiased at the district level but DHS would say they are "not representative" because the standard errors would be so high.
What you want to do, looking at variation across districts, sounds good, and you can do that even if you do not have names for the districts. But you have to be cautious about over-interpreting the extreme values. Statistical instability because of small numbers of cases in many of the districts can lead to exaggeration of the estimated range across districts. You could google something like "Bayesian adjustment for hot spots", for example, to find ways adjust for this effect.
|
|
|
Re: Pakistan DHS 2012 district codes [message #27708 is a reply to message #26960] |
Sun, 24 September 2023 22:49 |
owraza
Messages: 37 Registered: December 2013 Location: Nebraska, US
|
Member |
|
|
Janet-DHS wrote on Sat, 03 June 2023 01:13Following is a response from DHS staff member, Tom Pullum:
The estimates are unbiased at the district level but DHS would say they are "not representative" because the standard errors would be so high.
What you want to do, looking at variation across districts, sounds good, and you can do that even if you do not have names for the districts. But you have to be cautious about over-interpreting the extreme values. Statistical instability because of small numbers of cases in many of the districts can lead to exaggeration of the estimated range across districts. You could google something like "Bayesian adjustment for hot spots", for example, to find ways adjust for this effect.
So it would be safe to say that using geographically weighted regression, one can create a spatial surface to depict variation across districts where aim is not to estimate at district level.
|
|
|
|
|