The DHS Program User Forum
Discussions regarding The DHS Program data and results
SNIR7QDT.zip Senegal - two levels of compression (double zipped) [message #13740] Fri, 15 December 2017 11:57
maja
Messages: 3
Registered: December 2017
Member
The SNIR7QDT.zip file has a "double layer of zipping": it contains a second zip file, SNIRG0DT.zip, that doesn't fit the naming convention, so I don't know what it could be.

I'm not sure where to report this error, but it should be a simple thing to fix?

I can obviously fix it manually on my side, but it is messing up the automation/reproducibility of my workflow.

Could you please direct me to where this query can be addressed?

Re: SNIR7QDT.zip Senegal - two levels of compression (double zipped) [message #13741 is a reply to message #13740] Fri, 15 December 2017 15:56
Bridgette-DHS
Messages: 3199
Registered: February 2013
Senior Member
There is a footnote on the page that reads as follows:

Footnote:

Each zip file includes two datasets:
SNxx7Qxx.xxx for the 2016 survey alone.
SNxxG0xx.xxx for the combined 2015/2016 surveys.
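
For an automated workflow, one option is to treat the download as a container and descend into any inner zip files rather than unzipping by hand. The following is a minimal sketch using only the Python standard library. The function name `iter_nested_files` is made up for illustration, and it assumes the inner archives are ordinary, unencrypted zip files.

```python
import io
import zipfile


def iter_nested_files(zip_bytes, prefix=""):
    """Yield (path, data) for every non-zip file, descending into inner zips.

    `zip_bytes` is the raw content of a zip archive. `path` is built from
    the chain of archive member names, so the inner archive a file came
    from stays visible in the result.
    """
    with zipfile.ZipFile(io.BytesIO(zip_bytes)) as archive:
        for info in archive.infolist():
            if info.is_dir():
                continue
            data = archive.read(info)
            path = prefix + info.filename
            if info.filename.lower().endswith(".zip"):
                # Inner archive: recurse instead of returning the raw zip.
                yield from iter_nested_files(data, prefix=path + "/")
            else:
                yield path, data
```

Reading the downloaded file with `open(path, "rb").read()` and passing the bytes in gives one entry per data file. Because each path starts with the name of the inner archive, the single-survey and combined datasets remain distinguishable downstream.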
Re: SNIR7QDT.zip Senegal - two levels of compression (double zipped) [message #13765 is a reply to message #13741] Wed, 20 December 2017 07:11
maja
Thank you Bridgette,

You are of course correct. It seems the 2014 Senegal files have a similar structure, with the second level having both the "expected" file, and another one.

I have a few questions:

1. Are there any other exceptions like these two? Is there a reference list of them?

2. The 2014 files (SNxx70xx.zip) contain both the "expected" SNxx70xx.zip files and the SNxx6Rxx.zip files. Am I correct in saying that the second set of files is not named in line with the Distribution File Naming Convention as described at https://dhsprogram.com/data/File-Types-and-Names.cfm ? Namely, the "R" should refer to the 4th DHS VI survey, but that is not the case?

3. Similarly, the "G" in the SNxxG0xx.zip files that are found inside SNxx7Qxx.zip: what is the "G"?

4. Perhaps more simply: when the footnote says the surveys have been combined, does this just mean merging, like so:

SNxx6R = SNxx6D + SNxx70 and SNxxG0 = SNxx7Q + SNxx7H ?

Is there any particular reason for this approach?

Many thanks for your time.

Re: SNIR7QDT.zip Senegal - two levels of compression (double zipped) [message #13767 is a reply to message #13765] Wed, 20 December 2017 13:02
Bridgette-DHS
1. There are no other exceptions like the Senegal surveys.

2. Yes, the "R" is the 4th survey under DHS VI. The 2012-13 survey was done under DHS VI, whereas the 2014 survey was done under DHS VII.

3. The "G" (the 7th letter of the alphabet) indicates the fifth survey under DHS "7". This is unique to Senegal, as it's the first time we have had a fifth survey under a contract.

4. Senegal: Continuous DHS, 2014 Page - each zip file includes two datasets:
SNxx70xx.xxx contains the data for the 2014 survey alone.
SNxx6Rxx.xxx contains the data for the combined 2012-13/2014 surveys - used to produce subnational regional results.

Senegal: Continuous DHS, 2016 Page - each zip file includes two datasets:
SNxx7Qxx.xxx contains the data for the 2016 survey alone.
SNxxG0xx.xxx contains the data for the combined 2015-2016 surveys - used to produce subnational regional results.
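
To tell the two datasets apart programmatically, the file names can be split into their parts. The sketch below assumes the eight-character layout implied by patterns such as SNxx7Qxx (two characters each for country, file type, version, and file format); the field names and the helpers `parse_dhs_name` and `group_by_version` are hypothetical, not part of any DHS tool.

```python
import re
from typing import NamedTuple


class DhsFileName(NamedTuple):
    country: str      # e.g. "SN"
    file_type: str    # e.g. "IR"
    version: str      # e.g. "7Q" or "G0"
    file_format: str  # e.g. "DT"


# Assumed layout: 2 + 2 + 2 + 2 characters, optionally followed by an extension.
_NAME = re.compile(
    r"([A-Za-z]{2})([A-Za-z0-9]{2})([A-Za-z0-9]{2})([A-Za-z]{2})(?:\.\w+)?"
)


def parse_dhs_name(name):
    """Split a name such as 'SNIR7QDT.zip' into its four two-character parts."""
    match = _NAME.fullmatch(name)
    if match is None:
        raise ValueError(f"not an 8-character DHS-style file name: {name!r}")
    return DhsFileName(*(part.upper() for part in match.groups()))


def group_by_version(names):
    """Map each version code to the file names that carry it."""
    groups = {}
    for name in names:
        groups.setdefault(parse_dhs_name(name).version, []).append(name)
    return groups
```

With this, `parse_dhs_name("SNIRG0DT.zip").version` gives `"G0"`, so a workflow can route the single-survey file (7Q) and the combined file (G0) separately instead of relying on the outer archive name.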
Re: SNIR7QDT.zip Senegal - two levels of compression (double zipped) [message #13768 is a reply to message #13767] Wed, 20 December 2017 13:26
maja
Thank you for your quick reply Bridgette!

I am still not clear on point 4, though.
When you say the "combined surveys", are these just merged (row-wise) from the other two files that I list explicitly:
SNxx6R = SNxx6D + SNxx70 and SNxxG0 = SNxx7Q + SNxx7H

My point being: if, for example, I have already downloaded SNxx6Dxx, I should then not use 6R, because I would be duplicating those cases?

Re: SNIR7QDT.zip Senegal - two levels of compression (double zipped) [message #14068 is a reply to message #13768] Fri, 09 February 2018 08:37
Bridgette-DHS
"Combined data" merely means that the file contains two sets of observations, from two surveys conducted at two different points in time. And yes, the sn??6R*.zip file contains combined data for 2012-13 and 2014.
