The DHS Program User Forum: Weighting data » Query on Cluster-Level Modeling with DHS Data and Sampling Weights

Building models whereby the response variable and covariates are aggregated at the cluster level.


Re: Query on Cluster-Level Modeling with DHS Data and Sampling Weights [message #30101 is a reply to message #30090]

Mon, 23 September 2024 14:55

Bridgette-DHS

Following is a response from Senior DHS staff member, Tom Pullum:

When I said "this" would not be a good analysis of the data, I was referring to my own example. I was not judging what you are doing!

In the glm command that I suggested, the outcome is the numerator of a proportion (the proportion being the mean of a 0/1 variable), and the option "family(binomial cases)" specifies the denominator as the variable "cases". This is equivalent to a model in which the cluster-level outcome is a proportion, weighted by the number of cases in the denominator. The fitted proportion for a cluster is the fitted frequency divided by "cases".
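To make the equivalence concrete, here is a minimal sketch in Python rather than Stata. It assumes a logit link and a single cluster-level covariate; the data and the function name fit_grouped_logit are hypothetical and are not taken from this thread or from any DHS survey. It fits a grouped binomial model by Newton-Raphson and then checks that, at the solution, the residuals (observed minus fitted proportion) weighted by "cases" sum to zero, which is the sense in which the model weights each cluster's proportion by its denominator.

```python
import math

# Hypothetical cluster-level data (illustrative only, not DHS data).
successes = [3, 10, 6, 12, 15]      # numerator: cases with outcome = 1
cases = [20, 25, 18, 22, 24]        # denominator: number of cases per cluster
x = [0.1, 0.4, 0.3, 0.6, 0.8]       # one cluster-level covariate


def fit_grouped_logit(successes, cases, x, iterations=50):
    """Newton-Raphson for logit(p_i) = b0 + b1 * x_i,
    with successes_i ~ Binomial(cases_i, p_i)."""
    b0, b1 = 0.0, 0.0
    for _ in range(iterations):
        g0 = g1 = 0.0               # score (gradient) components
        h00 = h01 = h11 = 0.0       # information matrix X'WX
        for y, n, xi in zip(successes, cases, x):
            p = 1.0 / (1.0 + math.exp(-(b0 + b1 * xi)))
            r = y - n * p           # equals n * (observed - fitted proportion)
            w = n * p * (1.0 - p)   # binomial variance; scales with cases
            g0 += r
            g1 += r * xi
            h00 += w
            h01 += w * xi
            h11 += w * xi * xi
        det = h00 * h11 - h01 * h01
        # Newton step: beta += inverse(X'WX) * score, written out for 2x2.
        b0 += (h11 * g0 - h01 * g1) / det
        b1 += (h00 * g1 - h01 * g0) / det
    return b0, b1


b0, b1 = fit_grouped_logit(successes, cases, x)

fitted_prop = [1.0 / (1.0 + math.exp(-(b0 + b1 * xi))) for xi in x]
fitted_freq = [n * p for n, p in zip(cases, fitted_prop)]
observed_prop = [y / n for y, n in zip(successes, cases)]

# Score equations expressed as cases-weighted residuals of the proportions.
score_intercept = sum(n * (po - pf)
                      for n, po, pf in zip(cases, observed_prop, fitted_prop))
score_slope = sum(n * (po - pf) * xi
                  for n, po, pf, xi in zip(cases, observed_prop, fitted_prop, x))

for n, f, p in zip(cases, fitted_freq, fitted_prop):
    # Fitted proportion is the fitted frequency divided by cases.
    print(f"cases={n:3d}  fitted freq={f:6.2f}  fitted prop={f / n:.3f}")
print("weighted residual sums:", score_intercept, score_slope)
```

Both weighted residual sums should be zero up to rounding error, and the fitted frequencies should add up to the total number of successes.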

This part of the collapse command, "(mean) v190_*", will construct five proportions that add to one: the proportions of "cases" in wealth quintiles 1, 2, 3, 4, and 5. On the right-hand side of the estimation command you could include all five of those proportions (as I did); one will be aliased because of the linear constraint (they add to 1). However, I would recommend recoding to a single proportion, such as the proportion in the bottom two quintiles, which gives just one coefficient and is easier to interpret.
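The aggregation step can be sketched in Python as follows. This is only an illustration of what taking cluster-level means of five 0/1 quintile indicators produces. The records are hypothetical, the means are unweighted, and the names cluster_rows and bottom_two are invented for the example.

```python
from collections import defaultdict

# Hypothetical individual records: (cluster id, wealth quintile 1..5).
records = [
    (1, 1), (1, 1), (1, 2), (1, 3), (1, 5),
    (2, 2), (2, 3), (2, 4), (2, 4), (2, 5), (2, 5),
    (3, 1), (3, 2), (3, 2), (3, 3),
]

# Count cases per quintile within each cluster.
counts = defaultdict(lambda: [0] * 5)
for cluster, quintile in records:
    counts[cluster][quintile - 1] += 1

cluster_rows = {}
for cluster, c in sorted(counts.items()):
    n = sum(c)
    cluster_rows[cluster] = {
        "cases": n,
        # Mean of each 0/1 quintile indicator = proportion in that quintile.
        "props": [k / n for k in c],
        # Single recoded covariate: proportion in the bottom two quintiles.
        "bottom_two": (c[0] + c[1]) / n,
    }

for cluster, row in cluster_rows.items():
    # The five proportions add to 1 in every cluster, so together with an
    # intercept they are linearly dependent and one of them is aliased.
    print(cluster, row["cases"], [round(p, 3) for p in row["props"]],
          "sum =", round(sum(row["props"]), 6),
          "bottom two =", round(row["bottom_two"], 3))
```

Because the five proportions always sum to 1, a model with an intercept can estimate at most four of their coefficients, whereas the single bottom_two covariate avoids the redundancy.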

You could perhaps include other covariates after the "mean" portion of the collapse command, but you could also just use your geospatial variables.

I hope this helps. Let us know if you have questions specifically for the geospatial team.


