Model-based approach for household clustering with mixed scale variables


The Ministry of Social Development in Mexico is in charge of creating and assigning social programmes targeting specific needs in the population for the improvement of quality of life. To better target the social programmes, the Ministry is aimed to find clusters of households with the same needs based on demographic characteristics as well as poverty conditions of the household. Available data consists of continuous, ordinal, and nominal variables and the observations are not iid but come from a survey sample based on a complex design. We propose a Bayesian nonparametric mixture model that jointly models this mixed scale data and accommodates for the different sampling probabilities. The performance of the model is assessed via simulated data. A full analysis of socio-economic conditions in households in the State of Mexico is presented.

Advances in Data Analysis and Classification
Chris U. Carmona
Chris U. Carmona
Doctoral Researcher in Statistical Machine Learning
comments powered by Disqus