The cluster-weighted model (CWM) is a member of the family of mixtures of regression models and is also known as a mixture of regressions with random covariates. CWMs refer to the framework of model-based clustering and naturally apply when the research interest requires modeling the relationship between a response variable and a set of covariates using a regression-based approach such as a generalized linear model with the sample being suspected of comprising heterogeneous latent classes. A command for fitting these models is not yet available in Stata, so the aim of this article is to introduce the package cwmglm, which fits CWMs based on the most common generalized linear models with random covariates. Moreover, cwmglm allows the estimation of parsimonious models of Gaussian distributions, with the parameterization of the variance–covariance matrix based on the eigenvalue decomposition. These features are completely new for Stata users. The cwmglm package features goodness-of-fit, bootstrapping, and model-selection tools. We illustrate the use of cwmglm with real and simulated datasets.

Spinelli, D., Ingrassia, S., Vittadini, G. (2024). Cluster-weighted models using Stata. THE STATA JOURNAL, 24(4), 711-745 [10.1177/1536867X241297922].

Cluster-weighted models using Stata

Spinelli D
Primo
;
Vittadini G
2024

Abstract

The cluster-weighted model (CWM) is a member of the family of mixtures of regression models and is also known as a mixture of regressions with random covariates. CWMs refer to the framework of model-based clustering and naturally apply when the research interest requires modeling the relationship between a response variable and a set of covariates using a regression-based approach such as a generalized linear model with the sample being suspected of comprising heterogeneous latent classes. A command for fitting these models is not yet available in Stata, so the aim of this article is to introduce the package cwmglm, which fits CWMs based on the most common generalized linear models with random covariates. Moreover, cwmglm allows the estimation of parsimonious models of Gaussian distributions, with the parameterization of the variance–covariance matrix based on the eigenvalue decomposition. These features are completely new for Stata users. The cwmglm package features goodness-of-fit, bootstrapping, and model-selection tools. We illustrate the use of cwmglm with real and simulated datasets.
Articolo in rivista - Articolo scientifico
cluster-weighted model; cwmglm; finite mixtures of regressions with random covariates; Gaussian parsimonious models; model-based clustering; postestimation; saturated mixture regression model; st0762;
English
24-dic-2024
2024
24
4
711
745
reserved
Spinelli, D., Ingrassia, S., Vittadini, G. (2024). Cluster-weighted models using Stata. THE STATA JOURNAL, 24(4), 711-745 [10.1177/1536867X241297922].
File in questo prodotto:
File Dimensione Formato  
Spinelli-2024-Stata Journal-VoR.pdf

Solo gestori archivio

Tipologia di allegato: Publisher’s Version (Version of Record, VoR)
Licenza: Tutti i diritti riservati
Dimensione 631.34 kB
Formato Adobe PDF
631.34 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/480679
Citazioni
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
Social impact