In the model-based clustering literature, we find several methodologies to study latent structures underlying the data, among which mixtures of factor analyzers. However, none of them can detect hierarchical relationships among latent variables. The Ultrametric Gaussian Mixture Model (UGMM) is intended to reach this goal by identifying a hierarchy of variables, starting by partitioning the variables into a reduced number of groups per mixture component. Nonetheless, up to now, it requires complete observations, which is often not the case in real data collections. In this paper, we propose the extension of UGMM in the missing data framework. The proposal is applied to a real data set for inspecting the relationships among features of songs of different genres.

Greselin, F., Zaccaria, G. (2023). Handling missing data in complex phenomena: an ultrametric model-based approach for clustering. In Book of the Short Papers - SIS 2023 (pp.961-966). Torino : Pearson.

Handling missing data in complex phenomena: an ultrametric model-based approach for clustering

Greselin, F;Zaccaria, G
2023

Abstract

In the model-based clustering literature, we find several methodologies to study latent structures underlying the data, among which mixtures of factor analyzers. However, none of them can detect hierarchical relationships among latent variables. The Ultrametric Gaussian Mixture Model (UGMM) is intended to reach this goal by identifying a hierarchy of variables, starting by partitioning the variables into a reduced number of groups per mixture component. Nonetheless, up to now, it requires complete observations, which is often not the case in real data collections. In this paper, we propose the extension of UGMM in the missing data framework. The proposal is applied to a real data set for inspecting the relationships among features of songs of different genres.
paper
ultrametricity, Gaussian mixture models, missing information, hierarchy of latent concepts, heterogeneous populations, features of songs
English
SIS 2023 - Statistical Learning, Sustainability and Impact Evaluation
2023
Chelli, FM; Ciommi, M; Ingrassia, S; Mariani, F; Recchioni, MC
Book of the Short Papers - SIS 2023
9788891935618
2023
961
966
https://it.pearson.com/content/dam/region-core/italy/pearson-italy/pdf/Docenti/Università/bozza-book-compresso.pdf
reserved
Greselin, F., Zaccaria, G. (2023). Handling missing data in complex phenomena: an ultrametric model-based approach for clustering. In Book of the Short Papers - SIS 2023 (pp.961-966). Torino : Pearson.
File in questo prodotto:
File Dimensione Formato  
Zaccaria-2023-SIS 2023-VoR.pdf

Solo gestori archivio

Descrizione: Articolo
Tipologia di allegato: Publisher’s Version (Version of Record, VoR)
Licenza: Tutti i diritti riservati
Dimensione 1.38 MB
Formato Adobe PDF
1.38 MB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/439340
Citazioni
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
Social impact