Human gut microbiome studies are of critical importance to evaluate the health of an individual. Understanding the interactions within microbial communities has a crucial role in comprehending human biological systems. Topic modelling techniques are usually employed to analyze textual documents and exploit a latent variable framework to find hidden topics in the data. Quite interestingly, a clear semantic correspondence between text and microbiome analyses allows applying these techniques to detect enterotypes. One of the most commonly used models in the field is the Latent Dirichlet Allocation which, however, suffers from some limitations due to the stiffness of its standard topic prior distribution, the Dirichlet. This study proposes a flexible Dirichlet distribution as topic prior distribution in the context of microbial systems. The goal is to detect the important taxa characterizing different enterotypes.
Giampino, A., Ascari, R., Migliorati, S. (2025). Microbiome Enterotype Detection via a Latent Variable Allocation Model. In A. Pollice, P. Mariani (a cura di), Methodological and Applied Statistics and Demography IV SIS 2024, Short Papers, Contributed Sessions 2 (pp. 96-101). Springer [10.1007/978-3-031-64447-4_16].
Microbiome Enterotype Detection via a Latent Variable Allocation Model
Giampino, Alice
;Ascari, Roberto;Migliorati, Sonia
2025
Abstract
Human gut microbiome studies are of critical importance to evaluate the health of an individual. Understanding the interactions within microbial communities has a crucial role in comprehending human biological systems. Topic modelling techniques are usually employed to analyze textual documents and exploit a latent variable framework to find hidden topics in the data. Quite interestingly, a clear semantic correspondence between text and microbiome analyses allows applying these techniques to detect enterotypes. One of the most commonly used models in the field is the Latent Dirichlet Allocation which, however, suffers from some limitations due to the stiffness of its standard topic prior distribution, the Dirichlet. This study proposes a flexible Dirichlet distribution as topic prior distribution in the context of microbial systems. The goal is to detect the important taxa characterizing different enterotypes.File | Dimensione | Formato | |
---|---|---|---|
Giampino-2025-Methodological and Applied Statistics-VoR.pdf
Solo gestori archivio
Tipologia di allegato:
Publisher’s Version (Version of Record, VoR)
Licenza:
Tutti i diritti riservati
Dimensione
770.69 kB
Formato
Adobe PDF
|
770.69 kB | Adobe PDF | Visualizza/Apri Richiedi una copia |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.