The use of hierarchical mixture priors with shared atoms has recently flourished in the Bayesian literature for partially exchangeable data. Leveraging on nested levels of mixtures, these models allow the estimation of a two-layered data partition: across groups and across observations. This paper discusses and compares the properties of such modeling strategies when the mixing weights are assigned either a finite-dimensional Dirichlet distribution or a Dirichlet process prior. Based on these considerations, we introduce a novel hierarchical nonparametric prior based on a finite set of shared atoms, a specification that enhances the flexibility of the induced random measures and the availability of fast posterior inference. To support these findings, we analytically derive the induced prior correlation structure and partially exchangeable partition probability function. Additionally, we develop a novel mean-field variational algorithm for posterior inference to boost the applicability of our nested model to large multivariate data. We then assess and compare the performance of the different shared-atom specifications via simulation. We also show that our variational proposal is highly scalable and that the accuracy of the posterior density estimate and the estimated partition is comparable with state-of-the-art Gibbs sampler algorithms. Finally, we apply our model to a real dataset of Spotify’s song features, simultaneously segmenting artists and songs with similar characteristics.

D’Angelo, L., Denti, F. (2024). A finite-infinite shared atoms nested model for the Bayesian analysis of large grouped data. BAYESIAN ANALYSIS [10.1214/24-BA1458].

A finite-infinite shared atoms nested model for the Bayesian analysis of large grouped data

D’Angelo, Laura;Denti, Francesco
2024

Abstract

The use of hierarchical mixture priors with shared atoms has recently flourished in the Bayesian literature for partially exchangeable data. Leveraging on nested levels of mixtures, these models allow the estimation of a two-layered data partition: across groups and across observations. This paper discusses and compares the properties of such modeling strategies when the mixing weights are assigned either a finite-dimensional Dirichlet distribution or a Dirichlet process prior. Based on these considerations, we introduce a novel hierarchical nonparametric prior based on a finite set of shared atoms, a specification that enhances the flexibility of the induced random measures and the availability of fast posterior inference. To support these findings, we analytically derive the induced prior correlation structure and partially exchangeable partition probability function. Additionally, we develop a novel mean-field variational algorithm for posterior inference to boost the applicability of our nested model to large multivariate data. We then assess and compare the performance of the different shared-atom specifications via simulation. We also show that our variational proposal is highly scalable and that the accuracy of the posterior density estimate and the estimated partition is comparable with state-of-the-art Gibbs sampler algorithms. Finally, we apply our model to a real dataset of Spotify’s song features, simultaneously segmenting artists and songs with similar characteristics.
Articolo in rivista - Articolo scientifico
Dirichlet process , finite mixture , multivariate data , partially exchangeable data , variational Bayes
English
19-set-2024
2024
open
D’Angelo, L., Denti, F. (2024). A finite-infinite shared atoms nested model for the Bayesian analysis of large grouped data. BAYESIAN ANALYSIS [10.1214/24-BA1458].
File in questo prodotto:
File Dimensione Formato  
DAngelo-2024-Bayesian Anal-VoR.pdf

accesso aperto

Descrizione: Open Access Policy - https://projecteuclid.org/journals/bayesian-analysis/scope-and-details
Tipologia di allegato: Publisher’s Version (Version of Record, VoR)
Licenza: Creative Commons
Dimensione 24.75 MB
Formato Adobe PDF
24.75 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/538811
Citazioni
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
Social impact