Discrete random structures are important tools in Bayesian nonparametrics and the resulting models have proven effective in density estimation, clustering, topic modeling and prediction, among others. In this paper, we consider nested processes and study the dependence structures they induce. Dependence ranges between homogeneity, corresponding to full exchangeability, and maximum heterogeneity, corresponding to (unconditional) independence across samples. The popular nested Dirichlet process is shown to degenerate to the fully exchangeable case when there are ties across samples at the observed or latent level. To overcome this drawback, inherent to nesting general discrete random measures, we introduce a novel class of latent nested processes. These are obtained by adding common and group-specific completely random measures and, then, normalizing to yield dependent random probability measures. We provide results on the partition distributions induced by latent nested processes, and develop a Markov Chain Monte Carlo sampler for Bayesian inferences. A test for distributional homogeneity across groups is obtained as a by-product. The results and their inferential implications are showcased on synthetic and real data.

Camerlenghi, F., Dunson, D., Lijoi, A., Pruenster, I., Rodriguez, A. (2019). Latent nested nonparametric priors (with discussion). BAYESIAN ANALYSIS, 14(4), 1303-1356 [10.1214/19-BA1169].

Latent nested nonparametric priors (with discussion)

Camerlenghi, F
;
2019

Abstract

Discrete random structures are important tools in Bayesian nonparametrics and the resulting models have proven effective in density estimation, clustering, topic modeling and prediction, among others. In this paper, we consider nested processes and study the dependence structures they induce. Dependence ranges between homogeneity, corresponding to full exchangeability, and maximum heterogeneity, corresponding to (unconditional) independence across samples. The popular nested Dirichlet process is shown to degenerate to the fully exchangeable case when there are ties across samples at the observed or latent level. To overcome this drawback, inherent to nesting general discrete random measures, we introduce a novel class of latent nested processes. These are obtained by adding common and group-specific completely random measures and, then, normalizing to yield dependent random probability measures. We provide results on the partition distributions induced by latent nested processes, and develop a Markov Chain Monte Carlo sampler for Bayesian inferences. A test for distributional homogeneity across groups is obtained as a by-product. The results and their inferential implications are showcased on synthetic and real data.
Articolo in rivista - Articolo scientifico
Bayesian nonparametrics; Completely random measures; Dependent nonparametric priors; Heterogeneity; Mixture models; Nested processes.
English
2019
14
4
1303
1356
reserved
Camerlenghi, F., Dunson, D., Lijoi, A., Pruenster, I., Rodriguez, A. (2019). Latent nested nonparametric priors (with discussion). BAYESIAN ANALYSIS, 14(4), 1303-1356 [10.1214/19-BA1169].
File in questo prodotto:
File Dimensione Formato  
19-BA1169.pdf

Solo gestori archivio

Tipologia di allegato: Publisher’s Version (Version of Record, VoR)
Dimensione 1.24 MB
Formato Adobe PDF
1.24 MB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/191807
Citazioni
  • Scopus 30
  • ???jsp.display-item.citation.isi??? 27
Social impact