Bicocca Open Archive

The flexible Dirichlet (FD) distribution (Ongaro and Migliorati in J. Multvar. Anal. 114: 412–426, 2013) makes it possible to preserve many theoretical properties of the Dirichlet one, without inheriting its lack of flexibility in modeling the various independence concepts appropriate for compositional data, i.e. data representing vectors of proportions. In this paper we tackle the potential of the FD from an inferential and applicative viewpoint. In this regard, the key feature appears to be the special structure defining its Dirichlet mixture representation. This structure determines a simple and clearly interpretable differentiation among mixture components which can capture the main features of a large variety of data sets. Furthermore, it allows a substantially greater flexibility than the Dirichlet, including both unimodality and a varying number of modes. Very importantly, this increased flexibility is obtained without sharing many of the inferential difficulties typical of general mixtures. Indeed, the FD displays the identifiability and likelihood behavior proper to common (non-mixture) models. Moreover, thanks to a novel non random initialization based on the special FD mixture structure, an efficient and sound estimation procedure can be devised which suitably combines EM-types algorithms. Reliable complete-data likelihood-based estimators for standard errors can be provided as well.

Migliorati, S., Ongaro, A., Monti, G. (2017). A structured Dirichlet mixture model for compositional data: inferential and applicative issues. STATISTICS AND COMPUTING, 27(4), 963-983 [10.1007/s11222-016-9665-y].

A structured Dirichlet mixture model for compositional data: inferential and applicative issues

MIGLIORATI, SONIA^Primo;ONGARO, ANDREA^Secondo;MONTI, GIANNA SERAFINA^Ultimo

2017

Abstract

The flexible Dirichlet (FD) distribution (Ongaro and Migliorati in J. Multvar. Anal. 114: 412–426, 2013) makes it possible to preserve many theoretical properties of the Dirichlet one, without inheriting its lack of flexibility in modeling the various independence concepts appropriate for compositional data, i.e. data representing vectors of proportions. In this paper we tackle the potential of the FD from an inferential and applicative viewpoint. In this regard, the key feature appears to be the special structure defining its Dirichlet mixture representation. This structure determines a simple and clearly interpretable differentiation among mixture components which can capture the main features of a large variety of data sets. Furthermore, it allows a substantially greater flexibility than the Dirichlet, including both unimodality and a varying number of modes. Very importantly, this increased flexibility is obtained without sharing many of the inferential difficulties typical of general mixtures. Indeed, the FD displays the identifiability and likelihood behavior proper to common (non-mixture) models. Moreover, thanks to a novel non random initialization based on the special FD mixture structure, an efficient and sound estimation procedure can be devised which suitably combines EM-types algorithms. Reliable complete-data likelihood-based estimators for standard errors can be provided as well.

Scheda breve

Scheda completa

Scheda completa (DC)

	Sottotipologia
	
				Articolo in rivista - Articolo scientifico
			
	Parole chiave
	
				Dirichlet mixture; EM type algorithms; Identifiability; Multimodality; Simplex distribution;
			
	Parole chiave
	
				Simplex distribution; Dirichlet mixture; Identifiability; Multimodality; EM type algorithms
			
	Lingua del contenuto
	
				English
			
	Data ahead of print o Data prima pubblicazione Online
	
				2016
			
	Data di pubblicazione
	
				2017
			
	Rivista
	
				STATISTICS AND COMPUTING
			
	Numero del volume
	
				27
			
	Fascicolo
	
				4
			
	Pagina iniziale
	
				963
			
	Pagina finale
	
				983
			
	DOI dell'articolo
	
				https://dx.doi.org/10.1007/s11222-016-9665-y
			
	Fulltext
	
				reserved
			
	Citazione
	
				Migliorati, S., Ongaro, A., Monti, G. (2017). A structured Dirichlet mixture model for compositional data: inferential and applicative issues. STATISTICS AND COMPUTING, 27(4), 963-983 [10.1007/s11222-016-9665-y].
			
	Appare nelle tipologie:
	
				01 - Articolo su rivista

File in questo prodotto:

File	Dimensione	Formato
Migliorati2017_Article_AStructuredDirichletMixtureMod.pdf Solo gestori archivio Tipologia di allegato: Publisher’s Version (Version of Record, VoR) Dimensione 1.12 MB Formato Adobe PDF Visualizza/Apri Richiedi una copia	1.12 MB	Adobe PDF	Visualizza/Apri Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/111309

Citazioni

17

10

Social impact