The Dirichlet-multinomial is one of the most known compound distributions for multivariate count data. Let X|p ∼ multinomial(n,p) and P ∼ Dirichlet(α), then the marginal distribution of X is the Dirichlet-multinomial distribution. Because of the severe covariance structure imposed by the Dirichlet prior, covariance among distinct elements of X assumes only negative values and this could be unrealistic in some particular scenarios. In the literature there exist several other distributions defined on the simplex: a recent proposal is the Extended Flexible Dirichlet (EFD), a generalization of the Dirichlet with a less strict dependence structure. A new distribution for count data, called EFD-multinomial, can be obtained by compounding the multinomial model with an EFD prior on the parameters P. Due to the covariance structure of the EFD, it allows for positive dependence for some pairs of count categories. Furthermore, thanks to its finite mixture representation, an EM-based estimation procedure can be derived. Some theoretical properties of the EFD-multinomial distribution are shown, and a preliminary simulation study is performed to evaluate the behavior of the EM-based MLE under several scenarios, including positively correlated counts.

Ascari, R., Migliorati, S., Ongaro, A. (2018). A new Dirichlet-multinomial mixture model for count data. In Programme and Abstracts 12th International Conference on Computational and Financial Econometrics (CFE 2018).

A new Dirichlet-multinomial mixture model for count data

Ascari, R
;
Migliorati, S;Ongaro, A
2018

Abstract

The Dirichlet-multinomial is one of the most known compound distributions for multivariate count data. Let X|p ∼ multinomial(n,p) and P ∼ Dirichlet(α), then the marginal distribution of X is the Dirichlet-multinomial distribution. Because of the severe covariance structure imposed by the Dirichlet prior, covariance among distinct elements of X assumes only negative values and this could be unrealistic in some particular scenarios. In the literature there exist several other distributions defined on the simplex: a recent proposal is the Extended Flexible Dirichlet (EFD), a generalization of the Dirichlet with a less strict dependence structure. A new distribution for count data, called EFD-multinomial, can be obtained by compounding the multinomial model with an EFD prior on the parameters P. Due to the covariance structure of the EFD, it allows for positive dependence for some pairs of count categories. Furthermore, thanks to its finite mixture representation, an EM-based estimation procedure can be derived. Some theoretical properties of the EFD-multinomial distribution are shown, and a preliminary simulation study is performed to evaluate the behavior of the EM-based MLE under several scenarios, including positively correlated counts.
abstract + slide
Compositional Data, Mixture model
English
International Conference of the ERCIM (European Research Consortium for Informatics and Mathematics) Working Group on Computational and Methodological Statistics (CMStatistics 2018)
2018
Programme and Abstracts 12th International Conference on Computational and Financial Econometrics (CFE 2018)
978-9963-2227-5-9
2018
reserved
Ascari, R., Migliorati, S., Ongaro, A. (2018). A new Dirichlet-multinomial mixture model for count data. In Programme and Abstracts 12th International Conference on Computational and Financial Econometrics (CFE 2018).
File in questo prodotto:
File Dimensione Formato  
BoACFECMStatistics2018.pdf

Solo gestori archivio

Tipologia di allegato: Other attachments
Dimensione 4.26 MB
Formato Adobe PDF
4.26 MB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/212674
Citazioni
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
Social impact