Given a sample of size n from a population of individuals belonging to different species with unknown proportions, a problem of practical interest consists in making inference on the probability D-n(l) that the (n + 1)-th draw coincides with a species with frequency l in the sample, for any 1 = 0, 1,. . . , n. This paper contributes to the methodology of Bayesian nonparametric inference for Dn(l). Specifically, under the general framework of Gibbs-type priors we show how to derive credible intervals for a Bayesian nonparametric estimation of D-n(l), and we investigate the large n asymptotic behaviour of such an estimator. Of particular interest are special cases of our results obtained under the specification of the two parameter Poisson Dirichlet prior and the normalized generalized Gamma prior. With respect for these prior specifications, the proposed results are illustrated through a simulation study and a benchmark Expressed Sequence Tags dataset. To the best our knowledge, this provides the first comparative study between the two parameter Poisson Dirichlet prior and the normalized generalized Gamma prior in the context of Bayesian nonparemetric inference for D-n(l).

Arbel, J., Favaro, S., Nipoti, B., Teh, Y. (2017). Bayesian nonparametric inference for discovery probabilities: Credible intervals and large sample asymptotics. STATISTICA SINICA, 27(2), 839-858 [10.5705/ss.202015.0250].

Bayesian nonparametric inference for discovery probabilities: Credible intervals and large sample asymptotics

Nipoti B.;
2017

Abstract

Given a sample of size n from a population of individuals belonging to different species with unknown proportions, a problem of practical interest consists in making inference on the probability D-n(l) that the (n + 1)-th draw coincides with a species with frequency l in the sample, for any 1 = 0, 1,. . . , n. This paper contributes to the methodology of Bayesian nonparametric inference for Dn(l). Specifically, under the general framework of Gibbs-type priors we show how to derive credible intervals for a Bayesian nonparametric estimation of D-n(l), and we investigate the large n asymptotic behaviour of such an estimator. Of particular interest are special cases of our results obtained under the specification of the two parameter Poisson Dirichlet prior and the normalized generalized Gamma prior. With respect for these prior specifications, the proposed results are illustrated through a simulation study and a benchmark Expressed Sequence Tags dataset. To the best our knowledge, this provides the first comparative study between the two parameter Poisson Dirichlet prior and the normalized generalized Gamma prior in the context of Bayesian nonparemetric inference for D-n(l).
Articolo in rivista - Articolo scientifico
Asymptotics; Bayesian nonparametrics; Credible intervals; Discovery probability; Gibbs-type priors; Good-Turing estimator; Normalized generalized Gamma prior; Smoothing technique; Two-parameter Poisson-Dirichlet;
English
2017
27
2
839
858
open
Arbel, J., Favaro, S., Nipoti, B., Teh, Y. (2017). Bayesian nonparametric inference for discovery probabilities: Credible intervals and large sample asymptotics. STATISTICA SINICA, 27(2), 839-858 [10.5705/ss.202015.0250].
File in questo prodotto:
File Dimensione Formato  
2017_SS.pdf

accesso aperto

Tipologia di allegato: Publisher’s Version (Version of Record, VoR)
Dimensione 6.08 MB
Formato Adobe PDF
6.08 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/250013
Citazioni
  • Scopus 7
  • ???jsp.display-item.citation.isi??? 7
Social impact