In this paper we investigate a recently introduced class of nonparametric priors, termed generalized Dirichlet process priors. Such priors induce (exchangeable random) partitions which are characterized by a more elaborate clustering structure than those arising from other widely used priors. A natural area of application of these random probability measures is represented by species sampling problems and, in particular, prediction problems in genomics. To this end we study both the distribution of the number of distinct species present in a sample and the distribution of the number of new species conditionally on an observed sample. We also provide the Bayesian nonparametric estimator for the number of new species in an additional sample of given size and for the discovery probability as function of the size of the additional sample. Finally, the study of its conditional structure is completed by the determination of the posterior distribution.

On a class of random probability measures with general predictive structure

FAVARO, STEFANO;PRUENSTER, Igor;
2011-01-01

Abstract

In this paper we investigate a recently introduced class of nonparametric priors, termed generalized Dirichlet process priors. Such priors induce (exchangeable random) partitions which are characterized by a more elaborate clustering structure than those arising from other widely used priors. A natural area of application of these random probability measures is represented by species sampling problems and, in particular, prediction problems in genomics. To this end we study both the distribution of the number of distinct species present in a sample and the distribution of the number of new species conditionally on an observed sample. We also provide the Bayesian nonparametric estimator for the number of new species in an additional sample of given size and for the discovery probability as function of the size of the additional sample. Finally, the study of its conditional structure is completed by the determination of the posterior distribution.
2011
38
359
376
http://www3.interscience.wiley.com/cgi-bin/fulltext/123474966/HTMLSTART
Bayesian Nonparametrics; Dirichlet process; Exchangeable random partitions; Generalized gamma process; Lauricella hypergeometric function; Species sampling models
S. Favaro; I. Pruenster; S.G. Walker
File in questo prodotto:
File Dimensione Formato  
FPW_dirich.pdf

Open Access dal 28/05/2011

Tipo di file: POSTPRINT (VERSIONE FINALE DELL’AUTORE)
Dimensione 251.69 kB
Formato Adobe PDF
251.69 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2318/73181
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 10
  • ???jsp.display-item.citation.isi??? 10
social impact