Species sampling problems have a long history in ecological and biological studies and a number of statistical issues, including the evaluation of species richness, are still to be addressed. In this paper, motivated by Bayesian nonparametric inference for species sampling problems, we consider the practically important and technically challenging issue of developing a comprehensive posterior analysis of the so-called rare variants, namely those species with frequency less than or equal to a given abundance threshold. In particular, by adopting a Gibbs-type prior, we provide an explicit expression for the posterior joint distribution of the frequency counts of the rare variants, and we investigate some of its statistical properties. The proposed results are illustrated by means of two novel applications to a benchmark genomic dataset.
Titolo: | Posterior analysis of rare variants in Gibbs-type species sampling models |
Autori Riconosciuti: | |
Autori: | O. Cesari; S. Favaro; B. Nipoti |
Data di pubblicazione: | 2014 |
Abstract: | Species sampling problems have a long history in ecological and biological studies and a number of statistical issues, including the evaluation of species richness, are still to be addressed. In this paper, motivated by Bayesian nonparametric inference for species sampling problems, we consider the practically important and technically challenging issue of developing a comprehensive posterior analysis of the so-called rare variants, namely those species with frequency less than or equal to a given abundance threshold. In particular, by adopting a Gibbs-type prior, we provide an explicit expression for the posterior joint distribution of the frequency counts of the rare variants, and we investigate some of its statistical properties. The proposed results are illustrated by means of two novel applications to a benchmark genomic dataset. |
Volume: | 131 |
Pagina iniziale: | 79 |
Pagina finale: | 98 |
Digital Object Identifier (DOI): | 10.1016/j.jmva.2014.06.017 |
URL: | http://www.sciencedirect.com/science/article/pii/S0047259X1400147X |
Parole Chiave: | Bayesian nonparametric inference, Asymptotic credible intervals, Exchangeable random partition, Gibbs-type random probability measure, Index of diversity, Sampling formula, Species sampling problem, Rare variant, Two parameter Poisson–Dirichlet process |
Rivista: | JOURNAL OF MULTIVARIATE ANALYSIS |
Appare nelle tipologie: | 03A-Articolo su Rivista |
File in questo prodotto:
File | Descrizione | Tipologia | Licenza | |
---|---|---|---|---|
JMA_CFN.pdf | 1 Ver. finale autore | Open Access Visualizza/Apri |