Completely random measures (CRMs) provide a broad class of priors, arguably, the most popular, for Bayesian nonparametric (BNP) analysis of trait allocations. As a peculiar property, CRM priors lead to predictive distributions that share the following structure: for fixed prior’s parameters, a new data point exhibits a Poisson (random) number of “new” traits, i.e., traits not appearing in the sample, which depends on the sampling information only through the sample size. While such a Poisson posterior distribution is appealing for analytical tractability and for ease of interpretation, its independence from the sampling information is a critical drawback, as it makes the posterior distribution of “new” traits completely determined by the estimation of the unknown prior’s parameters. In this paper, we introduce the class of transform-scaled process (T-SP) priors as a tool to enrich the posterior distribution of “new” traits arising from CRM priors, while maintaining the same analytical tractability and ease of interpretation. In particular, we present a framework for posterior analysis of trait allocations under T-SP priors, showing that Stable T-SP priors, i.e., T-SP priors built from Stable CRMs, lead to predictive distributions such that, for fixed prior’s parameters, a new data point displays a negative-Binomial (random) number of “new” traits, which depends on the sampling information through the number of distinct traits and the sample size. Then, by relying on a hierarchical version of T-SP priors, we extend our analysis to the more general setting of trait allocations with multiple groups of data or subpopulations. The empirical effectiveness of our method is demonstrated through numerical experiments and applications to real data.
Transform-scaled process priors for trait allocations in Bayesian nonparametrics
Beraha, Mario
;Favaro, Stefano
2025-01-01
Abstract
Completely random measures (CRMs) provide a broad class of priors, arguably, the most popular, for Bayesian nonparametric (BNP) analysis of trait allocations. As a peculiar property, CRM priors lead to predictive distributions that share the following structure: for fixed prior’s parameters, a new data point exhibits a Poisson (random) number of “new” traits, i.e., traits not appearing in the sample, which depends on the sampling information only through the sample size. While such a Poisson posterior distribution is appealing for analytical tractability and for ease of interpretation, its independence from the sampling information is a critical drawback, as it makes the posterior distribution of “new” traits completely determined by the estimation of the unknown prior’s parameters. In this paper, we introduce the class of transform-scaled process (T-SP) priors as a tool to enrich the posterior distribution of “new” traits arising from CRM priors, while maintaining the same analytical tractability and ease of interpretation. In particular, we present a framework for posterior analysis of trait allocations under T-SP priors, showing that Stable T-SP priors, i.e., T-SP priors built from Stable CRMs, lead to predictive distributions such that, for fixed prior’s parameters, a new data point displays a negative-Binomial (random) number of “new” traits, which depends on the sampling information through the number of distinct traits and the sample size. Then, by relying on a hierarchical version of T-SP priors, we extend our analysis to the more general setting of trait allocations with multiple groups of data or subpopulations. The empirical effectiveness of our method is demonstrated through numerical experiments and applications to real data.| File | Dimensione | Formato | |
|---|---|---|---|
|
document.pdf
Accesso aperto
Dimensione
635.16 kB
Formato
Adobe PDF
|
635.16 kB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.



