Completely random measures (CRMs) provide a broad class of priors, arguably, the most popular, for Bayesian nonparametric (BNP) analysis of trait allocations. As a peculiar property, CRM priors lead to predictive distributions that share the following structure: for fixed prior’s parameters, a new data point exhibits a Poisson (random) number of “new” traits, i.e., traits not appearing in the sample, which depends on the sampling information only through the sample size. While such a Poisson posterior distribution is appealing for analytical tractability and for ease of interpretation, its independence from the sampling information is a critical drawback, as it makes the posterior distribution of “new” traits completely determined by the estimation of the unknown prior’s parameters. In this paper, we introduce the class of transform-scaled process (T-SP) priors as a tool to enrich the posterior distribution of “new” traits arising from CRM priors, while maintaining the same analytical tractability and ease of interpretation. In particular, we present a framework for posterior analysis of trait allocations under T-SP priors, showing that Stable T-SP priors, i.e., T-SP priors built from Stable CRMs, lead to predictive distributions such that, for fixed prior’s parameters, a new data point displays a negative-Binomial (random) number of “new” traits, which depends on the sampling information through the number of distinct traits and the sample size. Then, by relying on a hierarchical version of T-SP priors, we extend our analysis to the more general setting of trait allocations with multiple groups of data or subpopulations. The empirical effectiveness of our method is demonstrated through numerical experiments and applications to real data.

Transform-scaled process priors for trait allocations in Bayesian nonparametrics

Beraha, Mario
;
Favaro, Stefano
2025-01-01

Abstract

Completely random measures (CRMs) provide a broad class of priors, arguably, the most popular, for Bayesian nonparametric (BNP) analysis of trait allocations. As a peculiar property, CRM priors lead to predictive distributions that share the following structure: for fixed prior’s parameters, a new data point exhibits a Poisson (random) number of “new” traits, i.e., traits not appearing in the sample, which depends on the sampling information only through the sample size. While such a Poisson posterior distribution is appealing for analytical tractability and for ease of interpretation, its independence from the sampling information is a critical drawback, as it makes the posterior distribution of “new” traits completely determined by the estimation of the unknown prior’s parameters. In this paper, we introduce the class of transform-scaled process (T-SP) priors as a tool to enrich the posterior distribution of “new” traits arising from CRM priors, while maintaining the same analytical tractability and ease of interpretation. In particular, we present a framework for posterior analysis of trait allocations under T-SP priors, showing that Stable T-SP priors, i.e., T-SP priors built from Stable CRMs, lead to predictive distributions such that, for fixed prior’s parameters, a new data point displays a negative-Binomial (random) number of “new” traits, which depends on the sampling information through the number of distinct traits and the sample size. Then, by relying on a hierarchical version of T-SP priors, we extend our analysis to the more general setting of trait allocations with multiple groups of data or subpopulations. The empirical effectiveness of our method is demonstrated through numerical experiments and applications to real data.
2025
19
2
3532
3563
Bayesian nonparametrics; completely random measure; feature al-location; hierarchical scaled process prior; posterior analysis; predictive distribution; scaled process prior; trait allocation
Beraha, Mario; Favaro, Stefano
File in questo prodotto:
File Dimensione Formato  
document.pdf

Accesso aperto

Dimensione 635.16 kB
Formato Adobe PDF
635.16 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2318/2137059
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
social impact