We consider predictive inference using a class of temporally dependent Dirichlet processes driven by Fleming–Viot diffusions, which have a natural bear- ing in Bayesian nonparametrics and lend the resulting family of random probabil- ity measures to analytical posterior analysis. Formulating the implied statistical model as a hidden Markov model, we fully describe the predictive distribution in- duced by these Fleming–Viot-driven dependent Dirichlet processes, for a sequence of observations collected at a certain time given another set of draws collected at several previous times. This is identified as a mixture of P ́olya urns, whereby the observations can be values from the baseline distribution or copies of previous draws collected at the same time as in the usual P ́olya urn, or can be sampled from a random subset of the data collected at previous times. We characterize the time-dependent weights of the mixture which select such subsets and discuss the asymptotic regimes. We describe the induced partition by means of a Chinese restaurant process metaphor with a conveyor belt, whereby new customers who do not sit at an occupied table open a new table by picking a dish either from the baseline distribution or from a time-varying offer available on the conveyor belt. We lay out explicit algorithms for exact and approximate posterior sampling of both observations and partitions, and illustrate our results on predictive problems with synthetic and real data.

Predictive inference with Fleming–Viot-driven dependent Dirichlet processes

Ruggiero, Matteo
2021-01-01

Abstract

We consider predictive inference using a class of temporally dependent Dirichlet processes driven by Fleming–Viot diffusions, which have a natural bear- ing in Bayesian nonparametrics and lend the resulting family of random probabil- ity measures to analytical posterior analysis. Formulating the implied statistical model as a hidden Markov model, we fully describe the predictive distribution in- duced by these Fleming–Viot-driven dependent Dirichlet processes, for a sequence of observations collected at a certain time given another set of draws collected at several previous times. This is identified as a mixture of P ́olya urns, whereby the observations can be values from the baseline distribution or copies of previous draws collected at the same time as in the usual P ́olya urn, or can be sampled from a random subset of the data collected at previous times. We characterize the time-dependent weights of the mixture which select such subsets and discuss the asymptotic regimes. We describe the induced partition by means of a Chinese restaurant process metaphor with a conveyor belt, whereby new customers who do not sit at an occupied table open a new table by picking a dish either from the baseline distribution or from a time-varying offer available on the conveyor belt. We lay out explicit algorithms for exact and approximate posterior sampling of both observations and partitions, and illustrate our results on predictive problems with synthetic and real data.
2021
16
2
371
395
https://projecteuclid.org/euclid.ba/1588125765
Ascolani, Filippo; Lijoi, Antonio; Ruggiero, Matteo
File in questo prodotto:
File Dimensione Formato  
euclid.ba.1588125765.pdf

Accesso riservato

Tipo di file: PDF EDITORIALE
Dimensione 2.88 MB
Formato Adobe PDF
2.88 MB Adobe PDF   Visualizza/Apri   Richiedi una copia
2001.09868.pdf

Accesso riservato

Tipo di file: POSTPRINT (VERSIONE FINALE DELL’AUTORE)
Dimensione 537.64 kB
Formato Adobe PDF
537.64 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2318/1766429
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 11
  • ???jsp.display-item.citation.isi??? 14
social impact