We propose a new model for cluster analysis in a Bayesian nonparametric framework. Our model combines two ingredients, species sampling mixture models of Gaussian distributions on one hand, and a deterministic clustering procedure (DBSCAN) on the other. Here, two observations from the underlying species sampling mixture model share the same cluster if the distance between the densities corresponding to their latent parameters is smaller than a threshold; this yields a random partition which is coarser than the one induced by the species sampling mixture. Since this procedure depends on the value of the threshold, we suggest a strategy to fix it. In addition, we discuss implementation and applications of the model; comparison with more standard clustering algorithms will be given as well. Supplementary materials for the article are available online. © 2014, © 2014 American Statistical Association, Institute of Mathematical Statistics, and Interface Foundation of North America.

A “Density-Based” Algorithm for Cluster Analysis Using Species Sampling Gaussian Mixture Models

ARGIENTO, Raffaele;
2014-01-01

Abstract

We propose a new model for cluster analysis in a Bayesian nonparametric framework. Our model combines two ingredients, species sampling mixture models of Gaussian distributions on one hand, and a deterministic clustering procedure (DBSCAN) on the other. Here, two observations from the underlying species sampling mixture model share the same cluster if the distance between the densities corresponding to their latent parameters is smaller than a threshold; this yields a random partition which is coarser than the one induced by the species sampling mixture. Since this procedure depends on the value of the threshold, we suggest a strategy to fix it. In addition, we discuss implementation and applications of the model; comparison with more standard clustering algorithms will be given as well. Supplementary materials for the article are available online. © 2014, © 2014 American Statistical Association, Institute of Mathematical Statistics, and Interface Foundation of North America.
2014
23
1126
1142
https://www.scopus.com/inward/record.uri?eid=2-s2.0-84908061167&doi=10.1080%2f10618600.2013.856796&partnerID=40&md5=58cac680a5f40d4b04e51e37ad727714
Argiento, R.; Cremaschi, A.; Guglielmi, A.
File in questo prodotto:
File Dimensione Formato  
12-3_JCGS_4aperto.pdf

Accesso aperto

Tipo di file: PREPRINT (PRIMA BOZZA)
Dimensione 1.55 MB
Formato Adobe PDF
1.55 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2318/1635083
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 17
  • ???jsp.display-item.citation.isi??? 17
social impact