CINECA IRIS Institutional Research Information System

Instance-based classifiers that compute similarity between instances suffer from the presence of noise in the training set and from overfitting. In this paper we propose a new type of distancebased classifier that instead of computing distances between instances computes the distance between each test instance and the classes. Both the test instance and the classes are represented by patterns in the space of the frequent itemsets. We ranked the itemsets by metrics of itemset significance. Then we considered only the top portion of the ranking that leads the classifier to reach the maximum accuracy. We have experimented on a large collection of datasets from UCI archive with different proximity measures and different metrics of itemsets ranking. We show that our method has many benefits: it reduces the number of distance computations, improves the classification accuracy of state-of-the art classifiers, like decision trees, SVM, knn, Naive Bayes, rule-based classifiers and association rule-based ones and outperforms the competitors especially on noise data.

A Novel Distance-Based Classifier Built on Pattern Ranking

D. BACHAR;MEO, Rosa

2009-01-01

Abstract

Instance-based classifiers that compute similarity between instances suffer from the presence of noise in the training set and from overfitting. In this paper we propose a new type of distancebased classifier that instead of computing distances between instances computes the distance between each test instance and the classes. Both the test instance and the classes are represented by patterns in the space of the frequent itemsets. We ranked the itemsets by metrics of itemset significance. Then we considered only the top portion of the ranking that leads the classifier to reach the maximum accuracy. We have experimented on a large collection of datasets from UCI archive with different proximity measures and different metrics of itemsets ranking. We show that our method has many benefits: it reduces the number of distance computations, improves the classification accuracy of state-of-the art classifiers, like decision trees, SVM, knn, Naive Bayes, rule-based classifiers and association rule-based ones and outperforms the competitors especially on noise data.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2009
			
	Titolo dell'evento
	
				24th ACM Symposium on Applied Computing
			
	Luogo dell'evento
	
				Honolulu, Hawaii, USA
			
	Data dell'evento
	
				March, 2009
			
	Titolo del volume
	
				Proceedings of 24th ACM Symposium on Applied Computing
			
	Nome editore
	
				ACM
			
	N. Volume
	
				3
			
	Pagine (da)
	
				1427
			
	Pagine (a)
	
				1432
			
	Codice ISBN
	
				9781605581668
			
	URL del prodotto (archivi open access, fulltext su sito editore, etc.)
	
				http://www.acm.org/conferences/sac/sac2009/
			
	Parole Chiave
	
				instance-base learning; frequent itemsets; pattern ranking; associative classifiers
			
	Tutti gli autori
	
						D. BACHAR; R. MEO
					
	Appare nelle tipologie:
	
				04A-Conference paper in volume

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2318/50654

Citazioni

ND

1

ND

social impact