A New Approach for the Discovery of Frequent Itemsets

MEO, Rosa
1999-01-01

Abstract

The discovery of the most recurrent association rules in a large database of sales transactions requires identifying the sets of items bought together by a sufficiently large population of customers. This is a critical task, since the number of generated itemsets grows exponentially with the total number of items. Most algorithms start by identifying the sets with the lowest cardinality and then increase the cardinality progressively. Our approach is different, since the sets to be considered at a time are determined by the items in the sets. The main advantage is a significant reduction of the CPU time required to update data structures in main memory. This paper presents an algorithm that requires only one pass over the database, scales linearly with the dimensions of the database and, as shown by the experiments, performs better than other classical algorithms.
1999
First International Conference on Data Warehouse and Knowledge Discovery
Firenze, Italy
SEPTEMBER
DataWarehousing and Knowledge Discovery
Springer
1676
-
193
202
9783540664581
http://portal.acm.org/toc.cfm?id=646108&type=proceeding&coll=GUIDE&dl=GUIDE&CFID=6552899&CFTOKEN=18484246
R. MEO
Files in this item:
DAWAK99.pdf

Open access since 02/03/2003

File type: publisher's PDF
Size: 878.24 kB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/2318/18593