CINECA IRIS Institutional Research Information System

The popularity of social bookmarking sites has made them prime targets for spammers. Many of these systems require an administrator's time and energy to manually filter or remove spam. Here we discuss the motivations of social spam, and present a study of automatic detection of spammers in a social tagging system. We identify and analyze six distinct features that address various properties of social spam, finding that each of these features provides for a helpful signal to discriminate spammers from legitimate users. These features are then used in various machine learning algorithms for classification, achieving over 98% accuracy in detecting social spammers with 2% false positives. These promising results provide a new baseline for future efforts on social spam. We make our dataset publicly available to the research community.

Social Spam Detection

B. Markines;CATTUTO C;F. Menczer

2009-01-01

Abstract

The popularity of social bookmarking sites has made them prime targets for spammers. Many of these systems require an administrator's time and energy to manually filter or remove spam. Here we discuss the motivations of social spam, and present a study of automatic detection of spammers in a social tagging system. We identify and analyze six distinct features that address various properties of social spam, finding that each of these features provides for a helpful signal to discriminate spammers from legitimate users. These features are then used in various machine learning algorithms for classification, achieving over 98% accuracy in detecting social spammers with 2% false positives. These promising results provide a new baseline for future efforts on social spam. We make our dataset publicly available to the research community.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2009
			
	Titolo dell'evento
	
				AIRWEB 2009
			
	Luogo dell'evento
	
				Madrid, Spain
			
	Data dell'evento
	
				April 21 - 21, 2009
			
	Titolo del volume
	
				AIRWeb '09 Proceedings of the 5th International Workshop on Adversarial Information Retrieval on the Web
			
	Nome editore
	
				ACM
			
	Pagine (da)
	
				41
			
	Pagine (a)
	
				48
			
	DOI
	
				https://dx.doi.org/10.1145/1531914.1531924
			
	URL del prodotto (archivi open access, fulltext su sito editore, etc.)
	
				https://dl.acm.org/citation.cfm?id=1531924
			
	Tutti gli autori
	
						B. Markines; CATTUTO C; F. Menczer
					
	Appare nelle tipologie:
	
				04A-Conference paper in volume

File in questo prodotto:

File	Dimensione	Formato
p41-markines.pdf Accesso riservato Dimensione 1.1 MB Formato Adobe PDF Visualizza/Apri Richiedi una copia	1.1 MB	Adobe PDF	Visualizza/Apri Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2318/1730882

Citazioni

ND

106

ND

social impact