Commonsense knowledge for the collection of ground truth data on semantic descriptors

LOMBARDO, Vincenzo;DAMIANO, Rossana
2012-01-01

Abstract

Efforts to bridge the semantic gap in video indexing and retrieval have steadily expanded the vocabulary of high-level features, or semantic descriptors, sometimes organized in lightweight, corpus-specific computational ontologies. This paper presents a computer-supported manual annotation method that relies on very large-scale, shared, commonsense ontologies for the selection of semantic descriptors. The ontological terms are accessed through a linguistic interface that relies on multilingual dictionaries and action/event template structures (or frames). The manual generation or checking of annotations provides ground truth data for evaluation purposes and training data for knowledge acquisition. The novelty of the approach lies in the use of widely shared, large-scale ontologies, which prevent arbitrariness in annotation and favor interoperability. We test the viability of the approach through user studies on the annotation of narrative videos.
14th IEEE International Symposium on Multimedia, ISM 2012
Irvine, CA, USA
2012
Proceedings - 2012 IEEE International Symposium on Multimedia, ISM 2012
IEEE
pp. 78-83
9780769548753
http://ieeexplore.ieee.org/document/6424635/
Concept ontology; Linguistic interface; Video annotation; Computer Graphics and Computer-Aided Design; 1707; Software
Lombardo, Vincenzo; Damiano, Rossana
Files in this item:
ISM12_4aperto.pdf
Access: open access
File type: postprint (author's final version)
Size: 742.87 kB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/2318/1619171