A Pipeline Supporting a Smart Access to Historical Documents based on a Rich Semantic Representation of Their Content: A Case Study on Time Expressions

Anna Goy; Diego Magro
2018-01-01

Abstract

This work is part of two ongoing projects whose main goal is to demonstrate how semantic technologies can support effective access to historical archives. In this paper we present a full pipeline, from raw texts to the final user interface, aimed at creating and exploiting rich semantic representations of document content. The pipeline is structured in three modules, handling information extraction, semantic representation, and queries, and lets external applications access, and thus reuse, the output of each module by providing a tagged text, a SPARQL endpoint, and a RESTful web service. We describe a proof-of-concept implementation of the pipeline architecture that focuses on time expressions. We also present an example application that exploits the pipeline to let users access historical documents by searching and browsing events and time specifications, thus demonstrating the effectiveness of access to historical texts based on a rich semantic representation of their content.
2018
14th International Conference on Web Information Systems and Technologies (WEBIST)
Seville (Spain)
18-20 September 2018
Proceedings of the 14th International Conference on Web Information Systems and Technologies
SciTePress - Science and Technology Publications
Volume 1
Pages 199-206
978-989-758-324-7
http://www.scitepress.org/PublicationsDetail.aspx?ID=t5Hj+mfcW2Q=&t=1
Semantic Web, Web-based Intelligent Systems, Ontology, Web Services, Digital Humanities
Alessandro Baldo, Anna Goy, Diego Magro
Files in this item:

webist2018_paper_from_proceedings.pdf
Restricted access
Description: article
File type: publisher's PDF
Size: 388.02 kB
Format: Adobe PDF

WEBIST2018_xIRIS_PostPrint.pdf
Open access
File type: postprint (author's final version)
Size: 266.53 kB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/2318/1686035