CINECA IRIS Institutional Research Information System

Although EVT has slowly evolved into a versatile and powerful tool for publishing historical sources and literary texts on the basis of the XML/TEI format, none of the modifications-and periodic rewrites-of its code base has significantly changed the way in which edition data is handled. Even today, in fact, the main goal is to visualise and navigate such data, no doubt in a sophisticated way (support for multiple edition levels, support for named entities management, text-image linking), but in any case limited to what we might call “core functionalities” of a DSE. Texts encoded in XML/TEI, on the other hand, are a potential treasure trove of information just waiting to be interrogated and made available to the user. In this article I identify three use cases – the processing of special characters encoded by means of the and elements; the management of named entities, realia and other interesting elements of the text; the use of ontologies within an XML/TEI document – related to literary texts in Old English in order to propose a TEI encoding and a subsequent processing able to put the user in a position to receive answers to complex, transversal queries involving cross-links between different types of elements.

Semi-structured data processing: implementation hypotheses and use cases taken from Old English texts

Rosselli Del Turco Roberto

2021-01-01

Abstract

Although EVT has slowly evolved into a versatile and powerful tool for publishing historical sources and literary texts on the basis of the XML/TEI format, none of the modifications-and periodic rewrites-of its code base has significantly changed the way in which edition data is handled. Even today, in fact, the main goal is to visualise and navigate such data, no doubt in a sophisticated way (support for multiple edition levels, support for named entities management, text-image linking), but in any case limited to what we might call “core functionalities” of a DSE. Texts encoded in XML/TEI, on the other hand, are a potential treasure trove of information just waiting to be interrogated and made available to the user. In this article I identify three use cases – the processing of special characters encoded by means of the and elements; the management of named entities, realia and other interesting elements of the text; the use of ontologies within an XML/TEI document – related to literary texts in Old English in order to propose a TEI encoding and a subsequent processing able to put the user in a position to receive answers to complex, transversal queries involving cross-links between different types of elements.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2021
			
	Titolo rivista
	
				UMANISTICA DIGITALE
			
	N. Volume
	
				10
			
	Fascicolo
	
				1
			
	Pagine (da)
	
				387
			
	Pagine (a)
	
				407
			
	DOI
	
				https://dx.doi.org/10.6092/issn.2532-8816/12598
			
	URL del prodotto (archivi open access, fulltext su sito editore, etc.)
	
				https://umanisticadigitale.unibo.it/article/view/12598
			
	Parole Chiave
	
				Filologia germanica, filologia digitale, edizioni digitali, data processing, XML/TEI, EVT, elaborazione dati, data mining, inglese antico, letteratura anglosassone
			
	Tutti gli autori
	
						Rosselli Del Turco Roberto
					
	Appare nelle tipologie:
	
				03A-Articolo su Rivista

File in questo prodotto:

File	Dimensione	Formato
2021 Rosselli Del Turco - Elaborazione di dati semi-strutturati, casi d'uso tratti da testi in inglese antico.pdf Accesso aperto Descrizione: Articolo completo Tipo di file: PDF EDITORIALE Dimensione 908.3 kB Formato Adobe PDF Visualizza/Apri	908.3 kB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2318/1847142

Citazioni

ND

1

ND

social impact