This paper presents the integration of CompL-it, a Linked Open Data (LOD) computational lexicon for contemporary Italian, into LiITA (Linking Italian), a Knowledge Base (KB) designed for linguistic interoperability. CompL-it contains over 101k lexical entries enriched with detailed morphological and semantic information, derived from multiple authoritative sources and modelled using the OntoLex-Lemon vocabulary. The linking process involved aligning lexical entries with lemmas in the LiITA’s Lemma Bank (LB), addressing both exact and ambiguous matches through systematic and semantically informed strategies. Moreover, 12,739 new lemmas were added to the LiITA LB. This integration enhances the expressiveness and interoperability of LiITA, enabling complex SPARQL queries that exploit the semantic network encoded in CompL-it. Examples are provided to demonstrate the advantages of querying interlinked resources.

Linking CompL-it to the LiITA Knowledge Base

Valerio Basile;Andrea Di Fabio;Eliana Di Palma;
2025-01-01

Abstract

This paper presents the integration of CompL-it, a Linked Open Data (LOD) computational lexicon for contemporary Italian, into LiITA (Linking Italian), a Knowledge Base (KB) designed for linguistic interoperability. CompL-it contains over 101k lexical entries enriched with detailed morphological and semantic information, derived from multiple authoritative sources and modelled using the OntoLex-Lemon vocabulary. The linking process involved aligning lexical entries with lemmas in the LiITA’s Lemma Bank (LB), addressing both exact and ambiguous matches through systematic and semantically informed strategies. Moreover, 12,739 new lemmas were added to the LiITA LB. This integration enhances the expressiveness and interoperability of LiITA, enabling complex SPARQL queries that exploit the semantic network encoded in CompL-it. Examples are provided to demonstrate the advantages of querying interlinked resources.
2025
Eleventh Italian Conference on Computational Linguistics (CLiC-it 2025)
Cagliari
24 - 26/09/2026
Proceedings of the Eleventh Italian Conference on Computational Linguistics (CLiC-it 2025)
CEUR Workshop Proceedings
595
602
979-12-243-0587-3
https://aclanthology.org/2025.clicit-1.57/
Eleonora Litta, Marco Passarotti, Giovanni Moretti, Paolo Brasolin, Francesco Mambrini, Valerio Basile, Andrea Di Fabio, Eliana Di Palma, Emiliano Gio...espandi
File in questo prodotto:
File Dimensione Formato  
2025.clicit-1.57.pdf

Accesso aperto

Tipo di file: PDF EDITORIALE
Dimensione 1.2 MB
Formato Adobe PDF
1.2 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2318/2121955
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact