Public procurement procedures are often characterised by complex workflows, which can pose challenges in terms of transparency, compliance, and operational efficiency. Automated process analysis is particularly important for understanding the complexity of temporal events, as it facilitates the understanding of the contents of procurement procedures by citizens and helps to increase trust in public authorities. This study explores process mining techniques applied to legal data with the aim of understanding how these procedures take place. By reconstructing event logs from the official dataset, we analyse the actual sequence of activities involved in the tendering process. First, we demonstrate how Large Language Models can be leveraged to extract relevant information from legal texts, thus enriching the event log with additional case-specific details, including dates and financial data. Second, we introduce a prediction task that estimates the overall duration of a tender process based on its initial activities. Through this combined approach, we investigate whether integrating textual insights enhances the accuracy of predictive models. The analysis is grounded in a real-world dataset. It offers evidence of how the integration of Natural Language Processing and process-oriented methods can support informed decision-making.

Legal process mining for the analysis of public procurement workflows

Nai, Roberto;Sulis, Emilio;Audrito, Davide;Trifiletti, Vittoria Margherita Sofia;Di Caro, Luigi;Genga, Laura
2026-01-01

Abstract

Public procurement procedures are often characterised by complex workflows, which can pose challenges in terms of transparency, compliance, and operational efficiency. Automated process analysis is particularly important for understanding the complexity of temporal events, as it facilitates the understanding of the contents of procurement procedures by citizens and helps to increase trust in public authorities. This study explores process mining techniques applied to legal data with the aim of understanding how these procedures take place. By reconstructing event logs from the official dataset, we analyse the actual sequence of activities involved in the tendering process. First, we demonstrate how Large Language Models can be leveraged to extract relevant information from legal texts, thus enriching the event log with additional case-specific details, including dates and financial data. Second, we introduce a prediction task that estimates the overall duration of a tender process based on its initial activities. Through this combined approach, we investigate whether integrating textual insights enhances the accuracy of predictive models. The analysis is grounded in a real-world dataset. It offers evidence of how the integration of Natural Language Processing and process-oriented methods can support informed decision-making.
2026
61
1
13
Event log enrichment; Legal informatics; Predictive process monitoring; Process mining; Public procurement
Nai, Roberto; Sulis, Emilio; Audrito, Davide; Trifiletti, Vittoria Margherita Sofia; Di Caro, Luigi; Genga, Laura
File in questo prodotto:
File Dimensione Formato  
1-s2.0-S2212473X26000246-main.pdf

Accesso aperto

Tipo di file: PDF EDITORIALE
Dimensione 2.55 MB
Formato Adobe PDF
2.55 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2318/2130852
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
social impact