There is a growing attention, in the research communities of political economics, onto the potential of text analytics in classifying documents with economic content. This interest extends the data analytics approach that has been the traditional base for economic theory with scientific perspective. To devise a general method for prediction applicability, we identify some phases of a methodology and perform tests on a large well-structured repository of resource contracts containing documents related to resources. The majority of these contracts involve mining resources. In this paper we prove that, by the usage of text analytics measures, we can cluster these documents on three indicators: fairness of the contract content, transparency of the document themselves, and applicability of the clauses of the contract intended to guarantee execution on an international basis. We achieve these results, consistent with a gold-standard test obtained with human experts, using text similarity based on the basic notions of bag of words, the index tf-idf, and three distinct cut-off measures.

Text Analytics Can Predict Contract Fairness, Transparency and Applicability

Baronchelli, Adelaide;
2021-01-01

Abstract

There is a growing attention, in the research communities of political economics, onto the potential of text analytics in classifying documents with economic content. This interest extends the data analytics approach that has been the traditional base for economic theory with scientific perspective. To devise a general method for prediction applicability, we identify some phases of a methodology and perform tests on a large well-structured repository of resource contracts containing documents related to resources. The majority of these contracts involve mining resources. In this paper we prove that, by the usage of text analytics measures, we can cluster these documents on three indicators: fairness of the contract content, transparency of the document themselves, and applicability of the clauses of the contract intended to guarantee execution on an international basis. We achieve these results, consistent with a gold-standard test obtained with human experts, using text similarity based on the basic notions of bag of words, the index tf-idf, and three distinct cut-off measures.
2021
17th International Conference on Web Information Systems and Technologies
On-line streaming
26-27 Ottobre
Proceedings of the 17th International Conference on Web Information Systems and Technologies
SCITEPRESS
316
323
978-989-758-536-4
http://hdl.handle.net/10072/412940
Text Analytics; Web Analytics; Web Repository; Economic Analysis
Assolini, Nicola; Baronchelli, Adelaide; Cristani, Matteo; Pasetto, Luca; Olivieri, Francesco; Ricciuti, Roberto; Tomazzoli, Claudio
File in questo prodotto:
File Dimensione Formato  
Baronchelli_webist_2025.pdf

Accesso aperto

Tipo di file: PDF EDITORIALE
Dimensione 658.66 kB
Formato Adobe PDF
658.66 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2318/2102110
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? 0
social impact