CINECA IRIS Institutional Research Information System

Genetic Programming (GP) has the potential to generate intrinsically explainable models. Despite that, in practice, this potential is not fully achieved because the solutions usually grow too much during the evolution. The excessive growth together with the functional and structural complexity of the solutions increase the computational cost and the risk of overfitting. Thus, many approaches have been developed to prevent the solutions to grow excessively in GP. However, it is still an open question how these approaches can be used for improving the interpretability of the models. This article presents an empirical study of eight structural complexity metrics that have been used as evaluation criteria in multi-objective optimisation. Tree depth, size, visitation length, number of unique features, a proxy for human interpretability, number of operators, number of non-linear operators and number of consecutive nonlinear operators were tested. The results show that potentially the best approach for generating good interpretable GP models is to use the combination of more than one structural complexity metric.

A Comparison of Structural Complexity Metrics for Explainable Genetic Programming

Karina Brotto Rebuli;Mario Giacobini;Sara Silva;Leonardo Vanneschi

2023-01-01

Abstract

Genetic Programming (GP) has the potential to generate intrinsically explainable models. Despite that, in practice, this potential is not fully achieved because the solutions usually grow too much during the evolution. The excessive growth together with the functional and structural complexity of the solutions increase the computational cost and the risk of overfitting. Thus, many approaches have been developed to prevent the solutions to grow excessively in GP. However, it is still an open question how these approaches can be used for improving the interpretability of the models. This article presents an empirical study of eight structural complexity metrics that have been used as evaluation criteria in multi-objective optimisation. Tree depth, size, visitation length, number of unique features, a proxy for human interpretability, number of operators, number of non-linear operators and number of consecutive nonlinear operators were tested. The results show that potentially the best approach for generating good interpretable GP models is to use the combination of more than one structural complexity metric.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2023
			
	Titolo dell'evento
	
				Genetic and Evolutionary Computation Conference
			
	Luogo dell'evento
	
				Lisbon, Portugal
			
	Data dell'evento
	
				15/7/2023
			
	Titolo del volume
	
				PROCEEDINGS OF THE 2023 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE COMPANION, GECCO 2023 COMPANION
			
	Nome editore
	
				ASSOC COMPUTING MACHINERY
			
	Pagine (da)
	
				539
			
	Pagine (a)
	
				542
			
	DOI
	
				https://dx.doi.org/10.1145/3583133.3590595
			
	URL del prodotto (archivi open access, fulltext su sito editore, etc.)
	
				https://dl.acm.org/doi/abs/10.1145/3583133.3590595
			
	Parole Chiave
	
				explainable AI, interpretable models, complexity metrics
			
	Tutti gli autori
	
						Karina Brotto Rebuli, Mario Giacobini, Sara Silva, Leonardo Vanneschi
					
	Appare nelle tipologie:
	
				04A-Conference paper in volume

File in questo prodotto:

File	Dimensione	Formato
3583133.3590595.pdf Accesso aperto Descrizione: Rebuli_et_al_2023 Tipo di file: PDF EDITORIALE Dimensione 489.98 kB Formato Adobe PDF Visualizza/Apri	489.98 kB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2318/1947375

Citazioni

ND

3

0

social impact