Factorization machines (FMs) are a class of general predictors for sparse data. One major benefit of FMs is their ability to capture the interactions across features when making recommendations. In this paper, we note that the interactions captured by existing FMs generally represent correlations in the data and we argue that such correlations, unless informed by the true causality structures underlying the data, may be spurious and may result in unwanted bias. To tackle this challenge, we propose a Causally-Informed Factorization Machine (CIFM) model that introduces a novel causal injection mechanism. CIFM leverages a priori causal knowledge, described in the form of a causal graph, to boost the representational ability of FMs and achieve better predictions. Specifically, given a (potentially learned) causal graph which describes the causal relationships among features, CIFM distills this structural information into a pairwise causal impact matrix and guides the learning process to ensure that the learned representations capture those relationships that are consistent with the causal relationships. Extensive evaluations of CIFM, along with its integrations with NeuralFM and DeepFM, conducted with synthetic and real-world data sets, demonstrate the effectiveness of causal injection in generating better recommendations.

Causally Informed Factorization Machines

Candan K. S.;Sapino M. L.
2024-01-01

Abstract

Factorization machines (FMs) are a class of general predictors for sparse data. One major benefit of FMs is their ability to capture the interactions across features when making recommendations. In this paper, we note that the interactions captured by existing FMs generally represent correlations in the data and we argue that such correlations, unless informed by the true causality structures underlying the data, may be spurious and may result in unwanted bias. To tackle this challenge, we propose a Causally-Informed Factorization Machine (CIFM) model that introduces a novel causal injection mechanism. CIFM leverages a priori causal knowledge, described in the form of a causal graph, to boost the representational ability of FMs and achieve better predictions. Specifically, given a (potentially learned) causal graph which describes the causal relationships among features, CIFM distills this structural information into a pairwise causal impact matrix and guides the learning process to ensure that the learned representations capture those relationships that are consistent with the causal relationships. Extensive evaluations of CIFM, along with its integrations with NeuralFM and DeepFM, conducted with synthetic and real-world data sets, demonstrate the effectiveness of causal injection in generating better recommendations.
2024
2024 IEEE International Conference on Big Data, BigData 2024
Washington DC
15-18 dicembre 2024
Proceedings of the 2024 IEEE International Conference on Big Data, BigData 2024
IEEE
448
455
https://www.scopus.com/record/display.uri?eid=2-s2.0-85218007178&origin=recordpage
Causality; Factorization Machines
Li M.-L.; Candan K.S.; Sapino M.L.
File in questo prodotto:
File Dimensione Formato  
Causally Informed Factorization Machines.pdf

Accesso riservato

Dimensione 684.01 kB
Formato Adobe PDF
684.01 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2318/2077358
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 1
  • ???jsp.display-item.citation.isi??? ND
social impact