Background Deep neural networks (DNNs) are promising for analyzing high-dimensional transcriptomic data in cancer research but are limited by data scarcity and heterogeneity. Transfer learning (TL), which leverages large datasets to improve performance on smaller ones, has been widely applied in classification and clustering, yet its use in survival prediction remains underexplored. Methods We evaluated transfer learning for disease-free survival prediction using RNA-seq data from 7509 patients across 27 tumor types from The Cancer Genome Atlas. A pan-cancer DNN was optimized via random search across 750 configurations. Tumor-specific baseline models were trained independently, while transfer learning models were built with a leave-one-out pre-training strategy followed by fine-tuning on the target tumor. Performance was assessed with the concordance index. Biological relevance was evaluated through a LIME-like interpretability approach and gene-set enrichment analysis. Results Transfer learning outperformed tumor-specific baselines in 24 of 27 tumor types, with the greatest improvements in smaller cohorts. Fine-tuning consistently enhanced predictions by adapting pan-cancer features to tumor-specific data. Enrichment analysis confirmed that the models captured both general oncogenic mechanisms and tumor-specific pathways. Conclusions Transfer learning improves survival prediction from transcriptomic data, especially in small cohorts, by exploiting molecular patterns shared across cancers. These findings support transfer learning as a robust strategy for survival modeling and a step toward foundation models in molecular oncology.
Deep learning with limited data: a transfer learning approach for transcriptomic survival prediction
Lombardi, A. M.;
2026-01-01
Abstract
Background Deep neural networks (DNNs) are promising for analyzing high-dimensional transcriptomic data in cancer research but are limited by data scarcity and heterogeneity. Transfer learning (TL), which leverages large datasets to improve performance on smaller ones, has been widely applied in classification and clustering, yet its use in survival prediction remains underexplored. Methods We evaluated transfer learning for disease-free survival prediction using RNA-seq data from 7509 patients across 27 tumor types from The Cancer Genome Atlas. A pan-cancer DNN was optimized via random search across 750 configurations. Tumor-specific baseline models were trained independently, while transfer learning models were built with a leave-one-out pre-training strategy followed by fine-tuning on the target tumor. Performance was assessed with the concordance index. Biological relevance was evaluated through a LIME-like interpretability approach and gene-set enrichment analysis. Results Transfer learning outperformed tumor-specific baselines in 24 of 27 tumor types, with the greatest improvements in smaller cohorts. Fine-tuning consistently enhanced predictions by adapting pan-cancer features to tumor-specific data. Enrichment analysis confirmed that the models captured both general oncogenic mechanisms and tumor-specific pathways. Conclusions Transfer learning improves survival prediction from transcriptomic data, especially in small cohorts, by exploiting molecular patterns shared across cancers. These findings support transfer learning as a robust strategy for survival modeling and a step toward foundation models in molecular oncology.| File | Dimensione | Formato | |
|---|---|---|---|
|
1-s2.0-S0010482526001812-main (1).pdf
Accesso aperto
Tipo di file:
PDF EDITORIALE
Dimensione
6.6 MB
Formato
Adobe PDF
|
6.6 MB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.



