Background: Biological processes are based on complex networks of cells and molecules. Single cell multi‐omics is a new tool aiming to provide new incites in the complex network of events controlling the functionality of the cell. Methods: Since single cell technologies provide many sample measurements, they are the ideal environment for the application of Deep Learning and Machine Learning approaches. An autoencoder is composed of an encoder and a decoder sub-model. An autoencoder is a very powerful tool in data compression and noise removal. However, the decoder model remains a black box from which is impossible to depict the contribution of the single input elements. We have recently developed a new class of autoencoders, called Sparsely Connected Autoencoders (SCA), which have the advantage of providing a controlled association among the input layer and the decoder module. This new architecture has the benefit that the decoder model is not a black box anymore and can be used to depict new biologically interesting features from single cell data. Results: Here, we show that SCA hidden layer can grab new information usually hidden in single cell data, like providing clustering on meta‐features difficult, i.e. transcription factors expression, or not technically not possible, i.e. miRNA expression, to depict in single cell RNAseq data. Furthermore, SCA representation of cell clusters has the advantage of simulating a conventional bulk RNAseq, which is a data transformation allowing the identification of similarity among independent experiments. Conclusions: In our opinion, SCA represents the bioinformatics version of a universal “Swiss‐knife” for the extraction of hidden knowledgeable features from single cell omics data.

Sparsely connected autoencoders: A multi‐purpose tool for single cell omics analysis

Alessandri L.;Contaldo S. G.;Beccuti M.;Cordero F.;Arigoni M.;Calogero R. A.
2021-01-01

Abstract

Background: Biological processes are based on complex networks of cells and molecules. Single cell multi‐omics is a new tool aiming to provide new incites in the complex network of events controlling the functionality of the cell. Methods: Since single cell technologies provide many sample measurements, they are the ideal environment for the application of Deep Learning and Machine Learning approaches. An autoencoder is composed of an encoder and a decoder sub-model. An autoencoder is a very powerful tool in data compression and noise removal. However, the decoder model remains a black box from which is impossible to depict the contribution of the single input elements. We have recently developed a new class of autoencoders, called Sparsely Connected Autoencoders (SCA), which have the advantage of providing a controlled association among the input layer and the decoder module. This new architecture has the benefit that the decoder model is not a black box anymore and can be used to depict new biologically interesting features from single cell data. Results: Here, we show that SCA hidden layer can grab new information usually hidden in single cell data, like providing clustering on meta‐features difficult, i.e. transcription factors expression, or not technically not possible, i.e. miRNA expression, to depict in single cell RNAseq data. Furthermore, SCA representation of cell clusters has the advantage of simulating a conventional bulk RNAseq, which is a data transformation allowing the identification of similarity among independent experiments. Conclusions: In our opinion, SCA represents the bioinformatics version of a universal “Swiss‐knife” for the extraction of hidden knowledgeable features from single cell omics data.
2021
22
23
12755
12766
Gene regulatory network; MiRNA; Pseudo‐bulk data; Single cell ATACseq; Single cell RNAseq; Sparsely connected autoencoders; Transcription factor; Adenocarcinoma of Lung; Computational Biology; Humans; Lung Neoplasms; Single-Cell Analysis; Whole Exome Sequencing; Cluster Analysis; Machine Learning; Neural Networks, Computer
Alessandri L.; Ratto M.L.; Contaldo S.G.; Beccuti M.; Cordero F.; Arigoni M.; Calogero R.A.
File in questo prodotto:
File Dimensione Formato  
Sparsely Connected Autoencoders A Multi-Purpose Tool for Single Cell omics Analysis - ijms-22-12755.pdf

Accesso aperto

Tipo di file: PDF EDITORIALE
Dimensione 23.47 MB
Formato Adobe PDF
23.47 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2318/1829644
Citazioni
  • ???jsp.display-item.citation.pmc??? 1
  • Scopus 11
  • ???jsp.display-item.citation.isi??? 5
social impact