Prediction and functional interpretation of inter-chromosomal genome architecture from DNA sequence with TwinC

Bertero, Alessandro
2026-01-01

Abstract

Three-dimensional nuclear DNA architecture comprises well-studied intra-chromosomal (cis) folding and less characterized inter-chromosomal (trans) interfaces. Current predictive models of 3D genome folding overlook trans-genome organization. We present TwinC, an interpretable convolutional neural network model that reliably predicts trans contacts measurable through proximity ligation-dependent (in situ and intact Hi-C) and proximity ligation-independent (DNA SPRITE) genome-wide chromatin conformation assays. TwinC achieves high predictive accuracy (AUROC=0.80) on a cross-chromosomal test set from in situ and intact Hi-C experiments in heart tissue. Furthermore, we train TwinC using in situ Hi-C data from the widely used GM12878 cell line and validate its performance with an orthogonal DNA SPRITE assay in the same cell type. Mechanistically, the neural network learns the importance of compartments, chromatin accessibility, clustered transcription factor binding, and G-quadruplexes in forming trans contacts. In summary, TwinC models and interprets trans genome architecture, illuminating this poorly understood aspect of gene regulation.
2026
1
29
Jha, Anupama; Hristov, Borislav; Wang, Xiao; Wang, Sheng; Greenleaf, William J.; Kundaje, Anshul; Aiden, Erez Lieberman; Bertero, Alessandro; Noble, W...
Files in this item:

File: s41467-026-72031-5_reference.pdf
Access: Open access
Description: Unedited final manuscript from publisher site
File type: Postprint (author's final version)
Size: 9.19 MB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/2318/2136090