Mouse E14 embryonic stem cells (ESCs) are a well-characterized and widespread used ESC line, often employed for genome-wide studies involving next generation sequencing analysis. More than 2×109 sequences made on Illumina platform derived from the genome of E14 ESCs were used to build a database of about 2.7×106 single nucleotide variants (SNVs). The identified variants are enriched in intergenic regions, but several thousands reside in gene exons and regulatory regions, such as promoters, enhancers, splicing sites and untranslated regions of RNA, thus indicating high probability of an important functional impact on the molecular biology of these cells. We created a new E14 genome assembly reference that increases the number of mapped reads of about 5%. We performed a Reduced Representation Bisulfite Sequencing on E14 ESCs and we obtained an increase of about 120,000 called CpGs and avoided about 20,000 wrong CpG calls with respect to the mm9 genome reference. © 2014 Elsevier Inc.

High-throughput single nucleotide variant discovery in E14 mouse embryonic stem cells provides a new reference genome assembly

Krepelova A.;Neri F.
2014-01-01

Abstract

Mouse E14 embryonic stem cells (ESCs) are a well-characterized and widespread used ESC line, often employed for genome-wide studies involving next generation sequencing analysis. More than 2×109 sequences made on Illumina platform derived from the genome of E14 ESCs were used to build a database of about 2.7×106 single nucleotide variants (SNVs). The identified variants are enriched in intergenic regions, but several thousands reside in gene exons and regulatory regions, such as promoters, enhancers, splicing sites and untranslated regions of RNA, thus indicating high probability of an important functional impact on the molecular biology of these cells. We created a new E14 genome assembly reference that increases the number of mapped reads of about 5%. We performed a Reduced Representation Bisulfite Sequencing on E14 ESCs and we obtained an increase of about 120,000 called CpGs and avoided about 20,000 wrong CpG calls with respect to the mm9 genome reference. © 2014 Elsevier Inc.
2014
104
2
121
127
E14; ESC; Genome reference; Genotyping; Sequencing; SNV; Animals; Cell Line; DNA, Intergenic; Databases, Factual; Embryonic Stem Cells; Genomics; High-Throughput Nucleotide Sequencing; Mice; Regulatory Sequences, Nucleic Acid; Reproducibility of Results; Sequence Analysis, DNA; Genome; Polymorphism, Single Nucleotide
Incarnato D.; Krepelova A.; Neri F.
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2318/1765454
Citazioni
  • ???jsp.display-item.citation.pmc??? 7
  • Scopus 10
  • ???jsp.display-item.citation.isi??? 10
social impact