Mouse E14 embryonic stem cells (ESCs) are a well-characterized and widespread used ESC line, often employed for genome-wide studies involving next generation sequencing analysis. More than 2×109 sequences made on Illumina platform derived from the genome of E14 ESCs were used to build a database of about 2.7×106 single nucleotide variants (SNVs). The identified variants are enriched in intergenic regions, but several thousands reside in gene exons and regulatory regions, such as promoters, enhancers, splicing sites and untranslated regions of RNA, thus indicating high probability of an important functional impact on the molecular biology of these cells. We created a new E14 genome assembly reference that increases the number of mapped reads of about 5%. We performed a Reduced Representation Bisulfite Sequencing on E14 ESCs and we obtained an increase of about 120,000 called CpGs and avoided about 20,000 wrong CpG calls with respect to the mm9 genome reference. © 2014 Elsevier Inc.
High-throughput single nucleotide variant discovery in E14 mouse embryonic stem cells provides a new reference genome assembly
Krepelova A.;Neri F.
2014-01-01
Abstract
Mouse E14 embryonic stem cells (ESCs) are a well-characterized and widespread used ESC line, often employed for genome-wide studies involving next generation sequencing analysis. More than 2×109 sequences made on Illumina platform derived from the genome of E14 ESCs were used to build a database of about 2.7×106 single nucleotide variants (SNVs). The identified variants are enriched in intergenic regions, but several thousands reside in gene exons and regulatory regions, such as promoters, enhancers, splicing sites and untranslated regions of RNA, thus indicating high probability of an important functional impact on the molecular biology of these cells. We created a new E14 genome assembly reference that increases the number of mapped reads of about 5%. We performed a Reduced Representation Bisulfite Sequencing on E14 ESCs and we obtained an increase of about 120,000 called CpGs and avoided about 20,000 wrong CpG calls with respect to the mm9 genome reference. © 2014 Elsevier Inc.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.