The problem of Haplotype Assembly is an essential step in human genome analysis. It is typically formalised as the Minimum Error Correction (MEC) problem which is NP-hard. MEC has been approached using heuristics, integer linear programming, and fixed-parameter tractability (FPT), including approaches whose runtime is exponential in the length of the DNA fragments obtained by the sequencing process. Technological improvements are currently increasing fragment length, which drastically elevates computational costs for such methods. We present pWhatsHap, a multi-core parallelisation of WhatsHap, a recent FPT optimal approach to MEC. WhatsHap moves complexity from fragment length to fragment overlap and is hence of particular interest when considering sequencing technology's current trends. pWhatsHap further improves the efficiency in solving the MEC problem, as shown by experiments performed on datasets with high coverage.

High-Performance Haplotype Assembly

ALDINUCCI, MARCO;Andrea Bracciali;
2015-01-01

Abstract

The problem of Haplotype Assembly is an essential step in human genome analysis. It is typically formalised as the Minimum Error Correction (MEC) problem which is NP-hard. MEC has been approached using heuristics, integer linear programming, and fixed-parameter tractability (FPT), including approaches whose runtime is exponential in the length of the DNA fragments obtained by the sequencing process. Technological improvements are currently increasing fragment length, which drastically elevates computational costs for such methods. We present pWhatsHap, a multi-core parallelisation of WhatsHap, a recent FPT optimal approach to MEC. WhatsHap moves complexity from fragment length to fragment overlap and is hence of particular interest when considering sequencing technology's current trends. pWhatsHap further improves the efficiency in solving the MEC problem, as shown by experiments performed on datasets with high coverage.
2015
Intl. meeting on Computational Intelligence methods for Bioinformatics and Biostatistics (CIBB 2014)
Cambridge, UK
June 26-28 2014
LNBI post conference proceedings of the Eleventh International Meeting on Computational Intelligence Methods for Bioinformatics and Biostatistics CIBB 2014
Springer-Verlag
1
13
Marco Aldinucci; Andrea Bracciali; Tobias Marschall; Murray Patterson; Nadia Pisanti; Massimo Torquati
File in questo prodotto:
File Dimensione Formato  
2014_pHaplo_cibb.pdf

Accesso aperto

Tipo di file: POSTPRINT (VERSIONE FINALE DELL’AUTORE)
Dimensione 854.58 kB
Formato Adobe PDF
854.58 kB Adobe PDF Visualizza/Apri
2015_CIBB_haplotype.pdf

Accesso riservato

Tipo di file: PDF EDITORIALE
Dimensione 574.72 kB
Formato Adobe PDF
574.72 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2318/1523292
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 3
  • ???jsp.display-item.citation.isi??? 1
social impact