ELOPER: Elongation of paired-end reads as a pre-processing tool for improved de novo genome assembly

David H. Silver, Shay Ben-Elazar, Alexei Bogoslavsky, Itai Yanai*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Motivation: Paired-end sequencing resulting in gapped short reads is commonly used for de novo genome assembly. Assembly methods use paired-end sequences in a two-step process, first treating each read-end independently, only later invoking the pairing to join the contiguous assemblies (contigs) into gapped scaffolds. Here, we present ELOPER, a pre-processing tool for pair-end sequences that produces a better read library for assembly programs.Results: ELOPER proceeds by simultaneously considering both ends of paired reads generating elongated reads. We show that ELOPER theoretically doubles read-lengths while halving the number of reads. We provide evidence that pre-processing read libraries using ELOPER leads to considerably improved assemblies as predicted from the Lander-Waterman model.

Original languageEnglish
Pages (from-to)1455-1457
Number of pages3
JournalBioinformatics
Volume29
Issue number11
DOIs
StatePublished - 1 Jun 2013
Externally publishedYes

Fingerprint

Dive into the research topics of 'ELOPER: Elongation of paired-end reads as a pre-processing tool for improved de novo genome assembly'. Together they form a unique fingerprint.

Cite this