Targeted Assembly of Short Sequence Reads

Warren, Rene; Holt, Robert

doi:10.1371/journal.pone.0019816

Resource type

Article

Date created

2011

Authors/Contributors

Author: Warren, Rene

Author: Holt, Robert

Abstract

As next-generation sequence (NGS) production continues to increase, analysis is becoming a significant bottleneck. However, in situations where information is required only for specific sequence variants, it is not necessary to assemble or align whole genome data sets in their entirety. Rather, NGS data sets can be mined for the presence of sequence variants of interest by localized assembly, which is a faster, easier, and more accurate approach. We present TASR, a streamlined assembler that interrogates very large NGS data sets for the presence of specific variants by only considering reads within the sequence space of input target sequences provided by the user. The NGS data set is searched for reads with an exact match to all possible short words within the target sequence, and these reads are then assembled stringently to generate a consensus of the target and flanking sequence. Typically, variants of a particular locus are provided as different target sequences, and the presence of the variant in the data set being interrogated is revealed by a successful assembly outcome. However, TASR can also be used to find unknown sequences that flank a given target. We demonstrate that TASR has utility in finding or confirming genomic mutations, polymorphisms, fusions and integration events. Targeted assembly is a powerful method for interrogating large data sets for the presence of sequence variants of interest. TASR is a fast, flexible and easy to use tool for targeted assembly.

Published as

Warren RL, Holt RA (2011) Targeted Assembly of Short Sequence Reads. PLoS ONE 6(5): e19816. doi:10.1371/journal.pone.0019816

Publication details

Publication title

PLoS ONE

Document title

Targeted Assembly of Short Sequence Reads

Date

2011

Volume

6

Issue

5

Publisher DOI

10.1371/journal.pone.0019816

Copyright statement

Copyright is held by the author(s).

Permissions

You are free to copy, distribute and transmit this work under the following conditions: You must give attribution to the work (but not in any way that suggests that the author endorses you or your use of the work); You may not use this work for commercial purposes.

Scholarly level

Faculty/Staff

Peer reviewed?

Yes

Language

English

Member of collection

Molecular Biology and Biochemistry

Download file	Size
journal.pone_.0019816.pdf	237.08 KB

Targeted Assembly of Short Sequence Reads

Views & downloads - as of June 2023