國家衛生研究院 NHRI:Item 3990099045/9480

English | 正體中文 | 简体中文 | Items with full text/Total items : 12145/12927 (94%)
Visitors : 855379 Online Users : 1063

RC Version 6.0 © Powered By DSPACE, MIT. Enhanced by NTU Library IR team.

Scope

please add "double quotation mark" for query phrases to get precise results

please goto advance search for comprehansive author search

Adv. Search

Home ‧ Login ‧ Upload ‧ Help ‧ About ‧ Administer

Goto mobile version

國家衛生研究院 NHRI > 群體健康科學研究所 > 廖玉潔 > 期刊論文 > Item 3990099045/9480

Please use this identifier to cite or link to this item: http://ir.nhri.org.tw/handle/3990099045/9480

Title:	Evaluation and validation of assembling corrected PacBio long reads for microbial genome completion via hybrid approaches
Authors:	Lin, HH;Liao, YC
Contributors:	Division of Biostatistics and Bioinformatics
Abstract:	Despite the ever-increasing output of next-generation sequencing data along with developing assemblers, dozens to hundreds of gaps still exist in de novo microbial assemblies due to uneven coverage and large genomic repeats. Third-generation single-molecule, real-time (SMRT) sequencing technology avoids amplification artifacts and generates kilobase-long reads with the potential to complete microbial genome assembly. However, due to the low accuracy (~85%) of third-generation sequences, a considerable amount of long reads (>50X) are required for self-correction and for subsequent de novo assembly. Recently-developed hybrid approaches, using next-generation sequencing data and as few as 5X long reads, have been proposed to improve the completeness of microbial assembly. In this study we have evaluated the contemporary hybrid approaches and demonstrated that assembling corrected long reads (by runCA) produced the best assembly compared to long-read scaffolding (e.g., AHA, Cerulean and SSPACE-LongRead) and gap-filling (SPAdes). For generating corrected long reads, we further examined long-read correction tools, such as ECTools, LSC, LoRDEC, PBcR pipeline and proovread. We have demonstrated that three microbial genomes including Escherichia coli K12 MG1655, Meiothermus ruber DSM1279 and Pdeobacter heparinus DSM2366 were successfully hybrid assembled by runCA into near-perfect assemblies using ECTools-corrected long reads. In addition, we developed a tool, Patch, which implements corrected long reads and pre-assembled contigs as inputs, to enhance microbial genome assemblies. With the additional 20X long reads, short reads of S. cerevisiae W303 were hybrid assembled into 115 contigs using the verified strategy, ECTools + runCA. Patch was subsequently applied to upgrade the assembly to a 35-contig draft genome. Our evaluation of the hybrid approaches shows that assembling the ECTools-corrected long reads via runCA generates near complete microbial genomes, suggesting that genome assembly could benefit from re-analyzing the available hybrid datasets that were not assembled in an optimal fashion.
Date:	2015-12-07
Relation:	PLoS ONE. 2015 Dec 7;10(12):Article number e0144305.
Link to:	http://dx.doi.org/10.1371/journal.pone.0144305
JIF/Ranking 2023:	http://gateway.webofknowledge.com/gateway/Gateway.cgi?GWVersion=2&SrcAuth=NHRI&SrcApp=NHRI_IR&KeyISSN=1932-6203&DestApp=IC2JCR
Cited Times(WOS):	https://www.webofscience.com/wos/woscc/full-record/WOS:000366902700095
Cited Times(Scopus):	http://www.scopus.com/inward/record.url?partnerID=HzOxMe3b&scp=84955502827
Appears in Collections:	[廖玉潔] 期刊論文

Files in This Item:

File	Description	Size	Format
PUB26641475.pdf		380Kb	Adobe PDF	500	View/Open

All items in NHRI are protected by copyright, with all rights reserved.

Related Items in TAIR

DSpace Software Copyright © 2002-2004 MIT & Hewlett-Packard / Enhanced by NTU Library IR team Copyright © - Feedback