Comprehensive Evaluation of Error-Correction Methodologies for Genome Sequencing Data
Yun Heo 1, Gowthami Manikandan 1, Anand Ramachandran 1, Deming Chen 1
, editors.
In: Bioinformatics [Internet]. Brisbane (AU): Exon Publications; 2021 Mar 20. Chapter 6.
Affiliations expand
- PMID: 33877761
- Bookshelf ID: NBK569557
- DOI: 10.36255/exonpublications.bioinformatics.2021.ch6
Free Books & Documents
Excerpt
Next generation sequencing (NGS) technologies like Illumina and third generation sequencing (TGS) technologies like PacBio and Oxford Nanopore Technology use different techniques for sequencing and provide reads of different lengths and error profiles. Many tools exist for error correction of such sequencing data, improving the quality of downstream analyses. In this chapter, we evaluate the performance of 23 error-correction tools, providing insight into their strengths and weaknesses. This is accomplished through a set of algorithms we have developed and implemented as SPECTACLE, a Software Package for Error Correction Tool Assessment on nuCLEic acid sequences, and a dataset for NGS and TGS reads that we compiled emphasizing challenging scenarios for error correction tools. This chapter provides the reader an understanding of available tools, including advice on selecting appropriate tools for different circumstances. It also provides insights regarding aspects of sequencing data to be addressed to improve tool accuracy.
Copyright: The Authors.
Sections
基因组测序数据纠错方法的综合评估
- PMID: 33877761
- 书架编号: NBK569557
- DOI: 10.36255/exonpublications.bioinformatics.2021.ch6
摘抄
Illumina 等下一代测序 (NGS) 技术和 PacBio 和 Oxford Nanopore Technology 等第三代测序 (TGS) 技术使用不同的测序技术并提供不同长度和错误特征的读数。许多工具可用于纠正此类测序数据的错误,从而提高下游分析的质量。
在本章中,我们评估了 23 种纠错工具的性能,深入了解它们的优缺点。这是通过我们开发和实施的一组算法来实现的,即 SPECTACLE,一个用于对核酸序列进行纠错工具评估的软件包,以及一个用于 NGS 和 TGS 读数的数据集,我们编译了这些数据集,强调了纠错工具的挑战性场景。本章让读者了解可用工具,包括针对不同情况选择合适工具的建议。它还提供了有关要解决的测序数据方面的见解,以提高工具准确性。
版权所有:作者。