Abstract
Background: Error Correction is an important task in the analysis and manipulations of NGS data. The purpose of error correction is to facilitate data analysis for large projects like de novo assembly project. Here we present a new hybrid algorithm for error correction of long reads using short reads. Our algorithm can be flexibly adapted to different types of errors. Next, we make a de novo assembly for corrected long reads.
Objective: We present MiRCA (MinIon Reads Correction Algorithm) a hybrid approach based on the sequences alignments that detects and corrects errors for MinIon long reads using Illumina short reads.
Methods: In our approach, we operate in four steps. First, we make a Quality Control and Cleaning data. Second, we use the contig forming for the Pre-Error Correction Step. Third, we use the alignment to align pre-assembled contig to long reads and we use this alignment to correction erroneous long reads. Finally, we do an assembly for the corrected long reads.
Results: The results of mapping of S.cerevisaeW303 and E.coli genomes shows that our error correction approach produce a high quality long reads with mapping rate ~99% to the reference genome in reasonable time. For denovo assembly, the corrected long reads gives good assembly in a short running time compared to other error correction tools.
Conclusion: MiRCA is a new hybrid approach that detects and corrects errors. It uses an alignmentbased approach using pre-assembled short reads as a reference to correct nanopore long reads. The experimental evaluation of the corrected long reads on the reference genome of S. cerevisae and E.coli shows that MiRCA ensures best error correction compared to existing related works.
Keywords: Error Correction, de novo assembly, long reads, short reads, minion, illumina, NGS, algorithm.
Graphical Abstract
Current Bioinformatics
Title:An Error Correction and DeNovo Assembly Approach for Nanopore Reads Using Short Reads
Volume: 13 Issue: 3
Author(s): Mehdi Kchouk and Mourad Elloumi*
Affiliation:
- Laboratory of Technologies of Information and Communication and Electrical Engineering National Superior School of Engineers of Tunis, University of Tunis,Tunisia
Keywords: Error Correction, de novo assembly, long reads, short reads, minion, illumina, NGS, algorithm.
Abstract: Background: Error Correction is an important task in the analysis and manipulations of NGS data. The purpose of error correction is to facilitate data analysis for large projects like de novo assembly project. Here we present a new hybrid algorithm for error correction of long reads using short reads. Our algorithm can be flexibly adapted to different types of errors. Next, we make a de novo assembly for corrected long reads.
Objective: We present MiRCA (MinIon Reads Correction Algorithm) a hybrid approach based on the sequences alignments that detects and corrects errors for MinIon long reads using Illumina short reads.
Methods: In our approach, we operate in four steps. First, we make a Quality Control and Cleaning data. Second, we use the contig forming for the Pre-Error Correction Step. Third, we use the alignment to align pre-assembled contig to long reads and we use this alignment to correction erroneous long reads. Finally, we do an assembly for the corrected long reads.
Results: The results of mapping of S.cerevisaeW303 and E.coli genomes shows that our error correction approach produce a high quality long reads with mapping rate ~99% to the reference genome in reasonable time. For denovo assembly, the corrected long reads gives good assembly in a short running time compared to other error correction tools.
Conclusion: MiRCA is a new hybrid approach that detects and corrects errors. It uses an alignmentbased approach using pre-assembled short reads as a reference to correct nanopore long reads. The experimental evaluation of the corrected long reads on the reference genome of S. cerevisae and E.coli shows that MiRCA ensures best error correction compared to existing related works.
Export Options
About this article
Cite this article as:
Kchouk Mehdi and Elloumi Mourad *, An Error Correction and DeNovo Assembly Approach for Nanopore Reads Using Short Reads, Current Bioinformatics 2018; 13 (3) . https://dx.doi.org/10.2174/1574893612666170530073736
DOI https://dx.doi.org/10.2174/1574893612666170530073736 |
Print ISSN 1574-8936 |
Publisher Name Bentham Science Publisher |
Online ISSN 2212-392X |

- Author Guidelines
- Bentham Author Support Services (BASS)
- Graphical Abstracts
- Fabricating and Stating False Information
- Research Misconduct
- Post Publication Discussions and Corrections
- Publishing Ethics and Rectitude
- Increase Visibility of Your Article
- Archiving Policies
- Peer Review Workflow
- Order Your Article Before Print
- Promote Your Article
- Manuscript Transfer Facility
- Editorial Policies
- Allegations from Whistleblowers