Abstract
Genome-to-genome comparisons require designating anchor points, which are given by Maximum Exact Matches (MEMs) between their sequences. For large genomes this is a challenging problem and the performance of existing solutions, even in parallel regimes, is not quite satisfactory. Results: We present a new algorithm, copMEM, that allows to sparsely sample both input genomes, with sampling steps being coprime. Despite being a single-threaded implementation, copMEM computes all MEMs of minimum length 100 between the human and mouse genomes in less than 2 minutes, using 7 GB of RAM memory.
Cite
CITATION STYLE
Grabowski, S., & Bieniecki, W. (2019). CopMEM: Finding maximal exact matches via sampling both genomes. Bioinformatics, 35(4), 677–678. https://doi.org/10.1093/bioinformatics/bty670
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.