Missing regions in short-read assemblies of prokaryote genomes are often attributed to biases in sequencing technologies and to repetitive elements, the former resulting in low sequencing coverage of certain loci and the latter to unresolved loops in the de novo assembly graph. We developed SASpector, a command-line tool that compares short-read assemblies (draft genomes) to their corresponding closed assemblies and extracts missing regions to analyze them at the sequence and functional level. SASpector allows to benchmark the need for resolved genomes, can be integrated into pipelines to control the quality of assemblies, and could be used for comparative investigations of missingness in assemblies for which both short-read and long-read data are available in the public databases.
CITATION STYLE
Lood, C., Correa Rojo, A., Sinar, D., Verkinderen, E., Lavigne, R., & Noort, V. van. (2022). SASpector: Analysis of missing genomic regions in draft genomes of prokaryotes. Bioinformatics, 38(10), 2920–2921. https://doi.org/10.1093/bioinformatics/btac208
Mendeley helps you to discover research relevant for your work.