Abstract
Motivation: Structural variants (SVs) play an important role in genetic research and precision medicine. Because existing SV detection methods usually produce a substantial number of false positive calls, approaches to filter the detection results are needed.
Results: We developed a novel deep learning-based SV filtering tool, CSV-Filter, for both short and long reads. CSV-Filter uses a novel multi-level grayscale image encoding method based on the CIGAR strings of the alignment results and employs image augmentation techniques to improve SV feature extraction. CSV-Filter also applies transfer learning, using self-supervised learning networks as classification models, and employs mixed-precision operations to accelerate training. Experiments showed that integrating CSV-Filter with popular SV detection tools considerably reduced false positive SVs for short and long reads while leaving true positive SVs almost unchanged. Compared with DeepSVFilter, an SV filtering tool for short reads, CSV-Filter recognized more false positive calls and additionally supports long reads.
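To make the idea of encoding CIGAR strings as grayscale pixels more concrete, the following is a minimal sketch, not the CSV-Filter implementation. The gray levels in `GRAY`, the helper names `parse_cigar` and `encode_read`, and the choice to mark insertions and soft clips at a single boundary pixel are all assumptions made for illustration; the abstract does not specify the actual encoding.

```python
import re

# One CIGAR element: a length followed by an operation code (SAM format).
CIGAR_RE = re.compile(r"(\d+)([MIDNSHP=X])")

# Hypothetical gray levels, chosen only for illustration (not from the paper).
GRAY = {"M": 64, "=": 64, "X": 64, "D": 128, "I": 192, "S": 255}

# Operations that advance the position on the reference.
REF_CONSUMING = set("MDN=X")


def parse_cigar(cigar):
    """Split a CIGAR string into (length, op) pairs."""
    return [(int(n), op) for n, op in CIGAR_RE.findall(cigar)]


def encode_read(pos, cigar, win_start, win_len):
    """Encode one alignment as a row of gray values over a reference window.

    pos is the reference position of the first aligned base; the window
    covers [win_start, win_start + win_len). Pixels not covered stay 0.
    """
    row = [0] * win_len
    ref = pos
    for length, op in parse_cigar(cigar):
        level = GRAY.get(op)
        if op in REF_CONSUMING:
            if level is not None:
                for p in range(ref, ref + length):
                    idx = p - win_start
                    if 0 <= idx < win_len:
                        # Keep the brightest level so boundary marks survive.
                        row[idx] = max(row[idx], level)
            ref += length
        elif level is not None:
            # I and S consume no reference: mark one pixel at the boundary.
            idx = ref - win_start
            if 0 <= idx < win_len:
                row[idx] = max(row[idx], level)
    return row


# Stacking one row per read gives a small 2D grayscale image.
reads = [(100, "3M2D2M1I2M"), (101, "2S6M")]
image = [encode_read(pos, cigar, 100, 10) for pos, cigar in reads]
```

Each row here corresponds to one read and each column to one reference position, so deletions appear as horizontal runs and insertions or clips as isolated bright pixels. A real encoder would also need to handle strand, mapping quality, and image resizing, which this sketch omits.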
Cite
Xia, Z., Xiang, W., Wang, Q., Li, X., Li, Y., Gao, J., … Cui, Y. (2024). CSV-Filter: a deep learning-based comprehensive structural variant filtering method for both short and long reads. Bioinformatics, 40(9). https://doi.org/10.1093/bioinformatics/btae539