Abstract
We describe a sequence assembler, RePS (repeat-masked Phrap with scaffolding), that explicitly identifies exact 20mer repeats from the shotgun data and removes them prior to the assembly. The established software Phrap is used to compute meaningful error probabilities for each base. Clone-end-pairing information is used to construct scaffolds that order and orient the contigs. We show with real data for human and rice that reasonable assemblies are possible even at coverages of only 4× to 6×, despite having up to 42.2% in exact repeats.
Cite
CITATION STYLE
Wang, J., Wong, G. K. S., Ni, P., Han, Y., Huang, X., Zhang, J., … Li, S. (2002). RePS: A sequence assembler that masks exact repeats identified from the shotgun data. Genome Research, 12(5), 824–831. https://doi.org/10.1101/gr.165102
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.