Motivation: Superpositioning is an essential technique in structural biology that facilitates the comparison and analysis of conformational differences among topologically similar structures. Performing a superposition requires a one-to-one correspondence, or alignment, of the point sets in the different structures. However, in practice, some points are usually 'missing' from several structures, for example, when the alignment contains gaps. Current superposition methods deal with missing data simply by superpositioning a subset of points that are shared among all the structures. This practice is inefficient, as it ignores important data, and it fails to satisfy the common least-squares criterion. In the extreme, disregarding missing positions prohibits the calculation of a superposition altogether.Results: Here, we present a general solution for determining an optimal superposition when some of the data are missing. We use the expectation-maximization algorithm, a classic statistical technique for dealing with incomplete data, to find both maximum-likelihood solutions and the optimal least-squares solution as a special case. © The Author(s) 2012. Published by Oxford University Press.
CITATION STYLE
Theobald, D. L., & Steindel, P. A. (2012). Optimal simultaneous superpositioning of multiple structures with missing data. Bioinformatics, 28(15), 1972–1979. https://doi.org/10.1093/bioinformatics/bts243
Mendeley helps you to discover research relevant for your work.