We describe a statistical model that uses binomial logistic regression for predicting the solubility of heterologous proteins expressed in E. coli. The model is based on a set of proteins reported to have been expressed in E. coli in either soluble or insoluble form. The 22 parameters used in the final model based on proteins’ amino acid composition are discussed. The overall accuracy of the model developed is 94%. The way to use this model on the website http://www.ou.edu for the prediction of protein solubility is explained.
CITATION STYLE
Harrison, R. G., & Bagajewicz, M. J. (2015). Predicting the solubility of recombinant proteins in Escherichia coli. Methods in Molecular Biology, 1258, 403–408. https://doi.org/10.1007/978-1-4939-2205-5_23
Mendeley helps you to discover research relevant for your work.