Author: Bhandari, Bikash K.; Gardner, Paul P.; Lim, Chun Shen
Title: Solubility-Weighted Index: fast and accurate prediction of protein solubility
Cord-id: 2rpr7aph
Document date: 2020_3_26
ID: 2rpr7aph
Document: Motivation: Recombinant protein production is a widely used technique in the biotechnology and biomedical industries, yet only a quarter of target proteins are soluble and can therefore be purified. Results: We have discovered that global structural flexibility, which can be modeled by normalised B-factors, accurately predicts the solubility of 12,216 recombinant proteins expressed in Escherichia coli. We have optimised B-factors, and derived a new set of values for solubility scoring that further improves prediction accuracy. We call this new predictor the ‘Solubility-Weighted Index’ (SWI). Importantly, SWI outperforms many existing protein solubility prediction tools. Furthermore, we have developed ‘SoDoPE’ (Soluble Domain for Protein Expression), a web interface that allows users to choose a protein region of interest for predicting and maximising both protein expression and solubility. Availability: The SoDoPE web server and source code are freely available at https://tisigner.com/sodope and https://github.com/Gardner-BinfLab/TISIGNER-ReactJS, respectively. The code and data for reproducing our analysis can be found at https://github.com/Gardner-BinfLab/SoDoPE_paper2020.
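The abstract describes scoring solubility from a set of per-residue values derived from normalised B-factors, but it does not state the scoring formula. As a minimal sketch, assuming the score is the arithmetic mean of per-residue weights over the sequence, the idea might look like the following. The function name `mean_residue_score` and the weights in `EXAMPLE_WEIGHTS` are hypothetical placeholders, not the values derived by the authors.

```python
from typing import Mapping

# Hypothetical per-residue weights, for illustration only.
# These are NOT the optimised values reported in the paper.
EXAMPLE_WEIGHTS = {"A": 0.75, "G": 0.5, "K": 1.0, "L": 0.25}


def mean_residue_score(sequence: str, weights: Mapping[str, float]) -> float:
    """Return the average per-residue weight of a protein sequence.

    Assumption: the sequence-level score is a plain arithmetic mean of
    the weights of its residues; the abstract does not confirm this form.
    """
    seq = sequence.strip().upper()
    if not seq:
        raise ValueError("sequence is empty")
    unknown = set(seq) - set(weights)
    if unknown:
        raise ValueError(f"no weight for residues: {sorted(unknown)}")
    return sum(weights[aa] for aa in seq) / len(seq)
```

Under this assumption, a region of interest could be scored by passing the corresponding subsequence, which is in the spirit of the region selection that SoDoPE is described as offering. For the actual weights and scoring code, see the repositories listed under Availability.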