
Author: Bhandari, Bikash K; Gardner, Paul P; Lim, Chun Shen
Title: Solubility-Weighted Index: fast and accurate prediction of protein solubility
  • Cord-id: fhu6gcig
  • Document date: 2020-06-19
    Document:
    MOTIVATION: Recombinant protein production is a widely used technique in the biotechnology and biomedical industries, yet only a quarter of target proteins are soluble and can therefore be purified.
    RESULTS: We have discovered that global structural flexibility, which can be modeled by normalized B-factors, accurately predicts the solubility of 12 216 recombinant proteins expressed in Escherichia coli. We have optimized these B-factors, and derived a new set of values for solubility scoring that further improves prediction accuracy. We call this new predictor the ‘Solubility-Weighted Index’ (SWI). Importantly, SWI outperforms many existing protein solubility prediction tools. Furthermore, we have developed ‘SoDoPE’ (Soluble Domain for Protein Expression), a web interface that allows users to choose a protein region of interest for predicting and maximizing both protein expression and solubility.
    AVAILABILITY AND IMPLEMENTATION: The SoDoPE web server and source code are freely available at https://tisigner.com/sodope and https://github.com/Gardner-BinfLab/TISIGNER-ReactJS, respectively. The code and data for reproducing our analysis can be found at https://github.com/Gardner-BinfLab/SoDoPE_paper_2020.
    SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
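
    To make the idea of a residue-weighted solubility score concrete, the following is a minimal sketch, not the authors' implementation. The abstract does not give the formula, so the sketch assumes the index is the arithmetic mean of per-residue weights over the sequence. The weight table holds placeholder numbers chosen for illustration only; they are not the optimized values from the paper, which are available from the repositories listed above. The function names `weighted_index` and `best_region` are hypothetical, and the window scan is only one simple way to compare candidate regions of a protein.

```python
from statistics import mean

# ASSUMPTION: placeholder per-residue weights for illustration only.
# These are NOT the published SWI values.
PLACEHOLDER_WEIGHTS = {aa: 0.5 for aa in "ACDEFGHIKLMNPQRSTVWY"}
PLACEHOLDER_WEIGHTS.update({"E": 0.9, "K": 0.8, "D": 0.8, "W": 0.2, "C": 0.1})


def weighted_index(sequence, weights=PLACEHOLDER_WEIGHTS):
    """Mean per-residue weight of a protein sequence.

    ASSUMPTION: the index is taken to be an arithmetic mean; residues
    missing from the weight table (e.g. 'X') are skipped.
    """
    residues = [aa for aa in sequence.upper() if aa in weights]
    if not residues:
        raise ValueError("sequence contains no scorable residues")
    return mean(weights[aa] for aa in residues)


def best_region(sequence, length, weights=PLACEHOLDER_WEIGHTS):
    """Return (start, end) of the highest-scoring window of a given length.

    Hypothetical helper: scans every window and keeps the first one with
    the maximal mean weight. Coordinates are 0-based, end-exclusive.
    """
    if not 0 < length <= len(sequence):
        raise ValueError("length must be between 1 and len(sequence)")
    starts = range(len(sequence) - length + 1)
    start = max(
        starts,
        key=lambda s: weighted_index(sequence[s:s + length], weights),
    )
    return start, start + length
```

    With real weights substituted for the placeholders, `weighted_index(seq)` would score a whole protein and `best_region(seq, n)` would point to the n-residue stretch with the highest mean weight. Turning such a score into a solubility probability would need a further calibration step that is not sketched here.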
