Author: Mall, Raghvendra; Elbasir, Abdurrahman; Almeer, Hossam; Islam, Zeyaul; Kolatkar, Prasanna R; Chawla, Sanjay; Ullah, Ehsan
                    Title: A Modelling Framework for Embedding-based Predictions for Compound-Viral Protein Activity  Cord-id: 5eazkoou  Document date: 2021_2_26
                    ID: 5eazkoou
                    
                    Snippet: MOTIVATION: A global effort is underway to identify compounds for the treatment of COVID-19. Since de novo compound design is an extremely long, time-consuming, and expensive process, efforts are underway to discover existing compounds that can be repurposed for COVID-19 and new viral diseases. MODEL: We propose a machine learning representation framework that uses deep learning induced vector embeddings of compounds and viral proteins as features to predict compound-viral protein activity. The 
                    
                    
                    
                     
                    
                    
                    
                    
                        
                            
                                Document: MOTIVATION: A global effort is underway to identify compounds for the treatment of COVID-19. Since de novo compound design is an extremely long, time-consuming, and expensive process, efforts are underway to discover existing compounds that can be repurposed for COVID-19 and new viral diseases. MODEL: We propose a machine learning representation framework that uses deep learning induced vector embeddings of compounds and viral proteins as features to predict compound-viral protein activity. The prediction model in-turn uses a consensus framework to rank approved compounds against viral proteins of interest. RESULTS: Our consensus framework achieves a highmean Pearson correlation of 0.916, mean R2 of 0.840 and a low mean squared error of 0.313 for the task of compound-viral protein activity prediction on an independent test set. As a use case, we identify a ranked list of 47 compounds common to three main proteins of SARS-COV-2 virus (PL-PRO, 3CL-PRO and Spike protein) as potential targets including 21 antivirals, 15 anticancer, 5 antibiotics and 6 other investigationalhuman compounds.We performadditional molecular docking simulations to demonstrate thatmajority of these compounds have low binding energies and thus high binding affinity with the potential to be effective against the SARS-COV-2 virus. AVAILABILITY: All the source code and data is available at: https://github.com/raghvendra5688/Drug-Repurposing and https://dx.doi.org/10.17632/8rrwnbcgmx.3. We also implemented a web-server at: https://machinelearning-protein.qcri.org/index.html. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
 
  Search related documents: 
                                Co phrase  search for related documents- low dimensional and machine learning: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12
- low dimensional representation and machine learning: 1, 2
- low dimensional space and lstm encoder: 1
- low dimensional space and machine learning: 1
- lstm encoder and machine learning: 1, 2
 
                                Co phrase  search for related documents, hyperlinks ordered by date