Author: Mertsch, Darvin; Stanke, Mario
Title: End-to-end Learning of Evolutionary Models to Find Coding Regions in Genome Alignments Cord-id: 840ws59e Document date: 2021_3_10
ID: 840ws59e
Snippet: Motivation The comparison of genomes using models of molecular evolution is a powerful approach for finding or towards understanding functional elements. In particular, comparative genomics is a fundamental building brick in building high-quality, complete and consistent annotations of ever larger sets of alignable genomes. Results We here present our new program ClaMSA that classifies multiple sequence alignments using a phylogenetic model. It uses a novel continuous-time Markov chain machine l
Document: Motivation The comparison of genomes using models of molecular evolution is a powerful approach for finding or towards understanding functional elements. In particular, comparative genomics is a fundamental building brick in building high-quality, complete and consistent annotations of ever larger sets of alignable genomes. Results We here present our new program ClaMSA that classifies multiple sequence alignments using a phylogenetic model. It uses a novel continuous-time Markov chain machine learning layer, named CTMC, that is learned end-to-end together with (recurrent) neural networks for a learning task. We trained ClaMSA discriminately to classify aligned codon sequences that are candidates of coding regions into coding or non-coding and obtained six times fewer false positives for this task on vertebrate and fly alignments than existing methods at the same true positive rate. ClaMSA and the CTMC layer are general tools that could be used for other machine learning tasks on tree-related sequence data. Availability Freely from https://github.com/Gaius-Augustus/clamsa.
Search related documents:
Co phrase search for related documents- accuracy advantage and lstm model: 1
- accuracy advantage and lstm model short term memory: 1
- actual negative and logistic regression: 1
Co phrase search for related documents, hyperlinks ordered by date