Author: Tipirneni, Sindhu; Reddy, Chandan K.
Title: Self-supervised Transformer for Multivariate Clinical Time-Series with Missing Values Cord-id: xa8pvrzc Document date: 2021_7_29
ID: xa8pvrzc
Snippet: Multivariate time-series (MVTS) data are frequently observed in critical care settings and are typically characterized by excessive missingness and irregular time intervals. Existing approaches for learning representations in this domain handle such issues by either aggregation or imputation of values, which in-turn suppresses the fine-grained information and adds undesirable noise/overhead into the machine learning model. To tackle this challenge, we propose STraTS (Self-supervised Transformer
Document: Multivariate time-series (MVTS) data are frequently observed in critical care settings and are typically characterized by excessive missingness and irregular time intervals. Existing approaches for learning representations in this domain handle such issues by either aggregation or imputation of values, which in-turn suppresses the fine-grained information and adds undesirable noise/overhead into the machine learning model. To tackle this challenge, we propose STraTS (Self-supervised Transformer for TimeSeries) model which bypasses these pitfalls by treating time-series as a set of observation triplets instead of using the traditional dense matrix representation. It employs a novel Continuous Value Embedding (CVE) technique to encode continuous time and variable values without the need for discretization. It is composed of a Transformer component with Multi-head attention layers which enables it to learn contextual triplet embeddings while avoiding problems of recurrence and vanishing gradients that occur in recurrent architectures. Many healthcare datasets also suffer from the limited availability of labeled data. Our model utilizes self-supervision by leveraging unlabeled data to learn better representations by performing time-series forecasting as a self-supervision task. Experiments on real-world multivariate clinical time-series benchmark datasets show that STraTS shows better prediction performance than state-of-the-art methods for mortality prediction, especially when labeled data is limited. Finally, we also present an interpretable version of STraTS which can identify important measurements in the time-series data.
Search related documents:
Co phrase search for related documents- adam optimizer and long short term: 1, 2, 3
- adam optimizer and long short term memory: 1, 2, 3
- adam optimizer and loss function: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10
- additional feature and low proportion: 1
- long short and loss function: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11
- long short and low memory: 1, 2
- long short and low proportion: 1, 2
- long short term and loss function: 1, 2, 3, 4, 5, 6, 7, 8, 9
- long short term and low proportion: 1
- long short term memory and loss function: 1, 2, 3, 4
Co phrase search for related documents, hyperlinks ordered by date