4.7 Article

Using causal discovery for feature selection in multivariate numerical time series

期刊

MACHINE LEARNING
卷 101, 期 1-3, 页码 377-395

出版社

SPRINGER
DOI: 10.1007/s10994-014-5460-1

关键词

Feature selection; Multivariate time series; Causal discovery; Prediction and regression; Granger causality

资金

  1. SA Water Corporation
  2. SA Water Centre for Water Management and Reuse
  3. Australian Research Council [DP140103617]
  4. National Natural Science Foundation of China [31171456]

向作者/读者索取更多资源

Time series data contains temporal ordering, which makes its feature selection different from the normal feature selection. Feature selection in multivariate time series has two tasks: identifying the relevant features and finding their effective window sizes of lagged values. The methods extended from normal feature selection methods do not solve this two-dimensional feature selection problem since they do not take lagged observations of features into consideration. In this paper, we present a method using the Granger causality discovery to identify causal features with effective sliding window sizes in multivariate numerical time series. The proposed method considers the influence of lagged observations of features on the target time series. We compare our proposed feature selection method with several normal feature selection methods on multivariate time series data using three well-known modeling methods. Our method outperforms other methods for predicting future values of target time series. In a real world case study on water quality monitoring data, we show that the features selected by our method contain four out of five features used by domain experts, and prediction performance on our features is better than that on features of domain experts using three modeling methods.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据