Segment and Combine Approach for Non-parametric Time-Series Classification

This paper presents a novel, generic, scalable, autonomous, and flexible supervised learning algorithm for the classification of multi-variate and variable length time series. The essential ingredients of the algorithm are randomization, segmentation of time-series, decision tree ensemble based learning of subseries classifiers, combination of subseries classification by voting, and cross-validation based temporal resolution adaptation. Experiments are carried out with this method on 10 synthetic and real-world datasets. They highlight the good behavior of the algorithm on a large diversity of problems. Our results are also highly competitive with existing approaches from the literature.

[1]  Pierre Geurts,et al.  Segment and Combine Approach for Biological Sequence Classification , 2005, 2005 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology.

[2]  Carlos J. Alonso,et al.  Boosting Interval-Based Literals: Variable Length and Early Classification , 2002 .

[3]  Shigeki Sagayama,et al.  Dynamic Time-Alignment Kernel in Support Vector Machine , 2001, NIPS.

[4]  Einoshin Suzuki,et al.  Decision-tree Induction from Time-series Data Based on a Standard-example Split Test , 2003, ICML.

[5]  Mohammed Waleed Kadous,et al.  Learning Comprehensible Descriptions of Multivariate Time Series , 1999, ICML.

[6]  Daniel P. Siewiorek,et al.  Generalized feature extraction for structural pattern recognition in time-series data , 2001 .

[7]  Claude Sammut,et al.  Classification of Multivariate Time Series and Structured Data Using Constructive Induction , 2005, Machine Learning.

[8]  Pierre Geurts,et al.  Pattern Extraction for Time Series Classification , 2001, PKDD.

[9]  Jan Komorowski,et al.  Principles of Data Mining and Knowledge Discovery , 2001, Lecture Notes in Computer Science.

[10]  Mineichi Kudo,et al.  Multidimensional curve classification using passing-through regions , 1999, Pattern Recognit. Lett..

[11]  Katharina Morik,et al.  Automatic Feature Extraction for Classifying Audio Data , 2005, Machine Learning.

[12]  Pierre Geurts,et al.  Contributions to decision tree induction: bias/variance tradeoff and time series classification , 2002 .

[13]  Abraham Kandel,et al.  Data Mining in Time Series Database , 2004 .

[14]  Eamonn J. Keogh,et al.  Making Time-Series Classification More Accurate Using Learned Constraints , 2004, SDM.

[15]  Raphaël Marée,et al.  Random subwindows for robust image classification , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).