Early classification of time series using multi-objective optimization techniques

Abstract In early classification of time series the objective is to build models which are able to make class-predictions for time series as accurately and as early as possible, when only a part of the series is available. It is logical to think that accuracy and earliness are conflicting objectives, since the more we wait, more data points from the series are available, and it is easier to make accurate class-predictions. Considering this, the problem can be very naturally formulated as a multi-objective optimization problem, and solved as such. However, the solutions proposed in the literature up to now, reduce the problem into a single-objective problem by combining both objectives somehow. In this paper, we present a novel multi-objective formulation of the problem of early classification, and we design a solution using multi-objective optimization techniques. This method will provide a variety of solutions which find different trade-offs between both objectives, allowing the user to select the most suitable solution a-posteriori, depending on the accuracy and earliness requirements of the problem at hand. To prove the usefulness of our proposal, we carry out an extensive experimentation process using 45 benchmark databases and we present a case study in the financial domain.

[1]  Olaf Mersmann Multiple Criteria Optimization Algorithms and Related Functions , 2014 .

[2]  Yong Hu,et al.  Application of evolutionary computation for rule discovery in stock algorithmic trading: A literature review , 2015, Appl. Soft Comput..

[3]  Mark A. Girolami,et al.  vbmp: Variational Bayesian Multinomial Probit Regression for multi-class classification in R , 2008, Bioinform..

[4]  An-Pin Chen,et al.  Using intelligent computing and data stream mining for behavioral finance associated with market profile and financial physics , 2018, Appl. Soft Comput..

[5]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[6]  Bin Weng,et al.  Application of machine learning techniques for stock market prediction , 2017 .

[7]  Romain Tavenard,et al.  Cost-Aware Early Classification of Time Series , 2016, ECML/PKDD.

[8]  Ömer Kaan Baykan,et al.  Predicting direction of stock price index movement using artificial neural networks and support vector machines: The sample of the Istanbul Stock Exchange , 2011, Expert Syst. Appl..

[9]  Carlos Agón,et al.  Time-series data mining , 2012, CSUR.

[10]  B. Malkiel The Efficient Market Hypothesis and Its Critics , 2003 .

[11]  Mohamed F. Ghalwash,et al.  Early classification of multivariate time series using a hybrid HMM/SVM model , 2012, 2012 IEEE International Conference on Bioinformatics and Biomedicine.

[12]  Yong Duan,et al.  Early classification on multivariate time series , 2015, Neurocomputing.

[14]  Philip S. Yu,et al.  Early classification on time series , 2012, Knowledge and Information Systems.

[15]  E. Fama EFFICIENT CAPITAL MARKETS: A REVIEW OF THEORY AND EMPIRICAL WORK* , 1970 .

[16]  Eamonn J. Keogh,et al.  Reliable early classification of time series based on discriminating the classes over time , 2016, Data Mining and Knowledge Discovery.

[17]  Hyrum S. Anderson,et al.  Classifying with confidence from incomplete information , 2013, J. Mach. Learn. Res..

[18]  Zhe George Zhang,et al.  Forecasting stock indices with back propagation neural network , 2011, Expert Syst. Appl..

[19]  Tak-Chung Fu,et al.  A review on time series data mining , 2011, Eng. Appl. Artif. Intell..

[20]  Sanjoy Dasgupta,et al.  Early Classification of Time Series by Simultaneously Optimizing the Accuracy and Earliness , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[21]  Chih-Chou Chiu,et al.  A hybrid approach by integrating wavelet-based feature extraction with MARS and SVR for stock index forecasting , 2013, Decis. Support Syst..

[22]  Allan Timmermann,et al.  Elusive Return Predictability , 2008 .

[23]  Juan José Rodríguez Diez,et al.  Early Fault Classification in Dynamic Systems Using Case-Based Reasoning , 2005, CAEPIA.

[24]  Mohamed F. Ghalwash,et al.  Utilizing temporal patterns for estimating uncertainty in interpretable early decision making , 2014, KDD.

[25]  Camelia Chira,et al.  Classifiers with a reject option for early time-series classification , 2013, 2013 IEEE Symposium on Computational Intelligence and Ensemble Learning (CIEL).

[26]  Eamonn J. Keogh,et al.  The great time series classification bake off: a review and experimental evaluation of recent algorithmic advances , 2016, Data Mining and Knowledge Discovery.

[27]  Rohit J. Kate Using dynamic time warping distances as features for improved time series classification , 2016, Data Mining and Knowledge Discovery.

[28]  Johan Bollen,et al.  Twitter mood predicts the stock market , 2010, J. Comput. Sci..

[29]  Luca Scrucca,et al.  GA: A Package for Genetic Algorithms in R , 2013 .

[30]  Kalyanmoy Deb,et al.  A fast and elitist multiobjective genetic algorithm: NSGA-II , 2002, IEEE Trans. Evol. Comput..

[31]  Tongda Zhang,et al.  Stock Market Forecasting Using Machine Learning Algorithms , 2012 .

[32]  Antoine Cornuéjols,et al.  Early Classification of Time Series as a Non Myopic Sequential Decision Making Problem , 2015, ECML/PKDD.

[33]  Philip S. Yu,et al.  Extracting Interpretable Features for Early Classification on Time Series , 2011, SDM.

[34]  R. Scott Evans,et al.  Automated detection of physiologic deterioration in hospitalized patients , 2015, J. Am. Medical Informatics Assoc..

[35]  Eamonn J. Keogh,et al.  Time series shapelets: a new primitive for data mining , 2009, KDD.