论文信息 - Power Data Classification: A Hybrid of a Novel Local Time Warping and LSTM

Power Data Classification: A Hybrid of a Novel Local Time Warping and LSTM

In this paper, for the purpose of data centre energy consumption monitoring and analysis, we propose to detect the running programs in a server by classifying the observed power consumption series. Time series classification problem has been extensively studied with various distance measurements developed; also recently the deep learning based sequence models have been proved to be promising. In this paper, we propose a novel distance measurement and build a time series classification algorithm hybridizing nearest neighbour and long short term memory (LSTM) neural network. More specifically, first we propose a new distance measurement termed as Local Time Warping (LTW), which utilizes a user-specified set for local warping, and is designed to be non-commutative and non-dynamic programming. Second we hybridize the 1NN-LTW and LSTM together. In particular, we combine the prediction probability vector of 1NN-LTW and LSTM to determine the label of the test cases. Finally, using the power consumption data from a real data center, we show that the proposed LTW can improve the classification accuracy of DTW from about 84% to 90%. Our experimental results prove that the proposed LTW is competitive on our data set compared with existed DTW variants and its non-commutative feature is indeed beneficial. We also test a linear version of LTW and it can significantly outperform existed linear runtime lower bound methods like LB_Keogh. Furthermore, with the hybrid algorithm, for the power series classification task we achieve an accuracy up to about 93%. Our research can inspire more studies on time series distance measurement and the hybrid of the deep learning models with other traditional models.

[1] C. Burrus,et al. DFT/FFT and Convolution Algorithms: Theory and Implementation , 1991 .

[2] Geoffrey E. Hinton,et al. Reducing the Dimensionality of Data with Neural Networks , 2006, Science.

[3] Matthew J. Hausknecht,et al. Beyond short snippets: Deep networks for video classification , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[4] Lukás Burget,et al. Recurrent neural network based language model , 2010, INTERSPEECH.

[5] Gaël Varoquaux,et al. Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[6] James Large,et al. The Great Time Series Classification Bake Off: An Experimental Evaluation of Recently Proposed Algorithms. Extended Version , 2016, ArXiv.

[7] J. Chase,et al. Data Center Workload Monitoring , Analysis , and Emulation , 2005 .

[8] Eamonn J. Keogh,et al. A Complexity-Invariant Distance Measure for Time Series , 2011, SDM.

[9] Li Wei,et al. Fast time series classification using numerosity reduction , 2006, ICML.

[10] Eamonn J. Keogh,et al. Everything you know about Dynamic Time Warping is Wrong , 2004 .

[11] Gautam Das,et al. The Move-Split-Merge Metric for Time Series , 2013, IEEE Transactions on Knowledge and Data Engineering.

[12] Samir S. Soliman,et al. Signal classification using statistical moments , 1992, IEEE Trans. Commun..

[13] Eamonn J. Keogh,et al. Searching and Mining Trillions of Time Series Subsequences under Dynamic Time Warping , 2012, KDD.

[14] Philip Chan,et al. Toward accurate dynamic time warping in linear time and space , 2007, Intell. Data Anal..

[15] Trevor Darrell,et al. Long-term recurrent convolutional networks for visual recognition and description , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[16] Jeffrey H. Reed,et al. A new approach to signal classification using spectral correlation and neural networks , 2005, First IEEE International Symposium on New Frontiers in Dynamic Spectrum Access Networks, 2005. DySPAN 2005..

[17] Jürgen Schmidhuber,et al. Learning to Forget: Continual Prediction with LSTM , 2000, Neural Computation.

[18] Li Wei,et al. Experiencing SAX: a novel symbolic representation of time series , 2007, Data Mining and Knowledge Discovery.

[19] David Vandyke,et al. Semantically Conditioned LSTM-based Natural Language Generation for Spoken Dialogue Systems , 2015, EMNLP.

[20] Eamonn J. Keogh,et al. Fast Shapelets: A Scalable Algorithm for Discovering Time Series Shapelets , 2013, SDM.

[21] David W. Hosmer,et al. Applied Logistic Regression , 1991 .

[22] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.

[23] George C. Runger,et al. A time series forest for classification and feature extraction , 2013, Inf. Sci..

[24] Luiz André Barroso,et al. The Case for Energy-Proportional Computing , 2007, Computer.