Optimization of Alpha-Beta Log-Det Divergences and their Application in the Spatial Filtering of Two Class Motor Imagery Movements

The Alpha-Beta Log-Det divergences for positive definite matrices are flexible divergences that are parameterized by two real constants and are able to specialize several relevant classical cases like the squared Riemannian metric, the Steins loss, the S-divergence, etc. A novel classification criterion based on these divergences is optimized to address the problem of classification of the motor imagery movements. This research paper is divided into three main sections in order to address the above mentioned problem: (1) Firstly, it is proven that a suitable scaling of the class conditional covariance matrices can be used to link the Common Spatial Pattern (CSP) solution with a predefined number of spatial filters for each class and its representation as a divergence optimization problem by making their different filter selection policies compatible; (2) A closed form formula for the gradient of the Alpha-Beta Log-Det divergences is derived that allows to perform optimization as well as easily use it in many practical applications; (3) Finally, in similarity with the work of Samek et al. 2014, which proposed the robust spatial filtering of the motor imagery movements based on the beta-divergence, the optimization of the Alpha-Beta Log-Det divergences is applied to this problem. The resulting subspace algorithm provides a unified framework for testing the performance and robustness of the several divergences in different scenarios.

[1]  K. Müller,et al.  Finding stationary subspaces in multivariate time series. , 2009, Physical review letters.

[2]  Mehrtash Harandi,et al.  Dimensionality Reduction on SPD Manifolds: The Emergence of Geometry-Aware Methods , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Cuntai Guan,et al.  Optimizing Spatial Filters by Minimizing Within-Class Dissimilarities in Electroencephalogram-Based Brain–Computer Interface , 2013, IEEE Transactions on Neural Networks and Learning Systems.

[4]  Haixian Wang Harmonic Mean of Kullback–Leibler Divergences for Optimizing Multi-Class EEG Spatio-Temporal Filters , 2012, Neural Processing Letters.

[5]  Ad Aertsen,et al.  Review of the BCI Competition IV , 2012, Front. Neurosci..

[6]  Ren-Cang Li Rayleigh Quotient Based Optimization Methods For Eigenvalue Problems , 2014 .

[7]  G. Pfurtscheller,et al.  Optimal spatial filtering of single trial EEG during imagined hand movement. , 2000, IEEE transactions on rehabilitation engineering : a publication of the IEEE Engineering in Medicine and Biology Society.

[8]  Andrzej Cichocki,et al.  Families of Alpha- Beta- and Gamma- Divergences: Flexible and Robust Measures of Similarities , 2010, Entropy.

[9]  Wei Wu Probabilistic Common Spatial Patterns for Multichannel EEG Analysis , 2015 .

[10]  Motoaki Kawanabe,et al.  Brain-computer interfacing in discriminative and stationary subspaces , 2012, 2012 Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[11]  Alan Edelman,et al.  The Geometry of Algorithms with Orthogonality Constraints , 1998, SIAM J. Matrix Anal. Appl..

[12]  M. Kawanabe,et al.  Improving BCI Performance by Modified Common Spatial Patterns with Robustly Averaged Covariance Matrices , 2009 .

[13]  Minh Ha Quang Infinite-dimensional Log-Determinant divergences II: Alpha-Beta divergences. , 2016 .

[14]  Motoaki Kawanabe,et al.  Robust Spatial Filtering with Beta Divergence , 2013, NIPS.

[15]  Keinosuke Fukunaga,et al.  Application of the Karhunen-Loève Expansion to Feature Selection and Ordering , 1970, IEEE Trans. Computers.

[16]  F. L. D. Silva,et al.  Event-related EEG/MEG synchronization and desynchronization: basic principles , 1999, Clinical Neurophysiology.

[17]  Cuntai Guan,et al.  Spatially Regularized Common Spatial Patterns for EEG Classification , 2010, 2010 20th International Conference on Pattern Recognition.

[18]  Seungjin Choi,et al.  Composite Common Spatial Pattern for Subject-to-Subject Transfer , 2009, IEEE Signal Processing Letters.

[19]  R. Ward,et al.  Robust Common Spatial Patterns for EEG signal preprocessing , 2008, 2008 30th Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[20]  Motoaki Kawanabe,et al.  Robust Common Spatial Filters with a Maxmin Approach , 2014, Neural Computation.

[21]  Motoaki Kawanabe,et al.  Invariant Common Spatial Patterns: Alleviating Nonstationarities in Brain-Computer Interfacing , 2007, NIPS.

[22]  Cuntai Guan,et al.  Learning from other subjects helps reducing Brain-Computer Interface calibration time , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[23]  Z J Koles,et al.  The quantitative extraction and topographic mapping of the abnormal components in the clinical EEG. , 1991, Electroencephalography and clinical neurophysiology.

[24]  David Barber,et al.  Bayesian reasoning and machine learning , 2012 .

[25]  J. Wolpaw,et al.  Brain-Computer Interfaces: Principles and Practice , 2012 .

[26]  G Pfurtscheller,et al.  Seperability of four-class motor imagery data using independent components analysis , 2006, Journal of neural engineering.

[27]  Klaus-Robert Muller,et al.  Finding stationary brain sources in EEG data , 2010, 2010 Annual International Conference of the IEEE Engineering in Medicine and Biology.

[28]  Moritz Grosse-Wentrup,et al.  Multiclass Common Spatial Patterns and Information Theoretic Feature Extraction , 2008, IEEE Transactions on Biomedical Engineering.

[29]  G. Pfurtscheller,et al.  Designing optimal spatial filters for single-trial EEG classification in a movement task , 1999, Clinical Neurophysiology.

[30]  Klaus-Robert Müller,et al.  Boosting bit rates in noninvasive EEG single-trial classifications by feature combination and multiclass paradigms , 2004, IEEE Transactions on Biomedical Engineering.

[31]  Terrence J. Sejnowski,et al.  Toward Brain-Computer Interfacing (Neural Information Processing) , 2007 .

[32]  Christian Jutten,et al.  Multiclass Brain–Computer Interface Classification by Riemannian Geometry , 2012, IEEE Transactions on Biomedical Engineering.

[33]  Sergio Cruces,et al.  Log-Determinant Divergences Revisited: Alpha-Beta and Gamma Log-Det Divergences , 2014, Entropy.

[34]  Alexander Binder,et al.  Multiple Kernel Learning for Brain-Computer Interfacing , 2013, 2013 35th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).

[35]  Cuntai Guan,et al.  Regularizing Common Spatial Patterns to Improve BCI Designs: Unified Theory and New Algorithms , 2011, IEEE Transactions on Biomedical Engineering.

[36]  Motoaki Kawanabe,et al.  Divergence-Based Framework for Common Spatial Patterns Algorithms , 2014, IEEE Reviews in Biomedical Engineering.

[37]  G. Pfurtscheller,et al.  The BCI competition III: validating alternative approaches to actual BCI problems , 2006, IEEE Transactions on Neural Systems and Rehabilitation Engineering.

[38]  Gert Pfurtscheller,et al.  Characterization of four-class motor imagery EEG data for the BCI-competition 2005 , 2005, Journal of neural engineering.

[39]  Yoram Singer,et al.  Reducing Multiclass to Binary: A Unifying Approach for Margin Classifiers , 2000, J. Mach. Learn. Res..

[40]  Suvrit Sra,et al.  Geometric Optimization in Machine Learning , 2016 .

[41]  Shun-ichi Amari,et al.  Natural Gradient Works Efficiently in Learning , 1998, Neural Computation.

[42]  W. Samek,et al.  Group-wise Stationary Subspace Analysis-A novel method for studying non-stationarities , 2011 .

[43]  Cuntai Guan,et al.  Spatially sparsed Common Spatial Pattern to improve BCI performance , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[44]  Haiping Lu,et al.  Regularized common spatial patterns with generic learning for EEG signal classification , 2009, 2009 Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[45]  Selina Wriessnegger,et al.  Regularised CSP for Sensor Selection in BCI , 2006 .

[46]  Sergio Cruces,et al.  From blind signal extraction to blind instantaneous signal separation: criteria, algorithms, and stability , 2004, IEEE Transactions on Neural Networks.

[47]  Gary E. Birch,et al.  Sparse spatial filter optimization for EEG channel reduction in brain-computer interface , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[48]  T. Tao Topics in Random Matrix Theory , 2012 .

[49]  Shiguang Shan,et al.  Log-Euclidean Metric Learning on Symmetric Positive Definite Manifold with Application to Image Set Classification , 2015, ICML.

[50]  Qin Tang,et al.  L1-Norm-Based Common Spatial Patterns , 2012, IEEE Transactions on Biomedical Engineering.

[51]  Cuntai Guan,et al.  Bayesian Learning for Spatial Filtering in an EEG-Based Brain–Computer Interface , 2013, IEEE Transactions on Neural Networks and Learning Systems.

[52]  Masashi Sugiyama,et al.  Geometry-aware principal component analysis for symmetric positive definite matrices , 2017, Machine Learning.

[53]  Stephanie Brandl,et al.  Robust common spatial patterns based on Bhattacharyya distance and Gamma divergence , 2015, The 3rd International Winter Conference on Brain-Computer Interface.

[54]  Wonzoo Chung,et al.  Common spatial patterns based on generalized norms , 2013, 2013 International Winter Workshop on Brain-Computer Interface (BCI).