Random Forests Based Group Importance Scores and Their Statistical Interpretation: Application for Alzheimer's Disease

Machine learning approaches have been increasingly used in the neuroimaging field for the design of computer-aided diagnosis systems. In this paper, we focus on the ability of these methods to provide interpretable information about the brain regions that are the most informative about the disease or condition of interest. In particular, we investigate the benefit of group-based, instead of voxel-based, analyses in the context of Random Forests. Assuming a prior division of the voxels into non overlapping groups (defined by an atlas), we propose several procedures to derive group importances from individual voxel importances derived from Random Forests models. We then adapt several permutation schemes to turn group importance scores into more interpretable statistical scores that allow to determine the truly relevant groups in the importance rankings. The good behaviour of these methods is first assessed on artificial datasets. Then, they are applied on our own dataset of FDG-PET scans to identify the brain regions involved in the prognosis of Alzheimer's disease.

[1]  Ramon Casanova,et al.  High Dimensional Classification of Structural MRI Alzheimer’s Disease Data Based on Large Scale Regularization , 2011, Front. Neuroinform..

[2]  S. Wold,et al.  The Collinearity Problem in Linear Regression. The Partial Least Squares (PLS) Approach to Generalized Inverses , 1984 .

[3]  Pierre Geurts,et al.  Tree ensemble methods and parcelling to identify brain areas related to Alzheimer’s disease , 2017, 2017 International Workshop on Pattern Recognition in Neuroimaging (PRNI).

[4]  G. Frisoni,et al.  Mapping brain morphological and functional conversion patterns in amnestic MCI: a voxel-based MRI and FDG-PET study , 2009, European Journal of Nuclear Medicine and Molecular Imaging.

[5]  Nick C Fox,et al.  Automatic classification of MR scans in Alzheimer's disease. , 2008, Brain : a journal of neurology.

[6]  Andrea Bergmann,et al.  Statistical Parametric Mapping The Analysis Of Functional Brain Images , 2016 .

[7]  J. Karlawish,et al.  The A4 Study: Stopping AD Before Symptoms Begin? , 2014, Science Translational Medicine.

[8]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[9]  A. Ravishankar Rao,et al.  Prediction and interpretation of distributed neural activity with sparse models , 2009, NeuroImage.

[10]  Terence P Speed,et al.  SOME STEP-DOWN PROCEDURES CONTROLLING THE FALSE DISCOVERY RATE UNDER DEPENDENCE. , 2008, Statistica Sinica.

[11]  Luca Baldassarre,et al.  Localizing and Comparing Weight Maps Generated from Linear Kernel Machine Learning Models , 2013, 2013 International Workshop on Pattern Recognition in Neuroimaging.

[12]  WestonJason,et al.  Gene Selection for Cancer Classification using Support Vector Machines , 2002 .

[13]  E. Tangalos,et al.  Mild Cognitive Impairment Clinical Characterization and Outcome , 1999 .

[14]  A. Drzezga,et al.  Cerebral metabolic changes accompanying conversion of mild cognitive impairment into Alzheimer's disease: a PET follow-up study , 2003, European Journal of Nuclear Medicine and Molecular Imaging.

[15]  J. Baron,et al.  FDG-PET measurement is more accurate than neuropsychological assessments to predict global cognitive deterioration in patients with mild cognitive impairment , 2005, Neurocase.

[16]  J. Baron,et al.  Mild cognitive impairment , 2003, Neurology.

[17]  Janaina Mourão Miranda,et al.  Embedding Anatomical or Functional Knowledge in Whole-Brain Multiple Kernel Learning Models , 2018, Neuroinformatics.

[18]  Pierre Geurts,et al.  Extremely randomized trees , 2006, Machine Learning.

[19]  R. Petersen,et al.  Mild cognitive impairment , 2006, The Lancet.

[20]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[21]  I. Jolliffe Principal Component Analysis and Factor Analysis , 1986 .

[22]  J. Hardy,et al.  The Amyloid Hypothesis of Alzheimer ’ s Disease : Progress and Problems on the Road to Therapeutics , 2009 .

[23]  Janaina Mourão Miranda,et al.  Classifying brain states and determining the discriminating activation patterns: Support Vector Machine on functional MRI data , 2005, NeuroImage.

[24]  Jason Weston,et al.  Gene Selection for Cancer Classification using Support Vector Machines , 2002, Machine Learning.

[25]  Polina Golland,et al.  Detecting stable distributed patterns of brain activation using Gini contrast , 2011, NeuroImage.

[26]  Kaustubh Supekar,et al.  Sparse logistic regression for whole-brain classification of fMRI data , 2010, NeuroImage.

[27]  B. Kowalski,et al.  Partial least-squares regression: a tutorial , 1986 .

[28]  J. Ashburner,et al.  Nonlinear spatial normalization using basis functions , 1999, Human brain mapping.

[29]  Yongchao Ge Resampling-based Multiple Testing for Microarray Data Analysis , 2003 .

[30]  Jean-Baptiste Poline,et al.  Which fMRI clustering gives good brain parcellations? , 2014, Front. Neurosci..

[31]  Yvan Saeys,et al.  Statistical interpretation of machine learning-based feature importance scores for biomarker discovery , 2012, Bioinform..

[32]  Andreas Christmann,et al.  Support Vector Machines , 2008, Data Mining and Knowledge Discovery Handbook.

[33]  L. Mucke,et al.  100 Years and Counting: Prospects for Defeating Alzheimer's Disease , 2006, Science.

[34]  Daoqiang Zhang,et al.  Multimodal classification of Alzheimer's disease and mild cognitive impairment , 2011, NeuroImage.

[35]  R. Petersen,et al.  Mild Cognitive Impairment: An Overview , 2008, CNS Spectrums.

[36]  George C. Runger,et al.  Feature Selection with Ensembles, Artificial Variables, and Redundancy Elimination , 2009, J. Mach. Learn. Res..

[37]  J. Morris,et al.  Current concepts in mild cognitive impairment. , 2001, Archives of neurology.

[38]  Kewei Chen,et al.  Peripheral apoE isoform levels in cognitively normal APOE ε3/ε4 individuals are associated with regional gray matter volume and cerebral glucose metabolism , 2017, Alzheimer's Research & Therapy.

[39]  Thomas Lengauer,et al.  Permutation importance: a corrected feature importance measure , 2010, Bioinform..

[40]  C. Jack,et al.  Prediction of AD with MRI-based hippocampal volume in mild cognitive impairment , 1999, Neurology.

[41]  Gilles Louppe,et al.  Understanding variable importances in forests of randomized trees , 2013, NIPS.

[42]  Pierre Geurts,et al.  Exploiting tree-based variable importances to selectively identify relevant variables , 2008, FSDM.

[43]  Petruska Clarkson,et al.  Review feature , 2000 .

[44]  Arno Villringer,et al.  Differential effects of global and cerebellar normalization on detection and differentiation of dementia in FDG-PET studies , 2010, NeuroImage.

[45]  Achim Zeileis,et al.  Bias in random forest variable importance measures: Illustrations, sources and a solution , 2007, BMC Bioinformatics.

[46]  R. Killiany,et al.  Use of structural magnetic resonance imaging to predict who will get Alzheimer's disease , 2000, Annals of neurology.

[47]  T. Lumley,et al.  PRINCIPAL COMPONENT ANALYSIS AND FACTOR ANALYSIS , 2004, Statistical Methods for Biomedical Research.

[48]  Daniel Rueckert,et al.  Random forest-based similarity measures for multi-modal classification of Alzheimer's disease , 2013, NeuroImage.

[49]  Christos Davatzikos,et al.  A review on neuroimaging-based classification studies and associated feature extraction methods for Alzheimer's disease and its prodromal stages , 2017, NeuroImage.

[50]  Jean-Baptiste Poline,et al.  A supervised clustering approach for extracting predictive information from brain activation images , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Workshops.

[51]  Ender Konukoglu,et al.  Relevant feature set estimation with a knock-out strategy and random forests , 2015, NeuroImage.

[52]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[53]  Kathryn Ziegler-Graham,et al.  Forecasting the global burden of Alzheimer’s disease , 2007, Alzheimer's & Dementia.

[54]  Benson Mwangi,et al.  A Review of Feature Reduction Techniques in Neuroimaging , 2013, Neuroinformatics.

[55]  Bertrand Thirion,et al.  Multiscale Mining of fMRI Data with Hierarchical Structured Sparsity , 2012, SIAM J. Imaging Sci..

[56]  N. Tzourio-Mazoyer,et al.  Automated Anatomical Labeling of Activations in SPM Using a Macroscopic Anatomical Parcellation of the MNI MRI Single-Subject Brain , 2002, NeuroImage.

[57]  Clifford R. Jack,et al.  Alzheimer's disease diagnosis in individual subjects using structural MR images: Validation studies , 2008, NeuroImage.

[58]  John D. Storey,et al.  Statistical significance for genomewide studies , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[59]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[60]  R. Petersen,et al.  Alzheimer's disease and mild cognitive impairment. , 2007, Neurologic clinics.

[61]  Aixia Guo,et al.  Gene Selection for Cancer Classification using Support Vector Machines , 2014 .

[62]  M. Korenberg,et al.  Microarray Data Analysis , 2022, Methods in Molecular Biology.

[63]  Juan José Rodríguez Diez,et al.  Random Subspace Ensembles for fMRI Classification , 2010, IEEE Transactions on Medical Imaging.

[64]  Peter A. Bandettini,et al.  Does feature selection improve classification accuracy? Impact of sample size and feature selection on classification using anatomical magnetic resonance images , 2012, NeuroImage.

[65]  Juan Manuel Górriz,et al.  A comparative study of feature extraction methods for the diagnosis of Alzheimer's disease using the ADNI database , 2012, Neurocomputing.

[66]  S. Rombouts,et al.  Altered resting state networks in mild cognitive impairment and mild Alzheimer's disease: An fMRI study , 2005, Human brain mapping.

[67]  Pierre Geurts,et al.  Pattern Extraction for Time Series Classification , 2001, PKDD.

[68]  H. Zou,et al.  Regularization and variable selection via the elastic net , 2005 .