Explaining and Interpreting LSTMs

While neural networks have acted as a strong unifying force in the design of modern AI systems, the architectures themselves remain highly heterogeneous, reflecting the variety of tasks they are built to solve. In this chapter, we explore how to adapt Layer-wise Relevance Propagation (LRP), a technique originally developed for explaining the predictions of feed-forward networks, to the LSTM architecture used for sequential data modeling and forecasting. The special accumulators and gated interactions present in the LSTM require both a new propagation scheme and an extension of the underlying theoretical framework in order to deliver faithful explanations.
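Although the chapter's full derivation is not reproduced here, the minimal NumPy sketch below illustrates the two ingredients such an adaptation typically combines: the epsilon-stabilized LRP rule for linear layers, carried over unchanged from feed-forward networks, and a rule for the LSTM's gated products in the cell-state update c_t = f_t * c_{t-1} + i_t * g_t, in which the gate receives zero relevance and the signal receives all of it (the "signal-take-all" convention proposed by Arras et al., 2017). The function names, the eps value, and the simplified treatment of the bias term are illustrative assumptions, not the chapter's exact formulation.

```python
import numpy as np

def lrp_linear(x, w, b, r_out, eps=0.001):
    """Epsilon-LRP for a linear layer z = w.T @ x + b.

    Redistributes the output relevance r_out onto the inputs x in
    proportion to the contributions w[i, j] * x[i]; eps stabilizes
    near-zero denominators. (Simplified: the bias keeps its share.)
    """
    z = w.T @ x + b                                # pre-activations, shape (d_out,)
    denom = z + eps * np.where(z >= 0, 1.0, -1.0)  # stabilized denominator
    contrib = w * x[:, None]                       # per-input contributions, (d_in, d_out)
    return contrib @ (r_out / denom)               # relevance per input, shape (d_in,)

def lrp_gated_sum(f, c_prev, i, g, r_c, eps=0.001):
    """LRP through the LSTM cell update c = f * c_prev + i * g.

    The sum splits relevance in proportion to each term's contribution;
    within each product the gate (f or i) receives zero relevance and
    the signal (c_prev or g) receives all of it ("signal-take-all").
    """
    c = f * c_prev + i * g
    denom = c + eps * np.where(c >= 0, 1.0, -1.0)
    r_c_prev = (f * c_prev / denom) * r_c  # all of this term's relevance -> c_prev
    r_g = (i * g / denom) * r_c            # all of this term's relevance -> g
    return r_c_prev, r_g                   # the gates f and i receive none

# Toy usage with a 3-dimensional cell state.
rng = np.random.default_rng(0)
f, i = rng.uniform(0, 1, 3), rng.uniform(0, 1, 3)  # gate activations in (0, 1)
c_prev, g = rng.normal(size=3), np.tanh(rng.normal(size=3))
r_c = rng.normal(size=3)                           # relevance arriving at the cell state
r_c_prev, r_g = lrp_gated_sum(f, c_prev, i, g, r_c)
print("relevance in :", r_c.sum())
print("relevance out:", (r_c_prev + r_g).sum())    # approximately conserved (up to eps)
```

Because each rule merely redistributes the incoming relevance, summing the returned vectors approximately recovers it; the eps stabilizer is the only source of leakage.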
