One-shot and few-shot learning of word embeddings

Standard deep learning systems require thousands or millions of examples to learn a concept, and cannot integrate new concepts easily. By contrast, humans have a remarkable ability to do one-shot or few-shot learning. For instance, from just hearing a word used in a sentence, humans can infer a great deal about it by leveraging what the syntax and semantics of the surrounding words tell us. Here, we draw inspiration from this ability to highlight a simple technique by which deep recurrent networks can similarly exploit their prior knowledge to learn a useful representation for a new word from little data. This could make natural language processing systems much more flexible, by allowing them to learn continually from the new words they encounter.
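The sketch below illustrates one plausible instantiation of this idea, not the paper's exact procedure: a pretrained recurrent language model is frozen, a vocabulary slot is reserved for the new word, and only that word's embedding vector is optimized on a handful of sentences containing it. The model architecture, sizes, initialization, and optimizer settings are illustrative assumptions.

```python
"""Hedged sketch: few-shot learning of a new word's embedding by freezing a
pretrained recurrent language model and optimizing only the new word's vector.
All names, sizes, and hyperparameters here are assumptions for illustration."""
import torch
import torch.nn as nn

VOCAB, DIM, HIDDEN = 1000, 64, 128  # assumed vocabulary and layer sizes

class LSTMLanguageModel(nn.Module):
    def __init__(self, vocab, dim, hidden):
        super().__init__()
        self.embedding = nn.Embedding(vocab, dim)
        self.lstm = nn.LSTM(dim, hidden, batch_first=True)
        self.decoder = nn.Linear(hidden, vocab)

    def forward(self, tokens):
        hidden_states, _ = self.lstm(self.embedding(tokens))
        return self.decoder(hidden_states)

# In practice this model would be pretrained on a large corpus; here it is
# randomly initialized purely so the sketch runs end to end.
model = LSTMLanguageModel(VOCAB, DIM, HIDDEN)

# Reserve one vocabulary slot for the new word and initialize its embedding
# near the centroid of the known embeddings (an assumed initialization).
new_word = VOCAB - 1
with torch.no_grad():
    model.embedding.weight[new_word] = model.embedding.weight[:new_word].mean(0)

# Freeze all pretrained parameters; only the embedding matrix keeps gradients,
# and below we zero out every row's gradient except the new word's.
for p in model.parameters():
    p.requires_grad_(False)
model.embedding.weight.requires_grad_(True)

# A few hypothetical sentences (as random token ids) containing the new word.
few_shot_sentences = torch.randint(0, VOCAB - 1, (4, 10))
few_shot_sentences[:, 5] = new_word  # place the new word mid-sentence

optimizer = torch.optim.Adam([model.embedding.weight], lr=0.1)
loss_fn = nn.CrossEntropyLoss()

for step in range(50):
    optimizer.zero_grad()
    inputs, targets = few_shot_sentences[:, :-1], few_shot_sentences[:, 1:]
    logits = model(inputs)
    loss = loss_fn(logits.reshape(-1, VOCAB), targets.reshape(-1))
    loss.backward()
    # Keep the pretrained embeddings fixed: only the new word's row is updated.
    model.embedding.weight.grad[:new_word] = 0
    optimizer.step()
```

Freezing the pretrained weights is the design choice that lets the network's prior knowledge constrain the new word's representation while preventing the few-shot updates from interfering with what has already been learned.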
