Paper information - A Distributional Semantics Model for Idiom Detection - The Case of English and Russian

This paper describes experiments in automatic idiom detection for English and Russian. Our algorithm is based on the idea that literal and idiomatic expressions appear in different contexts. Our distributional semantics model captures this difference. We evaluate the model on both languages and compare the results. We show that our model is language-independent. We also describe a new annotated resource we created for our

Jing Peng | Anna Feldman | Katsiaryna Aharodnik
