A Probabilistic Model of Lexical and Syntactic Access and Disambiguation

The problems of access (retrieving linguistic structure from some mental grammar) and disambiguation (choosing among these structures to correctly parse ambiguous linguistic input) are fundamental to language understanding. The literature abounds with psychological results on lexical access, the access of idioms, syntactic rule access, parsing preferences, syntactic disambiguation, and the processing of garden-path sentences. Unfortunately, it has been difficult to combine the models that account for these results into a general, uniform model of access and disambiguation at the lexical, idiomatic, and syntactic levels. For example, psycholinguistic theories of lexical access and idiom access, on the one hand, and parsing theories of syntactic rule access, on the other, have almost nothing in common in methodology or in coverage of psycholinguistic data. This article presents a single probabilistic algorithm that models both the access and the disambiguation of linguistic knowledge. The algorithm is based on a parallel parser that ranks constructions for access, and interpretations for disambiguation, by their conditional probability. Low-ranked constructions and interpretations are pruned through beam search; this pruning accounts, among other things, for the garden-path effect. I show that this motivated probabilistic treatment accounts for a wide variety of psycholinguistic results, arguing for a more uniform representation of linguistic knowledge and for the use of probabilistically enriched grammars and interpreters as models of human knowledge of, and processing of, language.
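The ranking and pruning step described above can be illustrated with a minimal sketch. It is not the article's implementation: the function names `rank` and `beam_prune`, the ratio-based beam width, the interpretation labels, and the probabilities are all hypothetical and chosen only for illustration. The sketch assumes each candidate interpretation already carries a conditional probability, and that a candidate is pruned when it is more than `beam_ratio` times less probable than the best one.

```python
from typing import Dict, List, Tuple


def rank(candidates: Dict[str, float]) -> List[Tuple[str, float]]:
    """Order candidate interpretations from most to least probable."""
    return sorted(candidates.items(), key=lambda kv: kv[1], reverse=True)


def beam_prune(candidates: Dict[str, float], beam_ratio: float) -> Dict[str, float]:
    """Keep candidates whose probability is within beam_ratio of the best.

    The ratio-based beam is an assumption of this sketch.
    """
    if not candidates:
        return {}
    best = max(candidates.values())
    return {name: p for name, p in candidates.items() if p * beam_ratio >= best}


# Hypothetical probabilities for two readings of an ambiguous prefix.
interpretations = {"main-verb": 0.92, "reduced-relative": 0.08}

ranked = rank(interpretations)
survivors = beam_prune(interpretations, beam_ratio=5.0)
print(ranked)
print(survivors)
```

With these made-up numbers, the lower-ranked reading falls outside the beam and is discarded. If later input turns out to be compatible only with the discarded reading, no surviving interpretation can be extended, which is the situation the abstract associates with the garden-path effect. A wider beam (for example `beam_ratio=20.0`) would keep both readings.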
