Contextual bootstrapping for grammar learning

Grammar learning is challenging for both children and machines because the input is impoverished: grammatical structures are hidden, explicit correction is lacking, and, in pro-drop languages, arguments are omitted. This dissertation describes a computational model of child grammar learning, built on a probabilistic version of Embodied Construction Grammar (ECG), that demonstrates how bootstrapping from the situational context alleviates the problem of impoverished input. The model brings together three components: (1) a unified representation that integrates semantic, linguistic, and contextual knowledge, (2) a context-aware language understanding process, and (3) a structured grammar learning and generalization process. Using situated child-directed utterances as learning input, the model performs two concurrent learning tasks: structural learning of the grammatical units and statistical learning of the associated parameters. The structural learning task is a guided search over the space of possible constructions. The search is informed by embodied semantic knowledge that the model has gathered through experience with the world even before learning grammar, and by situational knowledge that it obtains from context. The statistical learning task continuously updates the parameters of the probabilistic grammar based on usage, and these parameters reflect shifting preferences among learned grammatical structures. The model has been validated in two ways. It has been applied to a subset of the CHILDES Beijing corpus, a corpus of naturalistic parent-child interaction in Mandarin Chinese, and its learning behavior has been examined more closely using an artificial miniature language. This learning model provides a precise computational framework for fleshing out theories of construction formation and generalization.
