Finding Structure in Time

Time underlies many interesting human behaviors. Thus, the question of how to represent time in connectionist models is very important. One approach is to represent time implicitly by its effects on processing rather than explicitly (as in a spatial representation). The current report develops a proposal along these lines first described by Jordan (1986) which involves the use of recurrent links in order to provide networks with a dynamic memory. In this approach, hidden unit patterns are fed back to themselves; the internal representations which develop thus reflect task demands in the context of prior internal states. A set of simulations is reported which range from relatively simple problems (temporal version of XOR) to discovering syntactic/semantic features for words. The networks are able to learn interesting internal representations which incorporate task demands with memory demands; indeed, in this approach the notion of memory is inextricably bound up with task processing. These representations reveal a rich structure, which allows them to be highly context-dependent while also expressing generalizations across classes of items. These representations suggest a method for representing lexical categories and the type/token distinction.

[1]  K. Lashley The problem of serial order in behavior , 1951 .

[2]  L. A. Jeffress,et al.  Cerebral Mechanisms in Behavior , 1953 .

[3]  Winfred P. Lehmann,et al.  Historical Linguistics: An Introduction , 1962 .

[4]  Noam Chomsky,et al.  वाक्यविन्यास का सैद्धान्तिक पक्ष = Aspects of the theory of syntax , 1965 .

[5]  S. Potter,et al.  Universals of Language , 1966 .

[6]  W. Stolz Universals of Language. , 1968 .

[7]  P. MacNeilage Motor control of serial ordering of speech. , 1970, Psychological review.

[8]  Janet D. Fodor,et al.  The sausage machine: A new two-stage parsing model , 1978, Cognition.

[9]  B. MacWhinney The Acquisition Of Morphophonology , 1978 .

[10]  Mitchell P. Marcus,et al.  A theory of syntactic recognition for natural language , 1979 .

[11]  D. Swinney Lexical access during sentence comprehension: (Re)consideration of context effects , 1979 .

[12]  F Grosjean,et al.  Spoken word recognition processes and the gating paradigm , 1980, Perception & psychophysics.

[13]  Carol A. Fowler,et al.  Coarticulation and theories of extrinsic timing , 1980 .

[14]  W. Marslen-Wilson,et al.  The temporal structure of spoken language understanding , 1980, Cognition.

[15]  Steven Pinker,et al.  Language learnability and language development , 1985 .

[16]  E. Shoben,et al.  The influence of sentence constraint on the scope of facilitation for upcoming words. , 1985 .

[17]  A. Salasoo,et al.  Interaction of Knowledge Sources in Spoken Word Identification. , 1985, Journal of memory and language.

[18]  Geoffrey E. Hinton,et al.  Learning internal representations by error propagation , 1986 .

[19]  Elliot Saltzman,et al.  The dynamical perspectives on speech production: Data and theory , 1986 .

[20]  Robert C. Berwick,et al.  The Grammatical Basis of Linguistic Performance: Language Use and Acquisition , 1986 .

[21]  James L. McClelland,et al.  Parallel distributed processing: explorations in the microstructure of cognition, vol. 1: foundations , 1986 .

[22]  Tad Hogg,et al.  A Dynamical Approach to Temporal Pattern Processing , 1987, NIPS.

[23]  Terrence J. Sejnowski,et al.  Parallel Networks that Learn to Pronounce , 1987 .

[24]  P. Tabossi,et al.  Accessing lexical ambiguity: Effects of context and dominance , 1987 .

[25]  P. Smolensky On variable binding and the representation of symbolic structures in connectionist systems , 1987 .

[26]  Elliot Saltzman,et al.  Skilled actions: a task-dynamic approach. , 1987, Psychological review.

[27]  Fernando J. Pineda,et al.  Generalization of Back propagation to Recurrent and Higher Order Neural Networks , 1987, NIPS.

[28]  J. Kelso,et al.  Skilled actions: a task-dynamic approach. , 1987, Psychological review.

[29]  Terrence J. Sejnowski,et al.  Parallel Networks that Learn to Pronounce English Text , 1987, Complex Syst..

[30]  J J Hopfield,et al.  Neural computation by concentrating information in time. , 1987, Proceedings of the National Academy of Sciences of the United States of America.

[31]  Lokendra Shastri,et al.  Learning Phonetic Features Using Connectionist Networks , 1987, IJCAI.

[32]  J. Fodor,et al.  Connectionism and cognitive architecture: A critical analysis , 1988, Cognition.

[33]  P. Tabossi Effects of context on the immediate interpretation of unambiguous nouns. , 1988 .

[34]  P. Smolensky On the proper treatment of connectionism , 1988, Behavioral and Brain Sciences.

[35]  D Zipser,et al.  Learning the hidden structure of speech. , 1988, The Journal of the Acoustical Society of America.

[36]  James L. McClelland,et al.  Learning Subsequential Structure in Simple Recurrent Networks , 1988, NIPS.

[37]  Hubert L. Dreyfus,et al.  On the proper treatment of Smolensky , 1988, Behavioral and Brain Sciences.

[38]  Garrison W. Cottrell,et al.  Image compression by back-propagation: An example of extensional programming , 1988 .

[39]  Geoffrey E. Hinton,et al.  Phoneme recognition using time-delay neural networks , 1989, IEEE Trans. Acoust. Speech Signal Process..

[40]  Ronald J. Williams,et al.  A Learning Algorithm for Continually Running Fully Recurrent Neural Networks , 1989, Neural Computation.

[41]  Geoffrey E. Hinton,et al.  Distributed Representations , 1986, The Philosophy of Artificial Intelligence.

[42]  Lyle Campbell,et al.  Historical Linguistics: An Introduction , 1991 .