Do audio-visual motion cues promote segregation of auditory streams?

An audio-visual experiment using moving sound sources was designed to investigate whether the analysis of auditory scenes is modulated by synchronous presentation of visual information. Listeners were presented with an alternating sequence of two pure tones delivered by two separate sound sources. In different conditions, the two sound sources were either stationary or moving on random trajectories around the listener. Both the sounds and the movement trajectories were derived from recordings in which two humans were moving with loudspeakers attached to their heads. Visualized movement trajectories modeled by a computer animation were presented together with the sounds. In the main experiment, behavioral reports on sound organization were collected from young healthy volunteers. The proportion and stability of the different sound organizations were compared between the conditions in which the visualized trajectories matched the movement of the sound sources and when the two were independent of each other. The results corroborate earlier findings that separation of sound sources in space promotes segregation. However, no additional effect of auditory movement per se on the perceptual organization of sounds was obtained. Surprisingly, the presentation of movement-congruent visual cues did not strengthen the effects of spatial separation on segregating auditory streams. Our findings are consistent with the view that bistability in the auditory modality can occur independently from other modalities.

[1]  Cristy Ho,et al.  Multisensory In-Car Warning Signals for Collision Avoidance , 2007, Hum. Factors.

[2]  E. Van der Burg,et al.  Audiovisual events capture attention: evidence from temporal order judgments. , 2008, Journal of vision.

[3]  R. Carlyon,et al.  Effects of location, frequency region, and time course of selective attention on auditory scene analysis. , 2004, Journal of experimental psychology. Human perception and performance.

[4]  Makio Kashino,et al.  Involvement of the Thalamocortical Loop in the Spontaneous Switching of Percepts in Auditory Streaming , 2009, The Journal of Neuroscience.

[5]  Brian C J Moore,et al.  Multistability in perception: binding sensory modalities, an overview , 2012, Philosophical Transactions of the Royal Society B: Biological Sciences.

[6]  L. V. Noorden Temporal coherence in the perception of tone sequences , 1975 .

[7]  Nao Ninomiya,et al.  The 10th anniversary of journal of visualization , 2007, J. Vis..

[8]  B. Moore,et al.  Primitive stream segregation of tone sequences without differences in fundamental frequency or passband. , 2002, The Journal of the Acoustical Society of America.

[9]  E. Lopez-Poveda,et al.  The neurophysiological bases of auditory perception , 2010 .

[10]  Nava Rubin,et al.  Alternation rate in perceptual bistability is maximal at and symmetric around equi-dominance. , 2010, Journal of vision.

[11]  P F Seitz,et al.  The use of visible speech cues for improving auditory detection of spoken sentences. , 2000, The Journal of the Acoustical Society of America.

[12]  P Bertelson,et al.  Directing spatial attention towards the illusory location of a ventriloquized sound. , 2001, Acta psychologica.

[13]  D. C. Higgins Human Spatial Orientation , 1967, The Yale Journal of Biology and Medicine.

[14]  Naoko Shinozaki,et al.  Spectrotemporal window of integration of auditory information in the human brain. , 2003, Brain research. Cognitive brain research.

[15]  R. Blake,et al.  Neural bases of binocular rivalry , 2006, Trends in Cognitive Sciences.

[16]  Torsten Rahne,et al.  Visual cues can modulate integration and segregation of objects in auditory scene analysis , 2007, Brain Research.

[17]  J. Hupé,et al.  Temporal Dynamics of Auditory and Visual Bistability Reveal Common Principles of Perceptual Organization , 2006, Current Biology.

[18]  Alan Kingstone,et al.  Cross-modal dynamic capture: congruency effects in the perception of motion across sensory modalities. , 2004, Journal of experimental psychology. Human perception and performance.

[19]  H. McGurk,et al.  Hearing lips and seeing voices , 1976, Nature.

[20]  Alexandra Bendixen,et al.  Stable individual characteristics in the perception of multiple embedded patterns in multistable auditory stimuli , 2014, Front. Neurosci..

[21]  Susan Denham,et al.  Multistability in auditory stream segregation: a predictive coding view , 2012, Philosophical Transactions of the Royal Society B: Biological Sciences.

[22]  N. Logothetis,et al.  Visual competition , 2002, Nature Reviews Neuroscience.

[23]  C. Spence,et al.  Multisensory contributions to the perception of motion , 2003, Neuropsychologia.

[24]  Mark T. Wallace,et al.  Crossmodal spatial interactions in subcortical and cortical circuits , 2004 .

[25]  C. Spence,et al.  Attracting attention to the illusory location of a sound: reflexive crossmodal orienting and ventriloquism , 2000, Neuroreport.

[26]  Aleksander Väljamäe,et al.  Multisensory Interactions during Motion Perception , 2012 .

[27]  Albert Postma,et al.  Multisensory integration affects ERP components elicited by exogenous cues , 2008, Experimental Brain Research.

[28]  I. Winkler,et al.  Perceptual bistability in auditory streaming: How much do stimulus features matter? , 2013 .

[29]  K. Fujii,et al.  Visualization for the analysis of fluid motion , 2005, J. Vis..

[30]  I. Winkler,et al.  Modulation-frequency acts as a primary cue for auditory stream segregation , 2013 .

[31]  M. Scherg,et al.  Neuromagnetic Correlates of Streaming in Human Auditory Cortex , 2005, The Journal of Neuroscience.

[32]  M. Posner,et al.  Orienting of Attention* , 1980, The Quarterly journal of experimental psychology.

[33]  Susan L. Denham,et al.  Stability of Perceptual Organisation in Auditory Streaming , 2010 .

[34]  A. Postma,et al.  Spatial attention triggered by unimodal, crossmodal, and bimodal exogenous cues: a comparison of reflexive orienting mechanisms , 2006, Experimental Brain Research.

[35]  Laura A Cook,et al.  Audio-Visual Organisation and the Temporal Ventriloquism Effect between Grouped Sequences: Evidence That Unimodal Grouping Precedes Cross-Modal Integration , 2009, Perception.

[36]  A. Oxenham,et al.  Sequential stream segregation in the absence of spectral cues. , 1999, The Journal of the Acoustical Society of America.

[37]  A. Andreou,et al.  The role of perceived source location in auditory stream segregation: Separation affects sound organization, common fate does not , 2013 .

[38]  I. Nelken,et al.  Modeling the auditory scene: predictive regularity representations and perceptual objects , 2009, Trends in Cognitive Sciences.

[39]  W. H. Sumby,et al.  Visual contribution to speech intelligibility in noise , 1954 .

[40]  M. Kleiner,et al.  Audiovisual interactions in binocular rivalry. , 2010, Journal of vision.

[41]  A. Fort,et al.  Is the auditory sensory memory sensitive to visual information? , 2005, Experimental Brain Research.

[42]  M. Giard,et al.  Auditory-Visual Integration during Multimodal Object Recognition in Humans: A Behavioral and Electrophysiological Study , 1999, Journal of Cognitive Neuroscience.

[43]  L. M. Ward,et al.  Supramodal and modality-specific mechanisms for stimulus-driven shifts of auditory and visual attention. , 1994, Canadian journal of experimental psychology = Revue canadienne de psychologie experimentale.

[44]  I. Winkler,et al.  The role of predictive models in the formation of auditory streams , 2006, Journal of Physiology-Paris.

[45]  K. G. Munhall,et al.  Audiovisual Integration of Speech in a Bistable Illusion , 2009, Current Biology.

[46]  Christopher W. Bishop,et al.  Auditory grouping mechanisms reflect a sound's relative position in a sequence , 2012, Front. Hum. Neurosci..

[47]  Charles Spence,et al.  Multisensory warning signals: when spatial correspondence matters , 2009, Experimental Brain Research.

[48]  C. Spence Crossmodal spatial attention , 2010, Annals of the New York Academy of Sciences.

[49]  Alexandra Bendixen,et al.  The effects of rhythm and melody on auditory stream segregation. , 2014, The Journal of the Acoustical Society of America.

[50]  David Alais,et al.  Multisensory Congruency as a Mechanism for Attentional Control over Perceptual Selection , 2009, The Journal of Neuroscience.

[51]  I. Winkler,et al.  Organizing sound sequences in the human brain: the interplay of auditory streaming and temporal integration 1 1 Published on the World Wide Web on 27 February 2001. , 2001, Brain Research.

[52]  C. Spence,et al.  Exogenous spatial cuing studies of human crossmodal attention and multisensory integration , 2004 .

[53]  J. Hupé,et al.  Bistability for audiovisual stimuli: Perceptual decision is modality specific. , 2008, Journal of vision.

[54]  J. Culling,et al.  Perceptual separation of concurrent speech sounds: absence of across-frequency grouping by common interaural delay. , 1995, The Journal of the Acoustical Society of America.

[55]  N. Logothetis,et al.  Multisensory Influences on Auditory Processing , 2012 .

[56]  T. Rahne,et al.  Visual cues release the temporal coherence of auditory objects in auditory scene analysis , 2009, Brain Research.

[57]  J. Pernier,et al.  Dynamics of cortico-subcortical cross-modal operations involved in audio-visual object detection in humans. , 2002, Cerebral cortex.

[58]  J. Mattingley,et al.  Effects of audio–visual integration on the detection of masked speech and non-speech sounds , 2011, Brain and Cognition.

[59]  C. Micheyl,et al.  Auditory stream segregation on the basis of amplitude-modulation rate. , 2002, The Journal of the Acoustical Society of America.

[60]  G. Recanzone Interactions of auditory and visual stimuli in space and time , 2009, Hearing Research.

[61]  J. Snyder,et al.  Toward a neurophysiological theory of auditory stream segregation. , 2007, Psychological bulletin.

[62]  Alexandra Bendixen,et al.  Regular patterns stabilize auditory streams. , 2010, The Journal of the Acoustical Society of America.

[63]  J. Theeuwes,et al.  Attention and the multiple stages of multisensory integration: A review of audiovisual studies. , 2010, Acta psychologica.

[64]  Charles Spence,et al.  Capturing spatial attention with multisensory cues: A review , 2009, Hearing Research.