Effects of phase on the perception of intervocalic stop consonants

Abstract Identification experiments were performed to assess the relative importance of Fourier phase versus amplitude for intervocalic stop consonant perception. In the first experiment, three types of stimuli were constructed from VCV signals: (1) Swapped stimuli, a swapped stimulus has the amplitude spectra of its consisting segments from one VCV signal and its corresponding phase spectra from another; (2) Phase-only stimuli; and (3) Amplitude-only stimuli. It was shown that the perception of intervocalic stop consonants varies from amplitude dominance to phase dominance as the Fourier analysis window size increases. The crossover lies somewhere between 192 ms and 256 ms. The influence of phase at smaller window sizes was elaborated in the second experiment. This experiment demonstrated that phonetically different signals can be constructed by combining the same short-time amplitude spectra with different phase spectra, so the short-time amplitude spectra displayed on a spectrogram cannot exclusively specify a stop consonant. In both experiments, the perception of voicing in stops was found to rely strongly on phase information while the perception of the place of articulation was mainly determined by amplitude information.

[1]  R B Gardner,et al.  Mistuning a harmonic of a vowel: grouping and phase effects on vowel quality. , 1986, The Journal of the Acoustical Society of America.

[2]  E. Zwicker,et al.  Analytical expressions for critical‐band rate and critical bandwidth as a function of frequency , 1980 .

[3]  Manfred R. Schroeder New Results Concerning Monaural Phase Sensitivity , 1959 .

[4]  Carola Schatz The Role of Context in the Perception of Stops , 1954 .

[5]  R. C. Mathes,et al.  Phase Effects in Monaural Perception , 1947 .

[6]  Gunnar Fant,et al.  Speech sounds and features , 1973 .

[7]  Ronald A. Cole,et al.  The phantom in the phoneme: Invariant cues for stop consonants , 1974 .

[8]  S. Öhman Coarticulation in VCV Utterances: Spectrographic Measurements , 1966 .

[9]  Dennis H. Klatt,et al.  Software for a cascade/parallel formant synthesizer , 1980 .

[10]  M. Hayes The reconstruction of a multidimensional sequence from the phase or magnitude of its Fourier transform , 1982 .

[11]  M R Schroeder,et al.  Flat-spectrum speech. , 1986, The Journal of the Acoustical Society of America.

[12]  Hermann von Helmholtz,et al.  On the Sensations of Tone , 1954 .

[13]  M. E. H. Schouten,et al.  The psychophysics of speech perception , 1987 .

[14]  J. L. Goldstein Auditory spectral filtering and monaural phase perception. , 1967, The Journal of the Acoustical Society of America.

[15]  J M Festen,et al.  Phase effects in a three-component signal. , 1974, The Journal of the Acoustical Society of America.

[16]  H Winitz,et al.  Identification of stops and vowels for the burst portion of (p,t,k) isolated from conversational speech. , 1972, The Journal of the Acoustical Society of America.

[17]  A. Oppenheim,et al.  Signal reconstruction from phase or magnitude , 1980 .

[18]  M.G. Bellanger,et al.  Digital processing of speech signals , 1980, Proceedings of the IEEE.

[19]  J. Reeds,et al.  Identification of Stops and Vowels for the Burst Portion of /p, t, k/ Isolated from Conversational Speech , 1972 .

[20]  A.V. Oppenheim,et al.  The importance of phase in signals , 1980, Proceedings of the IEEE.

[21]  F. A. Firestone,et al.  The Influence of Phase on Tone Quality and Loudness; the Interference of Subjective Harmonics , 1934 .

[22]  A. C. Smit,et al.  The importance of phase in the perception of intervocalic stop consonants , 1992 .

[23]  Jae S. Lim,et al.  Signal Reconstruction from Phas , 1980 .

[24]  Douglas D. O'Shaughnessy,et al.  Speech communication : human and machine , 1987 .

[25]  R. Patterson,et al.  A pulse ribbon model of monaural phase perception. , 1987, The Journal of the Acoustical Society of America.

[26]  B H Repp,et al.  Perception of intervocalic stop consonants: the contributions of closure duration and formant transitions. , 1983, The Journal of the Acoustical Society of America.

[27]  J. C. R. Licklider Effects of Changes in the Phase Pattern upon the Sound of a 16‐Harmonic Tone , 1957 .

[28]  B. McGonigle,et al.  The Role of Context in the Perception of Orientation by the Rat , 1980, Perception.