Word-level invariant representations from acoustic waveforms
暂无分享,去创建一个
Lorenzo Rosasco | Tomaso A. Poggio | Stephen Voinea | Georgios Evangelopoulos | Chiyuan Zhang | T. Poggio | Chiyuan Zhang | L. Rosasco | Georgios Evangelopoulos | S. Voinea
[1] H. Wold,et al. Some Theorems on Distribution Functions , 1936 .
[2] D. Hubel,et al. Receptive fields, binocular interaction and functional architecture in the cat's visual cortex , 1962, The Journal of physiology.
[3] H. O. Foulkes. Abstract Algebra , 1967, Nature.
[4] Chin-Hui Lee,et al. Word recognition using whole word and subword models , 1989, International Conference on Acoustics, Speech, and Signal Processing,.
[5] Werner Verhelst,et al. An overlap-add technique based on waveform similarity (WSOLA) for high quality time-scale modification of speech , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[6] James R. Glass,et al. Heterogeneous acoustic measurements for phonetic classification 1 , 1997, EUROSPEECH.
[7] Richard Lippmann,et al. Speech recognition by machines and humans , 1997, Speech Commun..
[8] James R. Glass,et al. HETEROGENEOUS ACOUSTIC MEASUREMENTS FOR PHONETIC CLASSIFICATION , 1997 .
[9] Hynek Hermansky,et al. TRAPS - classifiers of temporal patterns , 1998, ICSLP.
[10] Shrikanth S. Narayanan,et al. Acoustics of children's speech: developmental changes of temporal and spectral parameters. , 1999, The Journal of the Acoustical Society of America.
[11] T. Poggio,et al. Hierarchical models of object recognition in cortex , 1999, Nature Neuroscience.
[12] David G. Lowe,et al. Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.
[13] S. Shamma. On the role of space and time in auditory processing , 2001, Trends in Cognitive Sciences.
[14] Roger K. Moore. A comparison of the data requirements of automatic speech recognition systems and human listeners , 2003, INTERSPEECH.
[15] Bill Triggs,et al. Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).
[16] B. Kollmeier,et al. A HUMAN-MACHINE COMPARISON IN SPEECH RECOGNITION BASED ON A LOGATOME CORPUS , 2006 .
[17] Alfred Mertins,et al. Automatic speech recognition and speech variability: A review , 2007, Speech Commun..
[18] Thomas Serre,et al. Robust Object Recognition with Cortex-Like Mechanisms , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[19] Abeer Alwan,et al. Frequency warping for VTLN and speaker adaptation by linear transformation of standard MFCC , 2009, Comput. Speech Lang..
[20] Geoffrey E. Hinton,et al. Learning a better representation of speech soundwaves using restricted boltzmann machines , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[21] Stéphane Mallat,et al. Group Invariant Scattering , 2011, ArXiv.
[22] Honglak Lee,et al. An Analysis of Single-Layer Networks in Unsupervised Feature Learning , 2011, AISTATS.
[23] Richard M. Stern,et al. Hearing Is Believing: Biologically Inspired Methods for Robust Automatic Speech Recognition , 2012, IEEE Signal Processing Magazine.
[24] Andrew L. Maas,et al. Word-level Acoustic Modeling with Convolutional Vector Regression , 2012 .
[25] Aren Jansen,et al. MAP Estimation of Whole-Word Acoustic Models with Dictionary Priors , 2012, INTERSPEECH.
[26] Tara N. Sainath,et al. Deep Neural Networks for Acoustic Modeling in Speech Recognition , 2012 .
[27] Joel Z. Leibo,et al. Unsupervised Learning of Invariant Representations in Hierarchical Architectures , 2013, ArXiv.
[28] D. O'Shaughnessy. Acoustic Analysis for Automatic Speech Recognition , 2013, Proceedings of the IEEE.
[29] Lorenzo Rosasco,et al. GURLS: a least squares library for supervised learning , 2013, J. Mach. Learn. Res..
[30] Joakim Andén,et al. Deep Scattering Spectrum , 2013, IEEE Transactions on Signal Processing.
[31] Gerald Penn,et al. Convolutional Neural Networks for Speech Recognition , 2014, IEEE/ACM Transactions on Audio, Speech, and Language Processing.