Year |
Citation |
Score |
2022 |
Kayser H, Hermansky H, Meyer BT. Spatial speech detection for binaural hearing aids using deep phoneme classifiers. Acta Acustica. European Acoustics Association. 6. PMID 36159631 DOI: 10.1051/aacus/2022013 |
0.351 |
|
2020 |
Li R, Wang X, Mallidi SH, Watanabe S, Hori T, Hermansky H. Multi-Stream End-to-End Speech Recognition Ieee/Acm Transactions On Audio, Speech, and Language Processing. 28: 646-655. DOI: 10.1109/TASLP.2019.2959721 |
0.381 |
|
2019 |
Mahajan NR, Mesgarani N, Hermansky H. General properties of auditory spectro-temporal receptive fields. The Journal of the Acoustical Society of America. 146: EL459. PMID 31893764 DOI: 10.1121/1.5135021 |
0.65 |
|
2019 |
Hermansky H. Coding and decoding of messages in human speech communication: Implications for machine recognition of speech Speech Communication. 106: 112-117. DOI: 10.1016/J.SPECOM.2018.12.004 |
0.503 |
|
2019 |
Castro Martinez AM, Gerlach L, Payá-Vayá G, Hermansky H, Ooster J, Meyer BT. DNN-based performance measures for predicting error rates in automatic speech recognition and optimizing hearing aid parameters Speech Communication. 106: 44-56. DOI: 10.1016/j.specom.2018.11.006 |
0.45 |
|
2016 |
Hsiao R, Ma J, Hartmann W, Karafiát M, Grézl F, Burget L, Szöke I, Černocky JH, Watanabe S, Chen Z, Mallidi SH, Hermansky H, Tsakalidis S, Schwartz R. Robust speech recognition in unknown reverberant and noisy conditions 2015 Ieee Workshop On Automatic Speech Recognition and Understanding, Asru 2015 - Proceedings. 533-538. DOI: 10.1109/ASRU.2015.7404841 |
0.516 |
|
2014 |
Ganapathy S, Mallidi SH, Hermansky H. Robust feature extraction using modulation filtering of autoregressive models Ieee Transactions On Audio, Speech and Language Processing. 22: 1285-1295. DOI: 10.1109/Taslp.2014.2329190 |
0.675 |
|
2014 |
Mahajan N, Mesgarani N, Hermansky H. Principal components of auditory spectro-temporal receptive fields Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 1983-1987. |
0.571 |
|
2014 |
Li F, Nidadavolu PS, Hermansky H. A long, deep and wide artificial neural net for robust speech recognition in unknown noise Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 358-362. |
0.322 |
|
2013 |
Garimella S, Hermansky H. Factor analysis of auto-associative neural networks with application in speaker verification. Ieee Transactions On Neural Networks and Learning Systems. 24: 522-8. PMID 24808374 DOI: 10.1109/Tnnls.2012.2236652 |
0.713 |
|
2013 |
Hermansky H. Multistream recognition of speech: Dealing with unknown unknowns Proceedings of the Ieee. 101: 1076-1088. DOI: 10.1109/JPROC.2012.2236871 |
0.4 |
|
2013 |
Jansen A, Dupoux E, Goldwater S, Johnson M, Khudanpur S, Church K, Feldman N, Hermansky H, Metze F, Rose R, Seltzer M, Clark P, McGraw I, Varadarajan B, Bennett E, et al. A summary of the 2012 JHU CLSP workshop on zero resource speech technologies and models of early language acquisition Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 8111-8115. DOI: 10.1109/ICASSP.2013.6639245 |
0.502 |
|
2013 |
Jansen A, Thomas S, Hermansky H. Weak top-down constraints for unsupervised acoustic model training Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 8091-8095. DOI: 10.1109/ICASSP.2013.6639241 |
0.456 |
|
2013 |
Plchot O, Matsoukas S, Matejka P, Dehak N, Ma J, Cumani S, Glembek O, Hermansky H, Mallidi SH, Mesgarani N, Schwartz R, Soufifar M, Tan ZH, Thomas S, Zhang B, et al. Developing a speaker identification system for the DARPA RATS project Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 6768-6772. DOI: 10.1109/ICASSP.2013.6638972 |
0.605 |
|
2013 |
Thomas S, Seltzer ML, Church K, Hermansky H. Deep neural network features and semi-supervised training for low resource speech recognition Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 6704-6708. DOI: 10.1109/ICASSP.2013.6638959 |
0.539 |
|
2013 |
Kintzley K, Jansen A, Hermansky H. Text-to-speech inspired duration modeling for improved whole-word acoustic models Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 1253-1257. |
0.328 |
|
2013 |
Variani E, Li F, Hermansky H. Multi-stream recognition of noisy speech with performance monitoring Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 2978-2981. |
0.37 |
|
2013 |
Mallidi SH, Ganapathy S, Hermansky H. Robust speaker recognition using spectro-temporal autoregressive models Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 3689-3693. |
0.327 |
|
2012 |
Ganapathy S, Hermansky H. Temporal resolution analysis in frequency domain linear prediction. The Journal of the Acoustical Society of America. 132: EL436-42. PMID 23145707 DOI: 10.1121/1.4758826 |
0.661 |
|
2012 |
Sivaram GSVS, Hermansky H. Sparse multilayer perceptron for phoneme recognition Ieee Transactions On Audio, Speech and Language Processing. 20: 23-29. DOI: 10.1109/TASL.2011.2129510 |
0.412 |
|
2012 |
Garimella S, Mallidi SH, Hermansky H. Regularized auto-associative neural networks for speaker verification Ieee Signal Processing Letters. 19: 841-844. DOI: 10.1109/Lsp.2012.2221706 |
0.725 |
|
2012 |
Thomas S, Ganapathy S, Hermansky H. Multilingual MLP features for low-resource LVCSR systems Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 4269-4272. DOI: 10.1109/ICASSP.2012.6288862 |
0.624 |
|
2012 |
Garcia-Romero D, Zhou X, Zotkin D, Srinivasan B, Luo Y, Ganapathy S, Thomas S, Nemala S, Sivaram GSVS, Mirbagheri M, Mallidi SH, Janu T, Rajan P, Mesgarani N, Elhilali M, ... Hermansky H, et al. The UMD-JHU 2011 speaker recognition system Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 4229-4232. DOI: 10.1109/ICASSP.2012.6288852 |
0.769 |
|
2012 |
Ikbal S, Misra H, Hermansky H, Magimai-Doss M. Phase AutoCorrelation (PAC) features for noise robust speech recognition Speech Communication. 54: 867-880. DOI: 10.1016/j.specom.2012.02.005 |
0.471 |
|
2012 |
Thomas S, Mallidi SH, Janu T, Hermansky H, Mesgarani N, Zhou X, Shamma S, Ng T, Zhang B, Nguyen L, Matsoukas S. Acoustic and data-driven features for robust speech activity detection 13th Annual Conference of the International Speech Communication Association 2012, Interspeech 2012. 3: 1983-1986. |
0.736 |
|
2012 |
Jansen A, Thomas S, Hermansky H. Intrinsic spectral analysis for zero and high resource speech recognition 13th Annual Conference of the International Speech Communication Association 2012, Interspeech 2012. 1: 878-881. |
0.36 |
|
2012 |
Thomas S, Ganapathy S, Jansen A, Hermansky H. Data-driven posterior features for low resource speech recognition applications 13th Annual Conference of the International Speech Communication Association 2012, Interspeech 2012. 1: 790-793. |
0.375 |
|
2011 |
Mesgarani N, Thomas S, Hermansky H. Toward optimizing stream fusion in multistream recognition of speech. The Journal of the Acoustical Society of America. 130: EL14-8. PMID 21786862 DOI: 10.1121/1.3595744 |
0.743 |
|
2011 |
Hermansky H. Dealing with unknown unknowns in speech The Journal of the Acoustical Society of America. 130: 2408-2408. DOI: 10.1121/1.3654655 |
0.428 |
|
2011 |
Pinto J, Garimella S, Magimai-Doss M, Hermansky H, Bourlard H. Analysis of MLP-based hierarchical phoneme posterior probability estimator Ieee Transactions On Audio, Speech and Language Processing. 19: 225-241. DOI: 10.1109/Tasl.2010.2045943 |
0.395 |
|
2011 |
Thomas S, Nguyen P, Zweig G, Hermansky H. MLP based phoneme detectors for automatic speech recognition Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 5024-5027. DOI: 10.1109/ICASSP.2011.5947485 |
0.598 |
|
2011 |
Ganapathy S, Rajan P, Hermansky H. Multi-layer perceptron based speech activity detection for speaker verification Ieee Workshop On Applications of Signal Processing to Audio and Acoustics. 321-324. DOI: 10.1109/ASPAA.2011.6082323 |
0.569 |
|
2011 |
Hermansky H. Speech recognition from spectral dynamics Sadhana - Academy Proceedings in Engineering Sciences. 36: 729-744. DOI: 10.1007/s12046-011-0044-2 |
0.518 |
|
2011 |
Hermansky H. Dealing with unexpected words in automatic recognition of speech Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 6836: 1-15. DOI: 10.1007/978-3-642-23538-2_1 |
0.372 |
|
2011 |
Mallidi SH, Ganapathy S, Hermansky H. Modulation spectrum analysis for recognition of reverberant speech Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 189-192. |
0.412 |
|
2011 |
Mesgarani N, Thomas S, Hermansky H. Adaptive stream fusion in multistream recognition of speech Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 2329-2332. |
0.671 |
|
2010 |
Ganapathy S, Thomas S, Hermansky H. Temporal envelope compensation for robust phoneme recognition using modulation spectrum. The Journal of the Acoustical Society of America. 128: 3769-80. PMID 21218908 DOI: 10.1121/1.3504658 |
0.754 |
|
2010 |
Motlicek P, Ganapathy S, Hermansky H, Garudadri H. Wide-band audio coding based on frequency-domain linear prediction Eurasip Journal On Audio, Speech, and Music Processing. 2010. DOI: 10.1155/2010/856280 |
0.656 |
|
2010 |
Motlicek P, Ganapathy S, Hermansky H, Garudadri H. Wide-Band Audio Coding Based on Frequency-Domain Linear Prediction Eurasip Journal On Audio, Speech, and Music Processing. 2010: 1-14. DOI: 10.1155/2010/856280 |
0.532 |
|
2010 |
Hermansky H. Posterior‐based attributes in machine recognition of speech. The Journal of the Acoustical Society of America. 127: 2041-2041. DOI: 10.1121/1.3385373 |
0.483 |
|
2010 |
Ganapathy S, Motlicek P, Hermansky H. Autoregressive models of amplitude modulations in audio compression Ieee Transactions On Audio, Speech and Language Processing. 18: 1624-1631. DOI: 10.1109/Tasl.2009.2038813 |
0.628 |
|
2010 |
Sivaram GSVS, Nemala SK, Mesgarani N, Hermansky H. Data-driven and feedback based spectro-temporal features for speech recognition Ieee Signal Processing Letters. 17: 957-960. DOI: 10.1109/Lsp.2010.2079930 |
0.691 |
|
2010 |
Liu SC, Mesgarani N, Harris J, Hermansky H. The use of spike-based representations for hardware audition systems Iscas 2010 - 2010 Ieee International Symposium On Circuits and Systems: Nano-Bio Circuit Fabrics and Systems. 505-508. DOI: 10.1109/ISCAS.2010.5537588 |
0.558 |
|
2010 |
Ganapathy S, Thomas S, Hermansky H. Robust spectro-temporal features based on autoregressive models of Hilbert envelopes Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 4286-4289. DOI: 10.1109/ICASSP.2010.5495668 |
0.681 |
|
2010 |
Sivaram GSVS, Nemala SK, Elhilali M, Tran TD, Hermansky H. Sparse coding for speech recognition Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 4346-4349. DOI: 10.1109/ICASSP.2010.5495649 |
0.332 |
|
2010 |
Ganapathy S, Thomas S, Hermansky H. Comparison of modulation features for phoneme recognition Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 5038-5041. DOI: 10.1109/ICASSP.2010.5495057 |
0.699 |
|
2010 |
Thomas S, Patil K, Ganapathy S, Mesgarani N, Hermansky H. A phoneme recognition framework based on auditory spectro-temporal receptive fields Proceedings of the 11th Annual Conference of the International Speech Communication Association, Interspeech 2010. 2458-2461. |
0.628 |
|
2010 |
Mesgarani N, Thomas S, Hermansky H. A multistream multiresolution framework for phoneme recognition Proceedings of the 11th Annual Conference of the International Speech Communication Association, Interspeech 2010. 318-321. |
0.645 |
|
2009 |
Ganapathy S, Thomas S, Hermansky H. Modulation frequency features for phoneme recognition in noisy speech. The Journal of the Acoustical Society of America. 125: EL8-12. PMID 19173383 DOI: 10.1121/1.3040022 |
0.762 |
|
2009 |
Hermansky H. Nonlinear mapping for feature extraction in automatic speech recognition The Journal of the Acoustical Society of America. 125: 4109. DOI: 10.1121/1.3155499 |
0.449 |
|
2009 |
Thomas S, Ganapathy S, Hermansky H. Phoneme recognition using spectral envelope and modulation frequency features Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 4453-4456. DOI: 10.1109/ICASSP.2009.4960618 |
0.695 |
|
2009 |
Ganapathy S, Thomas S, Hermansky H. Temporal envelope subtraction for robust speech recognition using modulation spectrum Proceedings of the 2009 Ieee Workshop On Automatic Speech Recognition and Understanding, Asru 2009. 164-169. DOI: 10.1109/ASRU.2009.5372922 |
0.731 |
|
2009 |
Ganapathy S, Thomas S, Motlicek P, Hermansky H. Applications of signal analysis using autoregressive models for amplitude modulation Ieee Workshop On Applications of Signal Processing to Audio and Acoustics. 341-344. DOI: 10.1109/ASPAA.2009.5346495 |
0.621 |
|
2009 |
Ganapathy S, Motlicek P, Hermansky H. Error resilient speech coding using sub-band hilbert envelopes Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 5729: 355-362. DOI: 10.1007/978-3-642-04208-9_49 |
0.541 |
|
2009 |
Thomas S, Ganapathy S, Hermansky H. Tandem representations of spectral envelope and modulation frequency features for ASR Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 2955-2958. |
0.305 |
|
2009 |
Mesgarani N, Sivaram GSVS, Nemala SK, Elhilali M, Hermansky H. Discriminant spectrotemporal features for phoneme recognition Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 2983-2986. |
0.662 |
|
2009 |
Ganapathy S, Thomas S, Hermansky H. Static and dynamic modulation spectrum for speech recognition Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 2823-2826. |
0.329 |
|
2009 |
Kombrink S, Burget L, Matějka P, Karafiát M, Hermansky H. Posterior-based out of vocabulary word detection in telephone speech Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 80-83. |
0.317 |
|
2008 |
Thomas S, Ganapathy S, Hermansky H. Recognition of reverberant speech using frequency domain linear prediction Ieee Signal Processing Letters. 15: 681-684. DOI: 10.1109/Lsp.2008.2002708 |
0.76 |
|
2008 |
Ganapathy S, Motlicek P, Hermansky H, Garudadri H. Temporal masking for bit-rate reduction in audio codec based on Frequency Domain Linear Prediction Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 4781-4784. DOI: 10.1109/ICASSP.2008.4518726 |
0.538 |
|
2008 |
Krishnan Parthasarathi SH, Motlíček P, Hermansky H. Exploiting contextual information for speech/non-speech detection Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 5246: 451-459. DOI: 10.1007/978-3-540-87391-4_58 |
0.316 |
|
2008 |
Motlíček P, Ganapathy S, Hermansky H, Garudadri H, Athineos M. Perceptually motivated sub-band decomposition for FDLP audio coding Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 5246: 435-442. DOI: 10.1007/978-3-540-87391-4_56 |
0.499 |
|
2008 |
Thomas S, Ganapathy S, Hermansky H. Hilbert envelope based features for far-field speech recognition Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 5237: 119-124. DOI: 10.1007/978-3-540-85853-9-11 |
0.394 |
|
2008 |
Motlicek P, Ganapathy S, Hermansky H, Garudadri H. Frequency domain linear prediction for QMF sub-bands and applications to audio coding Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 4892: 248-258. DOI: 10.1007/978-3-540-78155-4_22 |
0.525 |
|
2008 |
Thomas S, Ganapathy S, Hermansky H. Spectro-temporal features for automatic speech recognition using linear prediction in spectral domain European Signal Processing Conference. |
0.394 |
|
2008 |
Ganapathy S, Thomas S, Hermansky H. Front-end for far-field speech recognition based on frequency domain linear prediction Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 984-987. |
0.31 |
|
2008 |
Ganapathy S, Motlicek P, Hermansky H, Garudadri H. Spectral noise shaping: Improvements in speech/audio codec based on linear prediction in spectral domain Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 675-678. |
0.304 |
|
2008 |
Sivaram GSVS, Hermansky H. Introducing temporal asymmetries in feature extraction for automatic speech recognition Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 890-893. |
0.411 |
|
2008 |
Thomas S, Ganapathy S, Hermansky H. Hilbert envelope based spectro-temporal features for phoneme recognition in telephone speech Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 1521-1524. |
0.458 |
|
2007 |
Prasanna SRM, Hermansky H. MRASTA and PLP in automatic speech recognition Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 1: 137-140. |
0.416 |
|
2005 |
Morgan N, Zhu Q, Stolcke A, Sönmez K, Sivadas S, Shinozaki T, Ostendorf M, Jain P, Hermansky H, Ellis D, Doddington G, Chen B, Çetin O, Bourlard H, Athineos M. Pushing the envelope - Aside Ieee Signal Processing Magazine. 22: 81-88. DOI: 10.1109/Msp.2005.1511826 |
0.505 |
|
2004 |
Ikbal S, Misra H, Bourlard H, Hermansky H. Phase AutoCorrelation (PAC) features in entropy based multi-stream for robust speech recognition Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 1: I205-I208. |
0.377 |
|
2003 |
Hermansky H. Recognition of information‐bearing elements in speech The Journal of the Acoustical Society of America. 114: 2424-2424. DOI: 10.1121/1.4778809 |
0.496 |
|
2003 |
Hermansky H. TRAP-TANDEM: Data-driven extraction of temporal features from speech 2003 Ieee Workshop On Automatic Speech Recognition and Understanding, Asru 2003. 255-260. DOI: 10.1109/ASRU.2003.1318450 |
0.343 |
|
2003 |
Malayath N, Hermansky H. Data-driven spectral basis functions for automatic speech recognition Speech Communication. 40: 449-466. DOI: 10.1016/S0167-6393(02)00127-9 |
0.515 |
|
2000 |
Hermansky H. Method and system for generating an estimated clean speech signal from a noisy speech signal The Journal of the Acoustical Society of America. 107: 1816. DOI: 10.1121/1.428550 |
0.415 |
|
2000 |
Yang HH, Van Vuuren S, Sharma S, Hermansky H. Relevance of time-frequency features for phonetic and speaker-channel classification Speech Communication. 31: 35-50. DOI: 10.1016/S0167-6393(00)00007-8 |
0.406 |
|
2000 |
Malayath N, Hermansky H, Kajarekar S, Yegnanarayana B. Data-driven temporal filters and alternatives to GMM in speaker verification Digital Signal Processing: a Review Journal. 10: 55-74. DOI: 10.1006/dspr.1999.0363 |
0.363 |
|
2000 |
Kajarekar SS, Hermansky H. Analysis of information in speech and its application in speech recognition Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 1902: 283-288. |
0.357 |
|
1999 |
Arai T, Pavel M, Hermansky H, Avendano C. Syllable intelligibility for temporally filtered LPC cepstral trajectories. The Journal of the Acoustical Society of America. 105: 2783-91. PMID 10335630 DOI: 10.1121/1.426895 |
0.359 |
|
1999 |
Hermansky H. Data‐driven speech analysis for ASR The Journal of the Acoustical Society of America. 105: 1352-1352. DOI: 10.1121/1.426410 |
0.505 |
|
1999 |
Sharma S, Hermansky H. Recognition of speech from temporal patterns The Journal of the Acoustical Society of America. 105: 1158-1158. DOI: 10.1121/1.425505 |
0.499 |
|
1999 |
Kanedera N, Arai T, Hermansky H, Pavel M. On the relative importance of various components of the modulation spectrum for automatic speech recognition Speech Communication. 28: 43-55. DOI: 10.1016/S0167-6393(99)00002-3 |
0.499 |
|
1999 |
Yegnanarayana B, Avendano C, Hermansky H, Satyanarayana Murthy P. Speech enhancement using linear prediction residual Speech Communication. 28: 25-42. DOI: 10.1016/S0167-6393(98)00070-3 |
0.453 |
|
1998 |
Kanedera N, Hermansky H, Arai T. On properties of modulation spectrum for robust automatic speech recognition Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 2: 613-616. DOI: 10.1109/ICASSP.1998.675339 |
0.408 |
|
1998 |
Hermansky H. Should recognizers have ears? Speech Communication. 25: 3-27. DOI: 10.1016/S0167-6393(98)00027-2 |
0.466 |
|
1997 |
Hermansky H, Morgan NH. Noise resistant auditory model for parameterization of speech The Journal of the Acoustical Society of America. 101: 2426. DOI: 10.1121/1.418514 |
0.41 |
|
1997 |
Avendano C, Hermansky H. On the effects of short-term spectrum smoothing in channel normalization Ieee Transactions On Speech and Audio Processing. 5: 372-374. DOI: 10.1109/89.593318 |
0.301 |
|
1996 |
Hermansky H. Beyond a ‘‘short‐term’’ analysis of speech The Journal of the Acoustical Society of America. 100: 2792-2792. DOI: 10.1121/1.416495 |
0.475 |
|
1996 |
Arai T, Pavel M, Hermansky H, Avendano C. Intelligibility of speech with filtered time trajectories of LPC cepstrum The Journal of the Acoustical Society of America. 100: 2756-2756. DOI: 10.1121/1.416322 |
0.458 |
|
1996 |
Bourlard H, Hermansky H, Morgan N. Towards increasing speech recognition error rates Speech Communication. 18: 205-231. DOI: 10.1016/0167-6393(96)00003-9 |
0.438 |
|
1995 |
Cole R, Hermansky H, Novick DG, Oviatt S, Hirschman L, Atlas L, Beckman M, Biermann A, Bush M, Clements M, Cohen J, Garcia O, Hanson B, Levinson S, McKeown K, et al. The Challenge of Spoken Language Systems: Research Directions for the Nineties Ieee Transactions On Speech and Audio Processing. 3: 1-21. DOI: 10.1109/89.365385 |
0.384 |
|
1995 |
Morgan N, Bourlard H, Greenberg S, Hermansky H, Wu SL. Stochastic perceptual models of speech Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 1: 397-400. |
0.312 |
|
1994 |
Pavel M, Hermansky H. Temporal masking in automatic speech recognition The Journal of the Acoustical Society of America. 95: 2876-2876. DOI: 10.1121/1.409409 |
0.527 |
|
1994 |
Hermansky H, Morgan N. RASTA Processing of Speech Ieee Transactions On Speech and Audio Processing. 2: 578-589. DOI: 10.1109/89.326616 |
0.49 |
|
1993 |
Junqua JC, Wakita H, Hermansky H. Evaluation and Optimization of Perceptually-Based ASR Front-End Ieee Transactions On Speech and Audio Processing. 1: 39-48. DOI: 10.1109/89.221366 |
0.459 |
|
1993 |
Hermansky H, Morgan N, Hirsch HG. Recognition of speech in additive and convolutional noise based on RASTA spectral processing Proceedings - Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing. 2: II-83-II-86. |
0.334 |
|
1991 |
Morgan N, Hermansky H, Bourlard H, Kohn P, Wooters C. Continuous speech recognition using PLP analysis with multilayer perceptrons Proceedings - Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing. 1: 49-52. |
0.392 |
|
1990 |
Hermansky H. Perceptual linear predictive (PLP) analysis of speech. The Journal of the Acoustical Society of America. 87: 1738-52. PMID 2341679 DOI: 10.1121/1.399423 |
0.458 |
|
1990 |
Hermansky H, Cox TL. Synthesis of speech from the low‐dimensional PLP representation The Journal of the Acoustical Society of America. 88: S179-S180. DOI: 10.1121/1.2028800 |
0.425 |
|
1988 |
Terry M, Hermansky H. Comparison of standard ASR front ends and auditory models in neural net‐based automatic speech recognition The Journal of the Acoustical Society of America. 83: S53-S53. DOI: 10.1121/1.2025401 |
0.441 |
|
1987 |
Hermansky H. Should ASR front‐end be insensitive to fundamental frequency? (perceptual shift of formant position due to fine harmonic structure of voiced speech The Journal of the Acoustical Society of America. 82: S36-S36. DOI: 10.1121/1.2024778 |
0.384 |
|
1987 |
Hermansky H. Why is the formant frequency difference limen asymmetric? The Journal of the Acoustical Society of America. 81: S18-S18. DOI: 10.1121/1.2024129 |
0.355 |
|
1986 |
Hermansky H, Javkin HR. Evaluation of ASR front ends using synthetic vowel‐like sounds The Journal of the Acoustical Society of America. 80: S18-S18. DOI: 10.1121/1.2023687 |
0.316 |
|
1986 |
Tsuga K, Hermansky H. Effect of the spectral model order in automatic speech recognition The Journal of the Acoustical Society of America. 80: S18-S18. DOI: 10.1121/1.2023684 |
0.352 |
|
1985 |
Hanson BA, Hermansky H, Wakita H. Root‐power sums and spectral slope distortion measures for all‐pole models of speech The Journal of the Acoustical Society of America. 78: S49-S49. DOI: 10.1121/1.2022847 |
0.407 |
|
1985 |
Hermansky H, Hanson BA, Wakita H. Low-dimensional representation of vowels based on all-pole modeling in the psychophysical domain Speech Communication. 4: 181-187. DOI: 10.1016/0167-6393(85)90045-7 |
0.451 |
|
1984 |
Hermansky H, Hanson BA, Wakita H. Critical‐band‐weighted linear prediction of speech The Journal of the Acoustical Society of America. 76: S1-S1. DOI: 10.1121/1.2021743 |
0.406 |
|
Show low-probability matches. |