Year |
Citation |
Score |
2022 |
Gao H, Ni J, Zhang Y, Qian K, Chang S, Hasegawa-Johnson M. Domain Generalization for Language-Independent Automatic Speech Recognition. Frontiers in Artificial Intelligence. 5: 806274. PMID 35647534 DOI: 10.3389/frai.2022.806274 |
0.311 |
|
2021 |
Li J, Hasegawa-Johnson M, McElwain NL. Analysis of acoustic and voice quality features for the classification of infant and mother vocalizations. Speech Communication. 133: 41-61. PMID 36062214 DOI: 10.1016/j.specom.2021.07.010 |
0.321 |
|
2020 |
Wang L, Hasegawa-Johnson M. Multimodal Word Discovery and Retrieval With Spoken Descriptions and Visual Concepts Ieee/Acm Transactions On Audio, Speech, and Language Processing. 28: 1560-1573. DOI: 10.1109/Taslp.2020.2996082 |
0.45 |
|
2020 |
Scharenborg O, Besacier L, Black A, Hasegawa-Johnson M, Metze F, Neubig G, Stuker S, Godard P, Muller M, Ondel L, Palaskar S, Arthur P, Ciannella F, Du M, Larsen E, et al. Speech Technology for Unwritten Languages Ieee/Acm Transactions On Audio, Speech, and Language Processing. 28: 964-975. DOI: 10.1109/Taslp.2020.2973896 |
0.488 |
|
2018 |
He D, Lim BP, Yang X, Hasegawa-Johnson M, Chen D. Acoustic landmarks contain more information about the phone string than other frames for automatic speech recognition with deep neural network acoustic model. The Journal of the Acoustical Society of America. 143: 3207. PMID 29960420 DOI: 10.1121/1.5039837 |
0.705 |
|
2017 |
He D, Lim BPP, Yang X, Hasegawa-Johnson M, Chen D. Selecting frames for automatic speech recognition based on acoustic landmarks Journal of the Acoustical Society of America. 141: 3468-3468. DOI: 10.1121/1.4987204 |
0.525 |
|
2017 |
Kong X, Yang X, Hasegawa-Johnson M, Choi J, Shattuck-Hufnagel S. Landmark-based consonant voicing detection on multilingual corpora Journal of the Acoustical Society of America. 141: 3468-3468. DOI: 10.1121/1.4987203 |
0.743 |
|
2016 |
Chen W, Hasegawa-Johnson M, Chen NF. Mismatched Crowdsourcing based Language Perception for Under-resourced Languages Procedia Computer Science. 81: 23-29. DOI: 10.1016/j.procs.2016.04.025 |
0.328 |
|
2016 |
Livescu K, Rudzicz F, Fosler-Lussier E, Hasegawa-Johnson M, Bilmes J. Speech Production in Speech Technologies: Introduction to the CSL Special Issue Computer Speech and Language. 36: 165-172. DOI: 10.1016/J.Csl.2015.11.002 |
0.465 |
|
2015 |
Zhang Y, Ou Z, Hasegawa-Johnson M. Incorporating AM-FM effect in voiced speech for probabilistic acoustic tube model 2015 Ieee Workshop On Applications of Signal Processing to Audio and Acoustics, Waspaa 2015. DOI: 10.1109/WASPAA.2015.7336905 |
0.415 |
|
2015 |
Huang PS, Kim M, Hasegawa-Johnson M, Smaragdis P. Joint Optimization of Masks and Deep Recurrent Neural Networks for Monaural Source Separation Ieee/Acm Transactions On Speech and Language Processing. 23: 2136-2147. DOI: 10.1109/Taslp.2015.2468583 |
0.403 |
|
2015 |
Chen K, Hasegawa-Johnson M. Improving the robustness of prosody dependent language modeling based on prosody syntax dependence 2003 Ieee Workshop On Automatic Speech Recognition and Understanding, Asru 2003. 435-440. DOI: 10.1109/ASRU.2003.1318480 |
0.335 |
|
2015 |
Pietrowicz M, Hasegawa-Johnson M, Karahalios K. Acoustic correlates for perceived effort levels in expressive speech Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 2015: 3720-3724. |
0.35 |
|
2015 |
Jyothi P, Hasegawa-Johnson M. Acquiring speech transcriptions using mismatched crowdsourcing Proceedings of the National Conference On Artificial Intelligence. 2: 1263-1269. |
0.336 |
|
2015 |
Jyothi P, Hasegawa-Johnson M. Transcribing continuous speech using mismatched crowdsourcing Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 2015: 2774-2778. |
0.386 |
|
2014 |
Chen A, Hasegawa-Johnson MA. Mixed stereo audio classification using a stereo-input mixed-to-panned level feature Ieee/Acm Transactions On Speech and Language Processing. 22: 2025-2033. DOI: 10.1109/TASLP.2014.2359628 |
0.435 |
|
2014 |
Zhang Y, Ou Z, Hasegawa-Johnson M. Improvement of Probabilistic Acoustic Tube model for speech decomposition Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 7929-7933. DOI: 10.1109/ICASSP.2014.6855144 |
0.331 |
|
2014 |
Khasanova A, Cole J, Hasegawa-Johnson M. Detecting articulatory compensation in acoustic data through linear regression modeling Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 925-929. |
0.308 |
|
2014 |
Jyothi P, Cole J, Hasegawa-Johnson M, Puri V. An investigation of prosody in Hindi narrative speech Proceedings of the International Conference On Speech Prosody. 623-627. |
0.398 |
|
2013 |
Yoon S, Pierce L, Huensch A, Juul E, Perkins S, Sproat R, Hasegawa-Johnson M. Construction of a Rated Speech Corpus of L2 Learners' Spontaneous Speech Calico Journal. 26: 662-673. DOI: 10.1558/Cj.V26I3.662-673 |
0.375 |
|
2013 |
Sharma HV, Hasegawa-Johnson M. Acoustic model adaptation using in-domain background models for dysarthric speech recognition Computer Speech and Language. 27: 1147-1162. DOI: 10.1016/J.Csl.2012.10.002 |
0.751 |
|
2012 |
Nam H, Mitra V, Tiede M, Hasegawa-Johnson M, Espy-Wilson C, Saltzman E, Goldstein L. A procedure for estimating gestural scores from speech acoustics. The Journal of the Acoustical Society of America. 132: 3980-9. PMID 23231127 DOI: 10.1121/1.4763545 |
0.497 |
|
2012 |
Rong P, Loucks T, Kim H, Hasegawa-Johnson M. Relationship between kinematics, F2 slope and speech intelligibility in dysarthria due to cerebral palsy. Clinical Linguistics & Phonetics. 26: 806-22. PMID 22876770 DOI: 10.3109/02699206.2012.706686 |
0.328 |
|
2012 |
Tang H, Chu SM, Hasegawa-Johnson M, Huang TS. Partially supervised speaker clustering. Ieee Transactions On Pattern Analysis and Machine Intelligence. 34: 959-71. PMID 21844626 DOI: 10.1109/Tpami.2011.174 |
0.357 |
|
2012 |
Mertens R, Huang P, Gottlieb L, Friedland G, Divakaran A, Hasegawa-Johnson M. On the Applicability of Speaker Diarization to Audio Indexing of Non-Speech and Mixed Non-Speech/Speech Video Soundtracks International Journal of Multimedia Data Engineering and Management. 3: 1-19. DOI: 10.4018/Jmdem.2012070101 |
0.492 |
|
2012 |
Kim H, Hasegawa-Johnson M. Second-formant locus patterns in dysarthric speech The Journal of the Acoustical Society of America. 132: 2089-2089. DOI: 10.1121/1.4755719 |
0.379 |
|
2012 |
Ozbek IY, Hasegawa-Johnson M, Demirekler M. On Improving Dynamic State Space Approaches to Articulatory Inversion With MAP-Based Parameter Estimation Ieee Transactions On Audio, Speech, and Language Processing. 20: 67-81. DOI: 10.1109/Tasl.2011.2157496 |
0.353 |
|
2012 |
Cole J, Hasegawa-Johnson M, Loehr D, Guilder LV, Reetz H, Frisch SA. Corpora, Databases, and Internet Resources: Corpus Phonology with Speech Resources Using The Internet For Collecting Phonological Data Speech Manipulation, Synthesis, and Automatic Recognition in Laboratory Phonology Phonotactic Patterns in Lexical Corpora The Oxford Handbook of Laboratory Phonology. DOI: 10.1093/oxfordhb/9780199575039.013.0017 |
0.366 |
|
2012 |
Mathur S, Poole MS, Peña-Mora F, Hasegawa-Johnson M, Contractor N. Detecting interaction links in a collaborating group using manually annotated data Social Networks. 34: 515-526. DOI: 10.1016/J.Socnet.2012.04.002 |
0.32 |
|
2012 |
Kim LH, Hasegawa-Johnson M. Optimal multi-microphone speech enhancement in cars Digital Signal Processing For in-Vehicle Systems and Safety. 195-204. DOI: 10.1007/978-1-4419-9607-7_13 |
0.344 |
|
2011 |
Kim H, Hasegawa-Johnson M, Perlman A. Vowel contrast and speech intelligibility in dysarthria. Folia Phoniatrica Et Logopaedica : Official Organ of the International Association of Logopedics and Phoniatrics (Ialp). 63: 187-94. PMID 20938200 DOI: 10.1159/000318881 |
0.364 |
|
2011 |
Hasegawa-Johnson MA, Huang J, King S, Zhou X. Normalized recognition of speech and audio events The Journal of the Acoustical Society of America. 130: 2524-2524. DOI: 10.1121/1.3655075 |
0.311 |
|
2011 |
Kim H, Hasegawa-Johnson M, Perlman A. Temporal and spectral characteristics of fricatives in dysarthria The Journal of the Acoustical Society of America. 130: 2446-2446. DOI: 10.1121/1.3654821 |
0.455 |
|
2011 |
Hasegawa-Johnson MA, Huang J, Zhuang X. Semi-supervised learning for speech and audio processing The Journal of the Acoustical Society of America. 130: 2408-2408. DOI: 10.1121/1.3654654 |
0.628 |
|
2011 |
Ozbek İY, Hasegawa-Johnson M, Demirekler M. Estimation of Articulatory Trajectories Based on Gaussian Mixture Model (GMM) With Audio-Visual Information Fusion and Dynamic Kalman Smoothing Ieee Transactions On Audio, Speech, and Language Processing. 19: 1180-1195. DOI: 10.1109/Tasl.2010.2087751 |
0.374 |
|
2011 |
Lobdell BE, Allen JB, Hasegawa-Johnson MA. Intelligibility predictors and neural representation of speech Speech Communication. 53: 185-194. DOI: 10.1016/J.Specom.2010.08.016 |
0.737 |
|
2011 |
Hasegawa-Johnson M, Goudeseune C, Cole J, Kaczmarski H, Kim H, King S, Mahrt T, Huang JT, Zhuang X, Lin KH, Sharma HV, Li Z, Huang TS. Multimodal speech and audio user interfaces for K-12 outreach Apsipa Asc 2011 - Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2011. 526-531. |
0.648 |
|
2011 |
Mahrt T, Huang JT, Mo Y, Fleck M, Hasegawa-Johnson M, Cole J. Optimal models of prosodic prominence using the Bayesian information criterion Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 2037-2040. |
0.336 |
|
2010 |
Kim H, Martin K, Hasegawa-Johnson M, Perlman A. Frequency of consonant articulation errors in dysarthric speech. Clinical Linguistics & Phonetics. 24: 759-70. PMID 20831376 DOI: 10.3109/02699206.2010.497238 |
0.398 |
|
2010 |
Cole J, Mo Y, Hasegawa-Johnson M. Signal-based and expectation-based factors in the perception of prosodic prominence Laboratory Phonology. 1: 425-452. DOI: 10.1515/Labphon.2010.022 |
0.445 |
|
2010 |
Tang H, Hasegawa-Johnson M, Huang T. A novel vector representation of stochastic signals based on adapted ergodic HMMs Ieee Signal Processing Letters. 17: 715-718. DOI: 10.1109/Lsp.2010.2051945 |
0.39 |
|
2010 |
Zhuang X, Zhou X, Hasegawa-Johnson MA, Huang TS. Real-world acoustic event detection Pattern Recognition Letters. 31: 1543-1551. DOI: 10.1016/J.Patrec.2010.02.005 |
0.633 |
|
2010 |
Kim LH, Kim KT, Hasegawa-Johnson M. Robust automatic speech recognition with decoder oriented ideal binary mask estimation Proceedings of the 11th Annual Conference of the International Speech Communication Association, Interspeech 2010. 2066-2069. |
0.324 |
|
2009 |
Huang TS, Hasegawa-Johnson MA, Chu SM, Zeng Z, Tang H. Sensitive Talking Heads Ieee Signal Processing Magazine. 26: 67-72. DOI: 10.1109/Msp.2009.932562 |
0.307 |
|
2009 |
Huang JT, Zhou X, Hasegawa-Johnson M, Huang T. Kernel metric learning for phonetic classification Proceedings of the 2009 Ieee Workshop On Automatic Speech Recognition and Understanding, Asru 2009. 141-145. DOI: 10.1109/ASRU.2009.5373389 |
0.392 |
|
2009 |
Sharma HV, Hasegawa-Johnson M. Universal access: Speech recognition for talkers with spastic dysarthria Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 1451-1454. |
0.737 |
|
2008 |
Kim LH, Hasegawa-Johnson M, Lim JS, Sung KM. Acoustic model for robustness analysis of optimal multipoint room equalization. The Journal of the Acoustical Society of America. 123: 2043-53. PMID 18397012 DOI: 10.1121/1.2837285 |
0.535 |
|
2008 |
Tang H, Fu Y, Tu J, Hasegawa-Johnson M, Huang TS. Humanoid audio-visual avatar with emotive text-to-speech synthesis Ieee Transactions On Multimedia. 10: 969-981. DOI: 10.1109/Tmm.2008.2001355 |
0.335 |
|
2008 |
Kantor A, Hasegawa-Johnson M. Stream weight tuning in dynamic Bayesian networks Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 4525-4528. DOI: 10.1109/ICASSP.2008.4518662 |
0.565 |
|
2008 |
Zhuang X, Zhou X, Huang TS, Hasegawa-Johnson M. Feature analysis and selection for acoustic event detection Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 17-20. DOI: 10.1109/ICASSP.2008.4517535 |
0.382 |
|
2008 |
Zhou X, Zhuang X, Liu M, Tang H, Hasegawa-Johnson M, Huang T. HMM-based acoustic event detection with adaboost feature selection Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 4625: 345-353. DOI: 10.1007/978-3-540-68585-2_33 |
0.345 |
|
2008 |
Zhuang X, Hasegawa-Johnson M. Towards interpretation of creakiness in switchboard Proceedings of the 4th International Conference On Speech Prosody, Sp 2008. 37-40. |
0.328 |
|
2008 |
Kim H, Hasegawa-Johnson M, Perlman A, Gunderson J, Huang T, Watkin K, Frame S. Dysarthric speech database for universal access research Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 1741-1744. |
0.379 |
|
2008 |
Lobdell BE, Hasegawa-Johnson MA, Allen JB. Human speech perception and feature extraction Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 1797-1800. |
0.726 |
|
2007 |
Zhu W, Hasegawa-Johnson M, Kantor A, Roth D, Park Y, Yang L. E-coder for Automatic Scoring Physical Activity Diary Data Medicine & Science in Sports & Exercise. 39: S190. DOI: 10.1249/01.Mss.0000273709.05036.8D |
0.543 |
|
2007 |
Hasegawa-Johnson M. A multi-stream approach to audiovisual automatic speech recognition 2007 Ieee 9th International Workshop On Multimedia Signal Processing, Mmsp 2007 - Proceedings. 328-331. DOI: 10.1109/MMSP.2007.4412884 |
0.378 |
|
2007 |
Zhou X, Fu Y, Liu M, Hasegawa-Johnson M, Huang TS. Robust analysis and weighting on MFCC components for speech recognition and speaker identification Proceedings of the 2007 Ieee International Conference On Multimedia and Expo, Icme 2007. 188-191. |
0.327 |
|
2006 |
Zhu W, Hasegawa-Johnson M, Roth D, Kantor A, Gao Y, Gandhi MA, Park Y, Yang L. Validation of an E-diary System for Assessing Physical Activities Medicine & Science in Sports & Exercise. 38: S102-S103. DOI: 10.1249/00005768-200605001-01354 |
0.521 |
|
2006 |
Chen K, Hasegawa-Johnson M, Cohen A, Borys S, Kim SS, Cole J, Choi JY. Prosody dependent speech recognition on radio news corpus of American English Ieee Transactions On Audio, Speech and Language Processing. 14: 232-244. DOI: 10.1109/Tsa.2005.853208 |
0.611 |
|
2006 |
Zhang T, Hasegawa-Johnson M, Levinson SE. Cognitive state classification in a spoken tutorial dialogue system Speech Communication. 48: 616-632. DOI: 10.1016/J.Specom.2005.09.006 |
0.462 |
|
2006 |
Zhang T, Hasegawa-Johnson M, Levinson SE. Extraction of pragmatic and semantic salience from spontaneous spoken English Speech Communication. 48: 437-462. DOI: 10.1016/J.Specom.2005.07.007 |
0.49 |
|
2005 |
Hasegawa-Johnson M, Baker J, Borys S, Chen K, Coogan E, Greenberg S, Juneja A, Kirchhoff K, Livescu K, Mohan S, Muller J, Sonmez K, Wang T. LANDMARK-BASED SPEECH RECOGNITION: REPORT OF THE 2004 JOHNS HOPKINS SUMMER WORKSHOP. Proceedings of the ... Ieee International Conference On Acoustics, Speech, and Signal Processing / Sponsored by the Institute of Electrical and Electronics Engineers Signal Processing Society. Icassp (Conference). 1: 1213-1216. PMID 19212454 DOI: 10.1109/ICASSP.2005.1415088 |
0.581 |
|
2005 |
Choi JY, Hasegawa-Johnson M, Cole J. Finding intonational boundaries using acoustic cues related to the voice source. The Journal of the Acoustical Society of America. 118: 2579-87. PMID 16266178 DOI: 10.1121/1.2010288 |
0.342 |
|
2005 |
Hasegawa-Johnson M, Chen K, Cole J, Borys S, Kim SS, Cohen A, Zhang T, Choi JY, Kim H, Yoon T, Chavarria S. Simultaneous recognition of words and prosody in the Boston University Radio Speech Corpus Speech Communication. 46: 418-439. DOI: 10.1016/J.Specom.2005.01.009 |
0.634 |
|
2005 |
Borys S, Hasegawa-Johnson M. Distinctive feature based SVM discriminant features for improvements to phone recognition on telephone band speech 9th European Conference On Speech Communication and Technology. 697-700. |
0.349 |
|
2004 |
Omar M, Hasegawa-Johnson M. Model Enforcement: A Unified Feature Transformation Framework for Classification and Recognition Ieee Transactions On Signal Processing. 52: 2701-2710. DOI: 10.1109/Tsp.2004.834344 |
0.594 |
|
2004 |
Kim SS, Hasegawa-Johnson M, Chen K. Automatic recognition of pitch movements using multilayer perception and time-delay recursive neural network Ieee Signal Processing Letters. 11: 645-648. DOI: 10.1109/Lsp.2004.830114 |
0.341 |
|
2004 |
Chen K, Hasegawa-Johnson M, Cohen A. An automatic prosody labeling system using ANN-based syntactic-prosodic model and GMM-based acoustic-prosodic model Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 1. |
0.382 |
|
2004 |
Zheng Y, Hasegawa-Johnson M. Formant tracking by mixture state particle filter Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 1. |
0.419 |
|
2003 |
Hasegawa-Johnson M, Pizza S, Alwan A, Cha JS, Haker K. Vowel category dependence of the relationship between palate height, tongue height, and oral area. Journal of Speech, Language, and Hearing Research : Jslhr. 46: 738-53. PMID 14697000 DOI: 10.1044/1092-4388(2003/059) |
0.53 |
|
2003 |
Zheng Y, Hasegawa-Johnson M, Pizza S. Analysis of the three-dimensional tongue shape using a three-index factor analysis model. The Journal of the Acoustical Society of America. 113: 478-86. PMID 12558285 DOI: 10.1121/1.1520538 |
0.443 |
|
2003 |
Lee B, Hasegawa-Johnson MA, Goudeseune C. Open‐loop dereverberation of multichannel room impulse responses The Journal of the Acoustical Society of America. 113: 2202-2203. DOI: 10.1121/1.4780198 |
0.509 |
|
2003 |
Omar MK, Hasegawa-Johnson M. Approximately Independent Factors of Speech Using Nonlinear Symplectic Transformation Ieee Transactions On Speech and Audio Processing. 11: 660-671. DOI: 10.1109/Tsa.2003.814457 |
0.637 |
|
2003 |
Zheng Y, Hasegawa-Johnson M. Particle filtering approach to Bayesian formant tracking Ieee Workshop On Statistical Signal Processing Proceedings. 2003: 601-604. DOI: 10.1109/SSP.2003.1289549 |
0.49 |
|
2003 |
Omar MK, Hasegawa-Johnson M. Strong-sense class-dependent features for statistical recognition Ieee Workshop On Statistical Signal Processing Proceedings. 2003: 490-493. DOI: 10.1109/SSP.2003.1289454 |
0.556 |
|
2003 |
Hasegawa-Johnson M. Bayesian learning for models of human speech perception Ieee Workshop On Statistical Signal Processing Proceedings. 2003: 408-411. DOI: 10.1109/SSP.2003.1289432 |
0.389 |
|
2003 |
Zheng Y, Hasegawa-Johnson M. Acoustic segmentation using switching state Kalman filter Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 1: 752-755. |
0.533 |
|
2002 |
Omar MK, Hasegawa-Johnson M. Maximum mutual information based acoustic-features representation of phonological features for speech recognition Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 1. |
0.619 |
|
2002 |
Jing Z, Hasegawa-Johnson M. Auditory-modeling inspired methods of feature extraction for robust automatic speech recognition Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 4. |
0.355 |
|
2001 |
Beauchamp JW, Taube H, Tipei S, Wyatt SA, Haken L, Hasegawa-Johnson M. Acoustics, Audio, and Music Technology Education at the University of Illinois at Urbana‐Champaign The Journal of the Acoustical Society of America. 110: 2626-2626. DOI: 10.1121/1.4776867 |
0.367 |
|
2001 |
Omar MK, Hasegawa-Johnson M, Levinson S. Gaussian mixture models of phonetic boundaries for speech recognition 2001 Ieee Workshop On Automatic Speech Recognition and Understanding, Asru 2001 - Conference Proceedings. 33-36. DOI: 10.1109/ASRU.2001.1034582 |
0.652 |
|
2001 |
Gunawan W, Hasegawa-Johnson M. PLP coefficients can be quantized at 400 BPS Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 1: 77-80. |
0.368 |
|
Show low-probability matches. |