Mark Hasegawa-Johnson - Publications

Affiliations:

University of Illinois, Urbana-Champaign, IL

Area:

Computer Science, Linguistics Language

Year	Citation	Score
2022	Gao H, Ni J, Zhang Y, Qian K, Chang S, Hasegawa-Johnson M. Domain Generalization for Language-Independent Automatic Speech Recognition. Frontiers in Artificial Intelligence. 5: 806274. PMID 35647534 DOI: 10.3389/frai.2022.806274	0.311
2021	Li J, Hasegawa-Johnson M, McElwain NL. Analysis of acoustic and voice quality features for the classification of infant and mother vocalizations. Speech Communication. 133: 41-61. PMID 36062214 DOI: 10.1016/j.specom.2021.07.010	0.321
2020	Wang L, Hasegawa-Johnson M. Multimodal Word Discovery and Retrieval With Spoken Descriptions and Visual Concepts Ieee/Acm Transactions On Audio, Speech, and Language Processing. 28: 1560-1573. DOI: 10.1109/Taslp.2020.2996082	0.45
2020	Scharenborg O, Besacier L, Black A, Hasegawa-Johnson M, Metze F, Neubig G, Stuker S, Godard P, Muller M, Ondel L, Palaskar S, Arthur P, Ciannella F, Du M, Larsen E, et al. Speech Technology for Unwritten Languages Ieee/Acm Transactions On Audio, Speech, and Language Processing. 28: 964-975. DOI: 10.1109/Taslp.2020.2973896	0.488
2018	He D, Lim BP, Yang X, Hasegawa-Johnson M, Chen D. Acoustic landmarks contain more information about the phone string than other frames for automatic speech recognition with deep neural network acoustic model. The Journal of the Acoustical Society of America. 143: 3207. PMID 29960420 DOI: 10.1121/1.5039837	0.705
2017	He D, Lim BPP, Yang X, Hasegawa-Johnson M, Chen D. Selecting frames for automatic speech recognition based on acoustic landmarks Journal of the Acoustical Society of America. 141: 3468-3468. DOI: 10.1121/1.4987204	0.525
2017	Kong X, Yang X, Hasegawa-Johnson M, Choi J, Shattuck-Hufnagel S. Landmark-based consonant voicing detection on multilingual corpora Journal of the Acoustical Society of America. 141: 3468-3468. DOI: 10.1121/1.4987203	0.743
2016	Chen W, Hasegawa-Johnson M, Chen NF. Mismatched Crowdsourcing based Language Perception for Under-resourced Languages Procedia Computer Science. 81: 23-29. DOI: 10.1016/j.procs.2016.04.025	0.328
2016	Livescu K, Rudzicz F, Fosler-Lussier E, Hasegawa-Johnson M, Bilmes J. Speech Production in Speech Technologies: Introduction to the CSL Special Issue Computer Speech and Language. 36: 165-172. DOI: 10.1016/J.Csl.2015.11.002	0.465
2015	Zhang Y, Ou Z, Hasegawa-Johnson M. Incorporating AM-FM effect in voiced speech for probabilistic acoustic tube model 2015 Ieee Workshop On Applications of Signal Processing to Audio and Acoustics, Waspaa 2015. DOI: 10.1109/WASPAA.2015.7336905	0.415
2015	Huang PS, Kim M, Hasegawa-Johnson M, Smaragdis P. Joint Optimization of Masks and Deep Recurrent Neural Networks for Monaural Source Separation Ieee/Acm Transactions On Speech and Language Processing. 23: 2136-2147. DOI: 10.1109/Taslp.2015.2468583	0.403
2015	Chen K, Hasegawa-Johnson M. Improving the robustness of prosody dependent language modeling based on prosody syntax dependence 2003 Ieee Workshop On Automatic Speech Recognition and Understanding, Asru 2003. 435-440. DOI: 10.1109/ASRU.2003.1318480	0.335
2015	Pietrowicz M, Hasegawa-Johnson M, Karahalios K. Acoustic correlates for perceived effort levels in expressive speech Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 2015: 3720-3724.	0.35
2015	Jyothi P, Hasegawa-Johnson M. Acquiring speech transcriptions using mismatched crowdsourcing Proceedings of the National Conference On Artificial Intelligence. 2: 1263-1269.	0.336
2015	Jyothi P, Hasegawa-Johnson M. Transcribing continuous speech using mismatched crowdsourcing Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 2015: 2774-2778.	0.386
2014	Chen A, Hasegawa-Johnson MA. Mixed stereo audio classification using a stereo-input mixed-to-panned level feature Ieee/Acm Transactions On Speech and Language Processing. 22: 2025-2033. DOI: 10.1109/TASLP.2014.2359628	0.435
2014	Zhang Y, Ou Z, Hasegawa-Johnson M. Improvement of Probabilistic Acoustic Tube model for speech decomposition Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 7929-7933. DOI: 10.1109/ICASSP.2014.6855144	0.331
2014	Khasanova A, Cole J, Hasegawa-Johnson M. Detecting articulatory compensation in acoustic data through linear regression modeling Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 925-929.	0.308
2014	Jyothi P, Cole J, Hasegawa-Johnson M, Puri V. An investigation of prosody in Hindi narrative speech Proceedings of the International Conference On Speech Prosody. 623-627.	0.398
2013	Yoon S, Pierce L, Huensch A, Juul E, Perkins S, Sproat R, Hasegawa-Johnson M. Construction of a Rated Speech Corpus of L2 Learners' Spontaneous Speech Calico Journal. 26: 662-673. DOI: 10.1558/Cj.V26I3.662-673	0.375
2013	Sharma HV, Hasegawa-Johnson M. Acoustic model adaptation using in-domain background models for dysarthric speech recognition Computer Speech and Language. 27: 1147-1162. DOI: 10.1016/J.Csl.2012.10.002	0.751
2012	Nam H, Mitra V, Tiede M, Hasegawa-Johnson M, Espy-Wilson C, Saltzman E, Goldstein L. A procedure for estimating gestural scores from speech acoustics. The Journal of the Acoustical Society of America. 132: 3980-9. PMID 23231127 DOI: 10.1121/1.4763545	0.497
2012	Rong P, Loucks T, Kim H, Hasegawa-Johnson M. Relationship between kinematics, F2 slope and speech intelligibility in dysarthria due to cerebral palsy. Clinical Linguistics & Phonetics. 26: 806-22. PMID 22876770 DOI: 10.3109/02699206.2012.706686	0.328
2012	Tang H, Chu SM, Hasegawa-Johnson M, Huang TS. Partially supervised speaker clustering. Ieee Transactions On Pattern Analysis and Machine Intelligence. 34: 959-71. PMID 21844626 DOI: 10.1109/Tpami.2011.174	0.357
2012	Mertens R, Huang P, Gottlieb L, Friedland G, Divakaran A, Hasegawa-Johnson M. On the Applicability of Speaker Diarization to Audio Indexing of Non-Speech and Mixed Non-Speech/Speech Video Soundtracks International Journal of Multimedia Data Engineering and Management. 3: 1-19. DOI: 10.4018/Jmdem.2012070101	0.492
2012	Kim H, Hasegawa-Johnson M. Second-formant locus patterns in dysarthric speech The Journal of the Acoustical Society of America. 132: 2089-2089. DOI: 10.1121/1.4755719	0.379
2012	Ozbek IY, Hasegawa-Johnson M, Demirekler M. On Improving Dynamic State Space Approaches to Articulatory Inversion With MAP-Based Parameter Estimation Ieee Transactions On Audio, Speech, and Language Processing. 20: 67-81. DOI: 10.1109/Tasl.2011.2157496	0.353
2012	Cole J, Hasegawa-Johnson M, Loehr D, Guilder LV, Reetz H, Frisch SA. Corpora, Databases, and Internet Resources: Corpus Phonology with Speech Resources; Using the Internet for Collecting Phonological Data; Speech Manipulation, Synthesis, and Automatic Recognition in Laboratory Phonology; Phonotactic Patterns in Lexical Corpora. The Oxford Handbook of Laboratory Phonology. DOI: 10.1093/oxfordhb/9780199575039.013.0017	0.366
2012	Mathur S, Poole MS, Peña-Mora F, Hasegawa-Johnson M, Contractor N. Detecting interaction links in a collaborating group using manually annotated data Social Networks. 34: 515-526. DOI: 10.1016/J.Socnet.2012.04.002	0.32
2012	Kim LH, Hasegawa-Johnson M. Optimal multi-microphone speech enhancement in cars Digital Signal Processing For in-Vehicle Systems and Safety. 195-204. DOI: 10.1007/978-1-4419-9607-7_13	0.344
2011	Kim H, Hasegawa-Johnson M, Perlman A. Vowel contrast and speech intelligibility in dysarthria. Folia Phoniatrica Et Logopaedica : Official Organ of the International Association of Logopedics and Phoniatrics (Ialp). 63: 187-94. PMID 20938200 DOI: 10.1159/000318881	0.364
2011	Hasegawa-Johnson MA, Huang J, King S, Zhou X. Normalized recognition of speech and audio events The Journal of the Acoustical Society of America. 130: 2524-2524. DOI: 10.1121/1.3655075	0.311
2011	Kim H, Hasegawa-Johnson M, Perlman A. Temporal and spectral characteristics of fricatives in dysarthria The Journal of the Acoustical Society of America. 130: 2446-2446. DOI: 10.1121/1.3654821	0.455
2011	Hasegawa-Johnson MA, Huang J, Zhuang X. Semi-supervised learning for speech and audio processing The Journal of the Acoustical Society of America. 130: 2408-2408. DOI: 10.1121/1.3654654	0.628
2011	Ozbek İY, Hasegawa-Johnson M, Demirekler M. Estimation of Articulatory Trajectories Based on Gaussian Mixture Model (GMM) With Audio-Visual Information Fusion and Dynamic Kalman Smoothing Ieee Transactions On Audio, Speech, and Language Processing. 19: 1180-1195. DOI: 10.1109/Tasl.2010.2087751	0.374
2011	Lobdell BE, Allen JB, Hasegawa-Johnson MA. Intelligibility predictors and neural representation of speech Speech Communication. 53: 185-194. DOI: 10.1016/J.Specom.2010.08.016	0.737
2011	Hasegawa-Johnson M, Goudeseune C, Cole J, Kaczmarski H, Kim H, King S, Mahrt T, Huang JT, Zhuang X, Lin KH, Sharma HV, Li Z, Huang TS. Multimodal speech and audio user interfaces for K-12 outreach Apsipa Asc 2011 - Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2011. 526-531.	0.648
2011	Mahrt T, Huang JT, Mo Y, Fleck M, Hasegawa-Johnson M, Cole J. Optimal models of prosodic prominence using the Bayesian information criterion Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 2037-2040.	0.336
2010	Kim H, Martin K, Hasegawa-Johnson M, Perlman A. Frequency of consonant articulation errors in dysarthric speech. Clinical Linguistics & Phonetics. 24: 759-70. PMID 20831376 DOI: 10.3109/02699206.2010.497238	0.398
2010	Cole J, Mo Y, Hasegawa-Johnson M. Signal-based and expectation-based factors in the perception of prosodic prominence Laboratory Phonology. 1: 425-452. DOI: 10.1515/Labphon.2010.022	0.445
2010	Tang H, Hasegawa-Johnson M, Huang T. A novel vector representation of stochastic signals based on adapted ergodic HMMs Ieee Signal Processing Letters. 17: 715-718. DOI: 10.1109/Lsp.2010.2051945	0.39
2010	Zhuang X, Zhou X, Hasegawa-Johnson MA, Huang TS. Real-world acoustic event detection Pattern Recognition Letters. 31: 1543-1551. DOI: 10.1016/J.Patrec.2010.02.005	0.633
2010	Kim LH, Kim KT, Hasegawa-Johnson M. Robust automatic speech recognition with decoder oriented ideal binary mask estimation Proceedings of the 11th Annual Conference of the International Speech Communication Association, Interspeech 2010. 2066-2069.	0.324
2009	Huang TS, Hasegawa-Johnson MA, Chu SM, Zeng Z, Tang H. Sensitive Talking Heads Ieee Signal Processing Magazine. 26: 67-72. DOI: 10.1109/Msp.2009.932562	0.307
2009	Huang JT, Zhou X, Hasegawa-Johnson M, Huang T. Kernel metric learning for phonetic classification Proceedings of the 2009 Ieee Workshop On Automatic Speech Recognition and Understanding, Asru 2009. 141-145. DOI: 10.1109/ASRU.2009.5373389	0.392
2009	Sharma HV, Hasegawa-Johnson M. Universal access: Speech recognition for talkers with spastic dysarthria Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 1451-1454.	0.737
2008	Kim LH, Hasegawa-Johnson M, Lim JS, Sung KM. Acoustic model for robustness analysis of optimal multipoint room equalization. The Journal of the Acoustical Society of America. 123: 2043-53. PMID 18397012 DOI: 10.1121/1.2837285	0.535
2008	Tang H, Fu Y, Tu J, Hasegawa-Johnson M, Huang TS. Humanoid audio-visual avatar with emotive text-to-speech synthesis Ieee Transactions On Multimedia. 10: 969-981. DOI: 10.1109/Tmm.2008.2001355	0.335
2008	Kantor A, Hasegawa-Johnson M. Stream weight tuning in dynamic Bayesian networks Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 4525-4528. DOI: 10.1109/ICASSP.2008.4518662	0.565
2008	Zhuang X, Zhou X, Huang TS, Hasegawa-Johnson M. Feature analysis and selection for acoustic event detection Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 17-20. DOI: 10.1109/ICASSP.2008.4517535	0.382
2008	Zhou X, Zhuang X, Liu M, Tang H, Hasegawa-Johnson M, Huang T. HMM-based acoustic event detection with adaboost feature selection Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 4625: 345-353. DOI: 10.1007/978-3-540-68585-2_33	0.345
2008	Zhuang X, Hasegawa-Johnson M. Towards interpretation of creakiness in switchboard Proceedings of the 4th International Conference On Speech Prosody, Sp 2008. 37-40.	0.328
2008	Kim H, Hasegawa-Johnson M, Perlman A, Gunderson J, Huang T, Watkin K, Frame S. Dysarthric speech database for universal access research Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 1741-1744.	0.379
2008	Lobdell BE, Hasegawa-Johnson MA, Allen JB. Human speech perception and feature extraction Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech. 1797-1800.	0.726
2007	Zhu W, Hasegawa-Johnson M, Kantor A, Roth D, Park Y, Yang L. E-coder for Automatic Scoring Physical Activity Diary Data Medicine & Science in Sports & Exercise. 39: S190. DOI: 10.1249/01.Mss.0000273709.05036.8D	0.543
2007	Hasegawa-Johnson M. A multi-stream approach to audiovisual automatic speech recognition 2007 Ieee 9th International Workshop On Multimedia Signal Processing, Mmsp 2007 - Proceedings. 328-331. DOI: 10.1109/MMSP.2007.4412884	0.378
2007	Zhou X, Fu Y, Liu M, Hasegawa-Johnson M, Huang TS. Robust analysis and weighting on MFCC components for speech recognition and speaker identification Proceedings of the 2007 Ieee International Conference On Multimedia and Expo, Icme 2007. 188-191.	0.327
2006	Zhu W, Hasegawa-Johnson M, Roth D, Kantor A, Gao Y, Gandhi MA, Park Y, Yang L. Validation of an E-diary System for Assessing Physical Activities Medicine & Science in Sports & Exercise. 38: S102-S103. DOI: 10.1249/00005768-200605001-01354	0.521
2006	Chen K, Hasegawa-Johnson M, Cohen A, Borys S, Kim SS, Cole J, Choi JY. Prosody dependent speech recognition on radio news corpus of American English Ieee Transactions On Audio, Speech and Language Processing. 14: 232-244. DOI: 10.1109/Tsa.2005.853208	0.611
2006	Zhang T, Hasegawa-Johnson M, Levinson SE. Cognitive state classification in a spoken tutorial dialogue system Speech Communication. 48: 616-632. DOI: 10.1016/J.Specom.2005.09.006	0.462
2006	Zhang T, Hasegawa-Johnson M, Levinson SE. Extraction of pragmatic and semantic salience from spontaneous spoken English Speech Communication. 48: 437-462. DOI: 10.1016/J.Specom.2005.07.007	0.49
2005	Hasegawa-Johnson M, Baker J, Borys S, Chen K, Coogan E, Greenberg S, Juneja A, Kirchhoff K, Livescu K, Mohan S, Muller J, Sonmez K, Wang T. Landmark-based speech recognition: Report of the 2004 Johns Hopkins Summer Workshop. Proceedings of the ... Ieee International Conference On Acoustics, Speech, and Signal Processing / Sponsored by the Institute of Electrical and Electronics Engineers Signal Processing Society. Icassp (Conference). 1: 1213-1216. PMID 19212454 DOI: 10.1109/ICASSP.2005.1415088	0.581
2005	Choi JY, Hasegawa-Johnson M, Cole J. Finding intonational boundaries using acoustic cues related to the voice source. The Journal of the Acoustical Society of America. 118: 2579-87. PMID 16266178 DOI: 10.1121/1.2010288	0.342
2005	Hasegawa-Johnson M, Chen K, Cole J, Borys S, Kim SS, Cohen A, Zhang T, Choi JY, Kim H, Yoon T, Chavarria S. Simultaneous recognition of words and prosody in the Boston University Radio Speech Corpus Speech Communication. 46: 418-439. DOI: 10.1016/J.Specom.2005.01.009	0.634
2005	Borys S, Hasegawa-Johnson M. Distinctive feature based SVM discriminant features for improvements to phone recognition on telephone band speech 9th European Conference On Speech Communication and Technology. 697-700.	0.349
2004	Omar M, Hasegawa-Johnson M. Model Enforcement: A Unified Feature Transformation Framework for Classification and Recognition Ieee Transactions On Signal Processing. 52: 2701-2710. DOI: 10.1109/Tsp.2004.834344	0.594
2004	Kim SS, Hasegawa-Johnson M, Chen K. Automatic recognition of pitch movements using multilayer perceptron and time-delay recursive neural network Ieee Signal Processing Letters. 11: 645-648. DOI: 10.1109/Lsp.2004.830114	0.341
2004	Chen K, Hasegawa-Johnson M, Cohen A. An automatic prosody labeling system using ANN-based syntactic-prosodic model and GMM-based acoustic-prosodic model Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 1.	0.382
2004	Zheng Y, Hasegawa-Johnson M. Formant tracking by mixture state particle filter Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 1.	0.419
2003	Hasegawa-Johnson M, Pizza S, Alwan A, Cha JS, Haker K. Vowel category dependence of the relationship between palate height, tongue height, and oral area. Journal of Speech, Language, and Hearing Research : Jslhr. 46: 738-53. PMID 14697000 DOI: 10.1044/1092-4388(2003/059)	0.53
2003	Zheng Y, Hasegawa-Johnson M, Pizza S. Analysis of the three-dimensional tongue shape using a three-index factor analysis model. The Journal of the Acoustical Society of America. 113: 478-86. PMID 12558285 DOI: 10.1121/1.1520538	0.443
2003	Lee B, Hasegawa-Johnson MA, Goudeseune C. Open‐loop dereverberation of multichannel room impulse responses The Journal of the Acoustical Society of America. 113: 2202-2203. DOI: 10.1121/1.4780198	0.509
2003	Omar MK, Hasegawa-Johnson M. Approximately Independent Factors of Speech Using Nonlinear Symplectic Transformation Ieee Transactions On Speech and Audio Processing. 11: 660-671. DOI: 10.1109/Tsa.2003.814457	0.637
2003	Zheng Y, Hasegawa-Johnson M. Particle filtering approach to Bayesian formant tracking Ieee Workshop On Statistical Signal Processing Proceedings. 2003: 601-604. DOI: 10.1109/SSP.2003.1289549	0.49
2003	Omar MK, Hasegawa-Johnson M. Strong-sense class-dependent features for statistical recognition Ieee Workshop On Statistical Signal Processing Proceedings. 2003: 490-493. DOI: 10.1109/SSP.2003.1289454	0.556
2003	Hasegawa-Johnson M. Bayesian learning for models of human speech perception Ieee Workshop On Statistical Signal Processing Proceedings. 2003: 408-411. DOI: 10.1109/SSP.2003.1289432	0.389
2003	Zheng Y, Hasegawa-Johnson M. Acoustic segmentation using switching state Kalman filter Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 1: 752-755.	0.533
2002	Omar MK, Hasegawa-Johnson M. Maximum mutual information based acoustic-features representation of phonological features for speech recognition Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 1.	0.619
2002	Jing Z, Hasegawa-Johnson M. Auditory-modeling inspired methods of feature extraction for robust automatic speech recognition Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 4.	0.355
2001	Beauchamp JW, Taube H, Tipei S, Wyatt SA, Haken L, Hasegawa-Johnson M. Acoustics, Audio, and Music Technology Education at the University of Illinois at Urbana‐Champaign The Journal of the Acoustical Society of America. 110: 2626-2626. DOI: 10.1121/1.4776867	0.367
2001	Omar MK, Hasegawa-Johnson M, Levinson S. Gaussian mixture models of phonetic boundaries for speech recognition 2001 Ieee Workshop On Automatic Speech Recognition and Understanding, Asru 2001 - Conference Proceedings. 33-36. DOI: 10.1109/ASRU.2001.1034582	0.652
2001	Gunawan W, Hasegawa-Johnson M. PLP coefficients can be quantized at 400 BPS Icassp, Ieee International Conference On Acoustics, Speech and Signal Processing - Proceedings. 1: 77-80.	0.368