
Contact Information
707 S Mathews Ave
M/C 168
Urbana, IL 61801
Research Interests
Speech intelligibility enhancement
Perceptually-motivated signal processing
Computational modelling of speech perception in noise
Speech perception and production in non-ideal listening conditions
Blind source separation
Robust automatic speech recognition
Research Description
My current research focuses on speech technologies, including speech intelligibility enhancement in adverse listening conditions, computational modelling of speech perception in noise, and perceptually-motivated, context-sensitive speech modification algorithms. My research interests also include speech perception and production in noise, psychoacoustics, source separation, and robust automatic speech recognition.
Education
PhD, Applied Linguistics (Computational Speech and Hearing), Universidad del País Vasco, Spain, 2014
MSc, Software Systems and Internet Technology with Distinction, University of Sheffield, UK, 2008
MSc, Environmental Chemistry, Sichuan University, China, 2007
BSc, Environmental Engineering, Chengdu University of Technology, China, 2004
Courses Taught
LING 402 - Tools and Techniques for Speech and Language Processing
LING 446 - Fundamentals for Speech Signal Processing and Analysis
LING 490 - Special topic: Fundamentals of Digital Signal Processing
LING 506 - Special topic: Introductory Machine Learning
LING 520 - Acoustic Phonetics
Additional Campus Affiliations
Assistant Professor, Beckman Institute for Advanced Science and Technology
Recent Publications
Tang, Y., Cox, T. J., Fazenda, B. M., Liu, Q., & Wang, W. (2019). Background Adaptation for Improved Listening Experience in Broadcasting. In 2019 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2019 - Proceedings (pp. 8008-8012). [8682687] (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings; Vol. 2019-May). Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/ICASSP.2019.8682687
Coleman, P., Franck, A., Francombe, J., Liu, Q., De Campos, T., Hughes, R. J., Menzies, D., Galvez, M. F. S., Tang, Y., Woodcock, J., Jackson, P. J. B., Melchior, F., Pike, C., Fazi, F. M., Cox, T. J., & Hilton, A. (2018). An Audio-Visual System for Object-Based Audio: From Recording to Listening. IEEE Transactions on Multimedia, 20(8), 1919-1931. https://doi.org/10.1109/TMM.2018.2794780
Demonte, P. J., Tang, Y., Hughes, R. J., Cox, T. J., Fazenda, B. M., & Shirley, B. G. (2018). Speech-To-Screen: Spatial separation of dialogue from noise towards improved speech intelligibility for the small screen. Paper presented at 144th Audio Engineering Society Convention 2018, Milan, Italy.
Tang, Y., Liu, Q., Wang, W., & Cox, T. J. (2018). A non-intrusive method for estimating binaural speech intelligibility from noise-corrupted signals captured by a pair of microphones. Speech Communication, 96, 116-128. https://doi.org/10.1016/j.specom.2017.12.005
Tang, Y., Fazenda, B. M., & Cox, T. J. (2018). Automatic speech-to-background ratio selection to maintain speech intelligibility in broadcasts using an objective intelligibility metric. Applied Sciences (Switzerland), 8(1), [59]. https://doi.org/10.3390/app8010059