Taufiq Hasan's Publications

Research Focus

My research goal is to advance signal processing and machine learning techniques for information extraction from speech, audio and video signals. My past work focused on: 1) speaker verification (voice authentication) in noisy, telephone/microphone channel degraded and non-stationary acoustic conditions, 2) speech enhancement and front-end improvement, 3) automatic highlight generation for sports videos using audio-visual features. Here is my publication list in Google scholar.

Journal Papers

John H. L. Hansen and Taufiq Hasan (2015): "How humans and machines recognize voices: A tutorial review", IEEE Signal Processing Magazine, October 2015.
Taufiq Hasan and John H. L. Hansen (2014): "Maximum-Likelihood Acoustic Factor Analysis for Robust Speaker Verification in Noise", IEEE Trans. on Audio Speech and Lang. Process., February 2014. [No. 2 in popular articles of Feb 2014] [pdf]
Taufiq Hasan Hynek Boril, Abhijeet Sangwan, John H. L. Hansen (2013): “Multi-modal highlight generation for sports videos using an information-theoretic excitability measure,” EURASIP Journal on Advances in Signal Processing, November 2013. [pdf]
Taufiq Hasan and John H. L. Hansen (2012): “Acoustic Factor Analysis for Robust Speaker Verification,” IEEE Trans. Audio Speech and Lang. Process., vol. 21, no. 4, pp. 842-853, April 2013. [bib] [pdf] [cover article, No 6 in popular articles of ASLP in Jul. 2013]
Taufiq Hasan and John H. L. Hansen (2011): “A study on Universal Background Model training in Speaker Verification,” IEEE Trans. Audio Speech Lang. Process. vol. 19, No. 7, Sep 2011. [bib] [pdf]
Taufiq Hasan and Md. Kamrul Hasan (2010): “An MMSE estimator for speech enhancement considering the constructive and destructive interference of noise,” IET Signal Processing, vol. 16, issue 1, pp. 1-11, Feb 2010. [bib] [pdf]
Taufiq Hasan and Md. Kamrul Hasan (2009): “Suppression of Residual Noise from Speech Signals Using Empirical Mode Decomposition,” IEEE Signal Process. Lett., vol. 4, no 1, pp. 2-5, Jan 2009. [bib] [pdf]

Conference Papers

Srinivas Parthasarathy and Taufiq Hasan (2015): “Automatic Broadcast News Summarization via Rank Classifiers and Crowdsourced Annotation,” in Proc. ICASSP, Brisbane, Australia.
Taufiq Hasan, John H.L. Hansen (2013): “Acoustic Factor Analysis based Universal Background Model for Robust Speaker Verification in Noise,” in Proc. InterSpeech, Lyon, France. [pdf]
Ville Hautamaki, Kong Aik Lee, David van Leeuwen, Rahim Saeidi, Anthony Larcher, Tomi Kinnunen, Taufiq Hasan, Seyed Omid Sadjadi, Gang Liu, Hynek Boril, John H.L. Hansen and Benoit Fauve (2013): “Automatic regularization of cross-entropy cost for speaker recognition fusion,” in Proc. InterSpeech, Lyon, France. [pdf]
Rahim Saeidi, Kong Aik Lee, Tomi Kinnunen, Taufiq Hasan, Benoit Fauve, Pierre-Michel Bousquet, Elie Khoury, Pablo L. Sordo Martinez, Karen Kua, Changhuai You, hanwu sun, Anthony Larcher, Paddy Rajan, Ville Hautamaki, Cemal Hanilci, Billy Braithwaite, Rosa Gonzalez Hautamaki, Seyed Omid Sadjadi, Liu Gang and Hynek Boril (2013): “I4U submission to NIST SRE 2012: A large-scale collaborative effort for noise-robust speaker verification,” in Proc. InterSpeech, Lyon, France. [pdf]
Taufiq Hasan, Seyed Omid Sadjadi, Gang Liu, Navid Shokouhi, Hynek Boril, John H.L. Hansen (2013): “CRSS Systems for 2012 NIST Speaker Recognition Evaluation,” in Proc. IEEE ICASSP, Vancouver, Canada. [bib] [pdf]
Taufiq Hasan, Rahim Saeidi, John H. L. Hansen, David A. van Leeuwen (2013): “Duration Mismatch Compensation for I-vector based Speaker Recognition Systems,” in Proc. IEEE ICASSP, Vancouver, Canada. [bib] [pdf]
Gang Liu, Taufiq Hasan, Hynek Boril, John H.L. Hansen (2013): “Back-end Investigation for Multi-session Speaker Identification ,” in Proc. IEEE ICASSP, Vancouver, Canada. [bib] [pdf]
Taufiq Hasan and John H. L. Hansen (2012): “Integrated Feature Normalization and Enhancement for robust Speaker Recognition using Acoustic Factor Analysis,” in Proc. InterSpeech, Portland, OR, Sep. 2012. [pdf] [bib]
Taufiq Hasan and John H. L. Hansen (2012): “Front-end Channel Compensation using Mixture-dependent Feature Transformations for i-Vector Speaker Recognition,” in Proc. InterSpeech, Portland, OR, Sep. 2012. [pdf] [bib]
Keith W. Godin, Taufiq Hasan and John H. L. Hansen (2012): “Glottal Waveform Analysis of Physical Task Stress Speech,”in Proc. InterSpeech, Portland, OR, Sep. 2012.
Seyed Omid Sadjadi, Taufiq Hasan and John H. L. Hansen (2012): “Mean Hilbert Envelope Coefficients (MHEC) for Robust Speaker Recognition,” in Proc. InterSpeech, Portland, OR, Sep. 2012.
Taufiq Hasan and John H. L. Hansen (2012): “Factor Analysis of Acoustic Features using a Mixture of Probabilistic Principal Component Analyzers for robust Speaker Verification ,”in Proc. Odyssey, Singapore, Jun. 2012. [pdf] [bib]
Taufiq Hasan, Hynek Boril, Abhijeet Sangwan and John H. L. Hansen (2012): “A multi-modal highlight extraction scheme for sports videos using an information-theoretic excitability measure,” in Proc. IEEE ICASSP, Kyoto, Japan [pdf] [bib] [demo] [slides]
Taufiq Hasan and John H. L. Hansen (2011): “Robust Speaker Recognition in Non-Stationary Room Environments Based on Empirical Mode Decomposition,” in Proc. InterSpeech, Florence, Italy. [pdf] [bib] [slides]
Hynek Boril, Abhijeet Sangwan, Taufiq Hasan and John H. L. Hansen (2010): “Automatic Excitement-Level Detection for Sports Highlights Generation,” in Proc. InterSpeech, Makuhari, Chiba, Japan. [pdf] [bib]
Taufiq Hasan and John H. L. Hansen (2010): “A novel feature sub-sampling method for efficient universal background model training in speaker verification,”in Proc. IEEE ICASSP, Dallas, Texas. [pdf] [bib] [appendix] [slides] [code]
M. Ryyan Khan, Taufiq Hasan and M. Rezwan Khan (2008): “Iterative Noise Power Subtraction Technique for Improved Speech Quality”,in Proc. ICECE, Dhaka, Bangladesh. [pdf] [bib]
Taufiq Hasan and Md. Kamrul Hasan (2007): “A Probabilistic Speech Enhancement Filter Utilizing the Constructive and Destructive Interference of Noise”, in Proc. EUSIPCO, 3-7 September 2007, Poznan, Poland. [pdf] [bib] [slides]
Taufiq Hasan, Moyeenul Huq, Rakesh Mitra and Md. Kamrul Hasan (2006): “A two stage speech enhancement method A two stage speech enhancement method for further improvement of speech quality by extracting signal from residual,” in Proc. ISSPA, Sharjah, UAE, 12-15 February, 2007. [pdf] [bib] [slides]

Theses

Taufiq Hasan (2013): "Effective Acoustic Modeling for Robust Speaker Recognition", PhD Dissertation, The University of Texas at Dallas (UTD), Richardson, TX. [pdf]
Taufiq Hasan (2008): "A Hybrid Speech Enhancement Method using Optimal Dual Gain Filters and EMD Based Post Processing", MSc Thesis, Bangladesh University of Engineering and Technology (BUET), Dhaka, Bangladesh. [pdf]

Unrefereed publications

Taufiq Hasan, Gang Liu, Seyed Omid Sadjadi, Navid Shokouhi, Hynek Boril, Abhinav Misra, Keith W. Godin, and John H.L. Hansen, (2012): “UTD-CRSS Systems for 2012 NIST Speaker Recognition Evaluation,” NIST 2012 Speaker Recognition Evaluation Workshop, Orlando, Florida, 11-12 December 2012 [bib] [pdf]
Jun-Won Suh, Seyed Omid Sadjadi, Gang Liu, Taufiq Hasan, Keith W. Godin, and John H.L. Hansen, (2011): “Exploring Hilbert envelope based acoustic features in i-vector speaker verification using HT-PLDA,” NIST 2011 Speaker Recognition Evaluation Workshop, Atlanta, GA, USA, 7-9 December 2011 [bib] [pdf]
G. Liu, S.O. Sadjadi, Taufiq Hasan, J.-W. Suh, C. Zhang, M. Mehrabani, H. Boril, A. Sangwan, and J.H.L. Hansen, (2011): “UTD-CRSS SYSTEMS FOR NIST LANGUAGE RECOGNITION EVALUATION 2011,” NIST 2011 Language Recognition Evaluation Workshop, Atlanta, GA, USA, 7-9 December 2011 [bib] [pdf]
Yun Lei, Taufiq Hasan, Jun-Won Suh, Abhijeet Sangwan, Hynek Boril, Liu Gang, Keith Godin, Chi Zhang, and John H. L. Hansen, (2010): “The CRSS Systems for the 2010 NIST Speaker Recognition Evaluation,” NIST 2010 Speaker Recognition Evaluation Workshop, Brno, Czech Republic, 24-25 June 2010 [bib] [pdf]