Recent Publications Haizhou Li - COLIPSeleliha/Publications - Haizhou Li.pdf · · 2017-12-31Recent Publications – Haizhou Li Patents 1) US Patent number 6311152, ... Marcello

Recent Publications – Haizhou Li

Patents

1) US Patent number 6311152, Shuanhu Bai, Horng Jyh Paul Wu, Haizhou Li, Gareth Loudon, System

for Chinese tokenization and named entity recognition, Publication date 2001/10/30

2) US Patent Number 6674861, Changsheng Xu, Jiankang Wu, Qibin Sun, Kai Xin, Haizhou Li, Digital

audio watermarking using content-adaptive, multiple echo hopping, Publication date: 2004/1/6

3) US Patent Number 6397181 B1, Haizhou Li, Jiankang Wu, Method and Apparatus for Voice

Annotation and Retrieval of Multimedia Data, Publication date: 2000/8/3

4) US Patent Number 7,917,361 B2, Haizhou Li, Bin Ma, George M. White, Spoken Language

Identification System and Methods for Training and Operating Same, Publication date: March 29, 2011

5) USPTO Application #: #20100299136, Rong Tong, Shuanghu Bai, Haizhou Li, dialogue system and a

method for executing a fully mixed initiative dialogue (fmid) interaction between a human and a

machine, Publish Date: 25 November 2010

6) USPTO Application #: #20150025892, Siu Wa Lee, Ling Cen, Haizhou Li, Yaozhu Paul Chan,

Minghui Dong, Method and system for template-based personalized singing synthesis, Publish Date:

22 January 2015

7) USPTO Application #: #20100198760, Namunu C. Maddage, Haizhou Li, Apparatus and methods for

music signal analysis, Publish Date: 5 August 2010

8) USPTO Application #: #20100004931, Bin Ma, Haizhou Li, Minghui Dong, Apparatus and method for

speech utterance verification , Publish Date: 7 January 2010

Books & Book Chapters

1) Haizhou Li, Kar-Ann Toh, Liyuan Li, Advanced Topics in Biometrics, World Scientific, 2011.

2) Haizhou Li, Bin Ma, and Chin-Hui Lee, Vector-based Spoken Language Classification. In Jacob

Benesty, M. Mohan Sondhi, Arden Huang (editors) Springer Handbook of Speech Processing, Springer,

2007.

3) Chin-Hui Lee, Haizhou Li, Lin-shan Lee, Renhua Wang, and Qiang Huo (editors), Advances in

Chinese Spoken Language Processing, World Scientific, 2007.

4) Shuzhi Sam Ge, Haizhou Li, John-John Cabibihan and Yeow Kee Tan (editors), Social Robotics,

Springer Lecture Notes in Artificial Intelligence 6414, 2010.

5) Qiang Huo, Bin Ma, Eng Siong Chng, and Haizhou Li (editors), Chinese Spoken Language Processing,

Springer Lecture Notes in Artificial Intelligence 4274, 2006.

6) Yinglin Yu and Haizhou Li, Neural Networks and Signal Analysis, South China University of

Technology Press, Guangzhou.

Journal Articles

1) Kaavya Sriskandaraja, Vidhyasaharan Sethu, Eliathamby Ambikairajah, Haizhou Li, Front-End for

Antispoofing Countermeasures in Speaker Verification: Scattering Spectral Decomposition, IEEE

Journal of Selected Topics in Signal Processing 11(4): 632-643, 2017

2) Hongjie Chen, Cheung-Chi Leung, Lei Xie, Bin Ma, Haizhou Li, Multitask Feature Learning for Low-

Resource Query-by-Example Spoken Term Detection, IEEE Journal of Selected Topics in Signal

Processing 11(8): 1329-1339, 2017

3) Xiaohai Tian, Siu Wa Lee, Zhizheng Wu, Eng Siong Chng, Haizhou Li, An Exemplar-Based

Approach to Frequency Warping for Voice Conversion, IEEE/ACM Trans. Audio, Speech & Language

Processing 25(10): 1863-1876, 2017

http://www.google.com/patents/US6311152




http://www.freshpatents.com/Siu-Wa-Lee-Singapore-invdxl.php

http://www.freshpatents.com/Ling-Cen-Singapore-invdxc.php

http://www.freshpatents.com/Haizhou-Li-Singapore-invdxl.php

http://www.freshpatents.com/Yaozhu-Paul-Chan-Singapore-invdxc.php

http://www.freshpatents.com/Minghui-Dong-Singapore-invdxd.php

http://www.freshpatents.com/Namunu-C-Maddage-Singapore-invdxm.php


http://www.freshpatents.com/Bin-Ma-Singapore-invdxm.php


http://www.freshpatents.com/Minghui-Dong-Singapore-invdxd.php

4) Hongjie Chen, Lei Xie, Cheung-Chi Leung, Xiaoming Lu, Bin Ma, Haizhou Li, Modeling Latent

Topics and Temporal Distance for Story Segmentation of Broadcast News, IEEE/ACM Trans. Audio,

Speech & Language Processing 25(1): 108-119, 2017

5) Xiong Xiao, Shengkui Zhao, Duc Hoang Ha Nguyen, Xionghu Zhong, Douglas L. Jones, Eng Siong

Chng, Haizhou Li, Speech dereverberation for enhancement and recognition using dynamic features

constrained deep neural networks and feature adaptation. EURASIP J. Adv. Sig. Proc. 2016.

6) Zhizheng Wu, Haizhou Li, On the study of replay and voice conversion attacks to text-dependent

speaker verification. Multimedia Tools Appl. 75(9) , pp. 5311-5327, 2016.

7) Nancy F. Chen, Darren Wee, Rong Tong, Bin Ma, Haizhou Li, Large-scale characterization of non-

native Mandarin Chinese spoken by speakers of European origin: Analysis on iCALL. Speech

Communication 84, pp. 46-56, 2016.

8) Sven Ewan Shepstone, Kong-Aik Lee, Haizhou Li, Zheng-Hua Tan, Søren Holdt Jensen, Total

Variability Modeling Using Source-Specific Priors. IEEE/ACM Trans. Audio, Speech & Language

Processing 24(3), pp. 504-517, 2016.

9) Duc Hoang Ha Nguyen, Xiong Xiao, Eng Siong Chng, Haizhou Li, Feature Adaptation Using Linear

Spectro-Temporal Transform for Robust Speech Recognition. IEEE/ACM Trans. Audio, Speech &

Language Processing 24(6), pp. 1006-1019, 2016.

10) Qiang Yu, Rui Yan, Huajin Tang, Kay Chen Tan, Haizhou Li, A Spiking Neural Network System for

Robust Sequence Recognition. IEEE Trans. Neural Netw. Learning Syst. 27(3), pp. 621-635, 2016.

11) Yuma Ueda, Longbiao Wang, Atsuhiko Kai, Xiong Xiao, Engsiong Chng, Haizhou Li, Single-channel

Dereverberation for Distant-Talking Speech Recognition by Combining Denoising Autoencoder and

Temporal Structure Normalization. Signal Processing Systems 82(2), pp. 151-161, 2016.

12) Liping Chen, Kong-Aik Lee, Bin Ma, Wu Guo, Haizhou Li, Li-Rong Dai, Exploration of Local

Variability in Text-Independent Speaker Verification. Signal Processing Systems 82(2), pp. 217-228 ,

2016.

13) Jun Hu, Huajin Tang, Kay Chen Tan, Haizhou Li, How the Brain Formulates Memory: A Spatio-

Temporal Model, IEEE Computational Intelligence Magazine, accepted in 2015

14) Qiang Yu, Rui Yan, Huajin Tang, Kay Chen Tan, Haizhou Li, A Spiking Neural Network System for

Robust Sequence Recognition, IEEE Transactions on Neural Networks and Learning Systems,

accepted in 2015 (DOI: 10.1109/TNNLS.2015.2416771)

15) Jonathan Dennis, Huy Dat Tran, Haizhou Li, Generalized Hough Transform for Speech Pattern

Classification, IEEE/ACM Transactions on Audio, Speech and Language Processing, 23(11), pp. 1963-

1972, 2015.

16) Chang Huai You, Haizhou Li, and Kong-Aik Lee, “Relevance factor of maximum a posteriori

adaptation for GMM-NAP-SVM in speaker and language recognition”, Computer Speech and

Language, vol.30, no.1, pp.116-134, 2015.

17) Dau-Cheng Lyu, Tien Ping Tan, Eng siong Chng, Haizhou Li, Mandarin-English code-switching

speech corpus in South-East Asia: SEAME. Language Resources and Evaluation 49(3): 581-600 (2015)

18) Haipeng Wang, Tan Lee, Cheung-Chi Leung, Bin Ma, and Haizhou Li, “Acoustic Segment Modeling

with Spectral Clustering Methods”, IEEE/ACM Transactions on Audio, Speech and Language

Processing, vol.23, no.2, pp.264-277, 2015.

19) Van Hai Do, Xiong Xiao, Eng Siong Chng, and Haizhou Li, “Context-dependent Phone Mapping for

Acoustic Modeling of Under-resourced Languages”, International Journal of Asian Language


20) Haizhou Li, Marcello Federico, Xiaodong He, Helen M. Meng, and Isabel Trancoso, “Introduction to

the Special Section on Continuous Space and Related Methods in Natural Language Processing”,

IEEE/ACM Transactions on Audio, Speech and Language Processing, vol.23, no.3, pp.427-430, 2015.

21) Tze Yuang Chong, Rafael E. Banchs, Eng siong Chng, Haizhou Li, “Decoupling Word-Pair Distance

and Co-occurrence Information for Effective Long History Context Language Modeling,” IEEE/ACM

Transactions on Audio, Speech and Language Processing, vol 23, no. 7, (7): pp. 1221-1232, 2015

http://ieeexplore.ieee.org.ezlibproxy1.ntu.edu.sg/xpl/articleDetails.jsp?arnumber=7086059&newsearch=true&queryText=Q.%20Yu,%20R.%20Yan,%20H.%20Tang,%20K.%20C.%20Tan,%20and%20H.%20Li.%20A%20Spiking%20Neural%20Network%20System%20for%20Robust%20Sequence%20Recognition.%20IEEE%20Trans.%20on%20Neural%20Networks%20and%20Learning%20Systems,

http://ieeexplore.ieee.org.ezlibproxy1.ntu.edu.sg/xpl/articleDetails.jsp?arnumber=7086059&newsearch=true&queryText=Q.%20Yu,%20R.%20Yan,%20H.%20Tang,%20K.%20C.%20Tan,%20and%20H.%20Li.%20A%20Spiking%20Neural%20Network%20System%20for%20Robust%20Sequence%20Recognition.%20IEEE%20Trans.%20on%20Neural%20Networks%20and%20Learning%20Systems,

http://ieeexplore.ieee.org.ezlibproxy1.ntu.edu.sg/xpl/RecentIssue.jsp?punumber=5962385

http://ieeexplore.ieee.org.ezlibproxy1.ntu.edu.sg/xpl/RecentIssue.jsp?punumber=5962385

http://dx.doi.org.ezlibproxy1.ntu.edu.sg/10.1109/TNNLS.2015.2416771

http://dblp.uni-trier.de/db/journals/taslp/taslp23.html#ChongBCL15

http://dblp.uni-trier.de/db/journals/taslp/taslp23.html#ChongBCL15

22) Rafael E. Banchs, Luis F. D'Haro, and Haizhou Li, “Adequacy-Fluency Metrics: Evaluating MT in the

Continuous Space Model Framework”, IEEE/ACM Transactions on Audio, Speech and Language


23) Zhizheng Wu, Nicholas Evans, Tomi Kinnunen, Junichi Yamagishi, Federico Alegre, and Haizhou Li,

"Spoofing and countermeasures for speaker verification: a survey", Speech Communication, vol.66, pp.

130-153, 2015.

24) Haizhou Li, Inaugural editorial: Embracing Opportunities for Growth, IEEE/ACM Transactions on

Audio, Speech and Language Processing, 23(1): 5-6, 2015.

25) Van Hai Do, Xiong Xiao, Eng Siong Chng, and Haizhou Li, “Cross-lingual phone mapping for large

vocabulary speech recognition of under-resourced languages”, IEICE Transactions on Information and

Systems, vol.97-D, no.2, pp. 285-295, 2014.

26) Miaolong Yuan, Huajin Tang, and Haizhou Li, “Real-Time Keypoint Recognition Using Restricted

Boltzmann Machine,” IEEE Transactions on Neural Networks and Learning Systems, vol.25, no.11, pp.

2119-2126, 2014.

27) Zhizheng Wu and Haizhou Li, “Voice conversion versus speaker verification: an overview”, APSIPA

Transactions on Signal and Information Processing, vol.3, 2014.

28) Zhizheng Wu, Eng Siong Chng, and Haizhou Li, “Exemplar-based voice conversion using joint

nonnegative matrix factorization”, Multimedia Tools and Applications, Springer, 2014.

29) Zhizheng Wu, Tuomas Virtanen, Eng Siong Chng, and Haizhou Li, “Exemplar-based sparse

representation with residual compensation for voice conversion”, IEEE/ACM Transactions on Audio,

Speech and Language Processing, vol.22, no.10, pp. 1506-1521, 2014.

30) Anthony Larcher, Kong Aik Lee, Bin Ma, and Haizhou Li, “Text-dependent speaker verification:

Classifiers, databases and RSR2015”, Speech Communication, vol.60, pp. 56-77, 2014.

31) S. J. Wright, D. Kanevsky, Li Deng, Xiaodong He, G. Heigold, and Haizhou Li, “Optimization

Algorithm and Applications for Speech and Language Processing”, IEEE Transactions on Audio,

Speech and Language Processing, vol.21, no.11, pp. 2231-2243, 2013.

32) Raymond W. M. Ng, Tan Lee, Cheung-Chi Leung, Bin Ma, and Haizhou Li, “Spoken Language

Recognition With Prosodic Features”, IEEE Transactions on Audio, Speech and Language Processing,

vol.21, no.9, pp. 1841-1853, April 2013.

33) Ville Hautamäki, Tomi Kinnunen, Filip Sedlak, Kong Aik Lee, Bin Ma, and Haizhou Li, “Sparse

Classifier Fusion for Speaker Verification”, IEEE Transactions on Audio, Speech and Language

Processing, vol.21, no.8, pp. 1622-1631, August 2013.

34) Qiang Yu, Huajin Tang, Kay Chen Tan, and Haizhou Li, “Precise-Spike-Driven Synaptic Plasticity:

Learning Hetero-Association of Spatiotemporal Spike Patterns”, PLoS ONE, vol.8, no.11, November

2013.

35) Haizhou Li, Kong Aik Lee, and Bin Ma, “Spoken Language Recognition: From Fundamentals to

Practice”, Proceedings of the IEEE, vol. 101, no. 5, pp. 1136-1159, May 2013.

36) Douglas D. O'Shaughnessy, Li Deng, and Haizhou Li, “Speech Information Processing: Theory and

Applications”, Proceedings of the IEEE, vol. 101, no. 5, pp. 1034-1037, May 2013.

37) Jiali Yu, Huajin Tang, and Haizhou Li, “Dynamics Analysis of a Population Decoding Model”, IEEE

Transactions on Neural Networks and Learning Systems, vol. 24, no. 3, pp. 498-503, 2013.

38) Qiang Yu, Huajin Tang, Kay Chen Tan, and Haizhou Li, “Rapid Feedforward Computation by

Temporal Encoding and Learning With Spiking Neurons”, IEEE Transactions on Neural Networks and

Learning Systems, vol.24, no.10, pp. 1539-1552, October 2013.

39) Haipeng Wang, Cheung-Chi Leung, Tan Lee, Bin Ma, and Haizhou Li, “Shifted-Delta MLP Features

for Spoken Language Recognition”, IEEE Signal Processing Letters, vol. 20, no. 1, pp. 15-18, January

2013.

40) Andreea Niculescu, Betsy van Dijk, Anton Nijholt, Haizhou Li, and See Swee Lan, “Making Social

Robots More Attractive: The Effects of Voice Pitch, Humor and Empathy”, International Journal of

Social Robotics, vol. 5, no. 2, pp. 171-191, April 2013.

41) Jiali Yu, Huajin Tang, and Haizhou Li, “Continuous attractors of discrete-time recurrent neural

networks”, Neural Computing and Applications, vol. 23, no. 1, pp. 89-96, 2013.

42) Jiali Yu, Huajin Tang, Haizhou Li, and Luping Shi, “Dynamical properties of continuous attractor

neural network with background tuning”, Neurocomputing, vol. 99, pp. 439-447, 2013.

43) Jun Hu, Huajin Tang, Kay Chen Tan, Haizhou Li, and Luping Shi, “A Spike-Timing-Based Integrated

Model for Pattern Recognition”, Neural Computation, vol. 25, no. 2, pp. 450-472, 2013.

44) Sakriani Sakti, Michael Paul, Andrew Finch, Shinsuke Sakai, Thang Tat Vu, Noriyuki Kimura, Chiori

Hori, Eiichiro Sumita, Satoshi Nakamura, Jun Park, Chai Wutiwiwatchai, Bo Xu, Hammam Riza,

Karunesh Arora, Chi Mai Luong, and Haizhou Li, “A-STAR: Toward Translating Asian Spoken

Languages”, Computer Speech and Language, vol. 27, no. 2, pp. 509-527, 2013.

45) Zhizheng Wu, Tomi Kinnunen, Eng Siong Chng, and Haizhou Li, “Mixture of factor analyzers using

priors from non-parallel speech for voice conversion”, IEEE Signal Processing Letters, vol. 19, no. 12,

pp. 914-917, 2012.

46) Omid Dehzangi, Bin Ma, Eng-Siong Chng, and Haizhou Li, “Discriminative Feature Extraction for

Speech Recognition Using Continuous Output Codes”, Pattern Recognition Letters, vol. 33, pp. 1703-

1709, 2012.

47) Liyuan Li, Shuicheng Yan, Xinguo Yu, Yeow Kee Tan, and Haizhou Li, “Robust Multiperson

Detection and Tracking for Mobile Service and Social Robots”, IEEE Transactions on Systems, Man,

and Cybernetics -PART B: CYBERNETICS, vol. 42, no. 5, 2012.

48) Tomi Kinnunen, Rahim Saeidi, Filip Sedlak, Kong Aik Lee, Johan Sandberg, Maria Hansson-Sandsten,

and Haizhou Li, ”Low-Variance Multitaper MFCC Features: a Case Study in Robust Speaker

Verification”, IEEE Transactions on Audio, Speech and Language Processing, vol. 20, no. 7, pp. 1990-

2001, September 2012.

49) Andreea Niculescu, Betsy van Dijk, Anton Nijholt, Haizhou Li, and Swee Lan See, “Making social

robots more attractive: the effects of voice pitch, humor and empathy”, International Journal of Social

Robotics, vol. 5, no. 2, pp. 171-191, April 2013.

50) Wenliang Chen, Jun'ichi Kazama, Min Zhang, Yoshimasa Tsuruoka, Yujie Zhang, Yiou Wang,

Kentaro Torisawa, and Haizhou Li, “Bitext dependency parsing with auto-generated bilingual

treebank”, IEEE Transactions on Audio, Speech and Language Processing, vol. 20, no. 5, pp. 1461-

1472, 2012.

51) Xiaoxuan Wang, Lei Xie, Mimi Lu, Bin Ma, Engsiong Chng, and Haizhou Li, “Broadcast news story

segmentation using conditional random fields and multimodal features”, IEICE Transactions on

Information and Systems, vol. E95-D, no. 5, pp.1206-1215, 2012.

52) Yi Ren Leng, Tran Huy Dat, Norihide Kitaoka, and Haizhou Li, “Selective gammatone envelope

feature for robust sound event recognition”, IEICE Transactions, vol. 95-D, no. 5, pp. 1229-1237, 2012.

53) Rui Yan, Keng Peng Tee, Yuanwei Chua, Haizhou Li, and Huajin Tang, “Gesture Recognition Based

on Localist Attractor Networks with Application to Robot Control”, IEEE Computational Intelligence

Magazine, vol. 7, No. 1, pp. 64-74, 2012.

54) Keng Peng Tee, Rui Yan, Yuanwei Chua, Zhiyong Huang, and Haizhou Li, “Modular IK: a Robust

Inverse Kinematic Algorithm for Gesture Imitation in an Upper-Body Humanoid Robot”, International

Journal of Humanoid Robotics, vol. 9, no. 2, June 2012.

55) Jin-Shea Kuo and Haizhou Li, “Learning regional transliteration variants”, Information Processing

and Management, vol. 48, no. 1, pp. 154-169, 2012.

56) Tin Lay Nwe, Hanwu Sun, Bin Ma, and Haizhou Li, “Speaker Clustering and Cluster Purification

Methods for RT07 and RT09 Evaluation Meeting Data”, IEEE Transactions on Audio, Speech and

Language Processing, vol. 20, no. 2, pp. 461-473, 2012.

57) Haizhou Li, “FOREWORD - Special Section on Recent Advances in Multimedia Signal Processing

Techniques and Applications”, IEICE TRANSACTIONS on Information and Systems, vol. 95-D, no. 5,

pp. 1181-1181, May 2012.

58) Haizhou Li , John-John Cabibihan, and Yeow Kee Tan, “Towards an Effective Design of Social

Robots”, International Journal of Social Robotics, vol. 3, no. 4, pp. 333-335, November 2011.

59) Huajin Tang and Haizhou Li, “Book Review: Information Theoretic Learning: Renyi’s Entropy and

Kernel Perspectives”, IEEE Computational Intelligence Magazine, vol. 6, no. 3, August 2011.

60) Eliathamby Ambikairajah, Haizhou Li, Liang Wang, Bo Yin, and Vidhyasaharan Sethu, “Language

Identification: A Tutorial”, IEEE Circuits and Systems Magazine, vol. 11, no. 2, pp. 82-108, 2011.

61) Huajin Tang Haizhou Li, and Zhang Yi, “Online learning and stimulus-driven responses of neurons in

visual cortex”, Cognitive Neurodynamics, vol. 5, no. 1, pp. 77-85, 2011.

62) Omid Dehzangi, Bin Ma, Eng-Siong Chng, and Haizhou Li, “Error Corrective Fusion of Classifier

Scores for Spoken Language”, IEICE Transactions on Information and Systems, vol. E94-D, no.12, pp.

2503-2512, 2011.

63) Deyi Xiong, Min Zhang, and Haizhou Li, “A Maximum Entropy Segmentation Model for Statistical

Machine Translation”, IEEE Transactions on Audio, Speech and Language Processing, vol. 19, no. 8,

November 2011.

64) Huy Dat Tran and Haizhou Li, “Sound Event Recognition with Probabilistic Distance SVMs”, IEEE

Transactions on Audio, Speech and Language Processing, vol. 19, no. 6, pp. 1556-1568, 2011.

65) Jonathan Dennis, Huy Dat Tran, and Haizhou Li, “Spectrogram Image Feature for Sound Event

Classification in Mismatched Conditions”, IEEE Signal Processing Letters, vol. 18, no. 2, pp. 130-133,

February 2011.

66) Kong Aik Lee, Chang Huai You, Haizhou Li, Tomi Kinnunen, and Khe Chai Sim, “Using Discrete

Probabilities with Bhattacharyya Measure for SVM-based Speaker Verification”, IEEE Transactions

on Audio, Speech and Language Processing, vol. 19, no. 4, pp. 861-870, May 2011.

67) Donglai Zhu, Bin Ma, and Haizhou Li, “Speaker Verification with Feature-Space MAPLR

Parameters”, IEEE Transactions on Audio, Speech and Language Processing, vol. 19, no. 3, pp. 505-

515, March 2011.

68) Namunu C. Maddage and Haizhou Li, “Beat Space Segmentation and Octave Scale Cepstral Feature

for Sung Language Recognition in Pop Music”, ACM Transactions on Multimedia Computing,

Communications and Applications (TOMCCAP), vol. 7, no. 4, November 2011.

69) Haizhou Li and Ma Bin, “TechWare: Speaker and Spoken Language Recognition Resources”, IEEE

Signal Processing Magazine, vol. 27, no. 6, pp. 139-142, November 2010.

70) Deyi Xiong, Min Zhang, Aiti Aw, and Haizhou Li, “Linguistically Annotated Reordering Evaluation

and Analysis”, Computational Linguistics, vol. 36, no. 3, pp. 535-568, 2010.

71) Huajin Tang, Haizhou Li, and Zhang Yi, “A Discrete-Time Neural Network for Optimization

Problems with Hybrid Constraints”, IEEE Transactions on Neural Networks, vol. 21, no. 7, pp. 1184-

1189, 2010.

72) Lei Wang, Eng Siong Chng, and Haizhou Li, “A Tree-Construction Search Approach for Multivariate

Time Series Motifs Discovery”, Pattern Recognition Letters, vol. 31, no. 9, pp. 869-875, 2010.

73) Huajin Tang, Haizhou Li, and Rui Yan, “Memory Dynamics in Attractor Networks with Saliency

Weights”, Neural Computation, vol. 22, no. 7, pp. 1899-1926, July 2010.

74) Chang Huai You, Kong Aik Lee, and Haizhou Li, “GMM-SVM Kernel with a Bhattacharyya-Based

Distance for Speaker Recognition”, IEEE Transactions on Audio, Speech and Language Processing,

vol. 18, no. 6, pp. 1300-1312, 2010.

75) Tomi Kinnunen and Haizhou Li, “An Overview of Text-Independent Speaker Recognition: from

Features to Supervectors”, Speech Communication, vol. 52, no. 1, pp. 12-40, 2010. (Speech

Communication Most Cited Article since 2007)

76) Xiong Xiao, Jinyu Li, Eng Siong Chng, Haizhou Li, and Chin-Hui Lee, “A Study on the

Generalization Capability of Acoustic Models for Robust Speech Recognition”, IEEE Transactions on

Audio, Speech and Language Processing, vol. 18, no. 6, pp. 1158-1169, 2010.

77) Namunu C. Maddage, Khe Chai Sim, and Haizhou Li, “Word Level Automatic Alignment of Music

and Lyrics using Vocal Synthesis”, ACM Transactions on Multimedia Computing, Communications,

and Applications (TOMCCAP), vol. 6, no. 3, 2010.

78) Tee Kiah Chia, Khe Chai Sim, Haizhou Li, and Hwee Tou Ng, “Statistical Lattice-Based Spoken

Document Retrieval”, ACM Transactions on Information Systems, vol. 28, no. 1, 2010.

79) Huy Dat Tran and Haizhou Li, “Jump Function Kolmogorov for Audio Classification in Noise-

mismatch Conditions”, IEEE Transactions on Signal Processing, vol. 57, no. 8, pp. 2908-2918, 2009.

80) Rong Tong, Bin Ma, Haizhou Li, and Eng Siong Chng, “A Target-Oriented Phonotactic Front-end for

Spoken Language Recognition”, IEEE Transactions on Audio, Speech and Language Processing, vol.

17, no. 7, pp. 1335-1347, 2009.

81) Chang Hui You, Kong-Aik Lee, and Haizhou Li, “An SVM Kernel with GMM-Supervector Based on

the Bhattacharyya Distance for Speaker Recognition”, IEEE Signal Processing Letters, vol. 16, no. 1,

pp. 49-52, 2009.

82) Donglai Zhu, Haizhou Li, Bin Ma, and Chin-Hui Lee, “Optimizing the Performance of Spoken

Language Recognition with Discriminative Training”, IEEE Transactions on Audio, Speech and

Language Processing, vol. 16, no. 8, pp. 1642-165, 2008.

83) Xiong Xiao, Eng Siong Chng, and Haizhou Li, “Normalization of the Speech Modulation Spectra for

Robust Speech Recognition”, IEEE Transactions on Audio, Speech and Language Processing, vol. 16,

no. 8, pp. 1662-1674, 2008.

84) Haizhou Li, Jin-Shea Kuo, Jian Su, and Chih-Lung Lin, “Mining Live Transliterations using

Incremental Learning Algorithms”, International Journal of Computer Processing of Languages, vol.

21, no. 2, pp. 183-203, 2008.

85) Khe Chia Sim and Haizhou Li, “On Acoustic Diversification Front-end for Spoken Language

Identification”, IEEE Transactions on Audio, Speech and Language Processing, vol. 16, no. 5, pp.

1029-1037, 2008.

86) Jin-shea Kuo, Haizhou Li, and Ying-Kuei Yang, “Active Learning for Constructing Transliteration

Lexicons from the Web”, Journal of the American Society for Information Science and Technology, vol.

59, no. 1, 2008.

87) Bin Ma, Haizhou Li, and Rong Tong, “Spoken Language Recognition with Ensemble Classifiers”,

IEEE Transactions on Audio, Speech and Language Processing, vol. 15, no. 7, 2007.

88) Xiong Xiao, Eng Siong Chng, and Haizhou Li, “Temporal structure normalization of speech feature

for robust speech recognition”, IEEE Signal Processing Letters, vol. 14, no. 7, 2007.

89) Jin-Shea Kuo, Haizhou Li, and Ying-Kuei Yang, “A Phonetic Similarity Model for Automatic

Extraction of Transliteration Pairs”, ACM Transactions on Asian Language Information Processing,

vol. 6, no. 2, September 2007.

90) Tin Lay Nwe and Haizhou Li, “Exploring Vibrato-Motivated Acoustic Features for Singer

Identification”, IEEE Transactions on Audio, Speech and Language Processing, vol. 15, no. 2, 2007.

91) Haizhou Li, Bin Ma, and Chin-Hui Lee, “A Vector Space Modeling Approach to Spoken Language

Identification”, IEEE Transactions on Audio, Speech and Language Processing, vol. 15, no. 1, 2007.

92) Minghui Dong, Kim-Teng Lua, and Haizhou Li, “A Unit Selection-based Speech Synthesis Approach

for Mandarin Chinese”, Journal of Chinese Language and Computing, vol. 16, no. 1, March 2006.

93) Bin Ma and Haizhou Li, “A Comparative Study of Four Language Identification Systems”,

Computational Linguistics and Chinese Language Processing, vol. 11, no. 2, June 2006.

94) Jian Su, K. T. Ng, Haizhou Li, and Jean-Paul Haton, “Nonparametric distance measures of speaker

verification”, IEE Electronics Letters, vol. 31, no. 9, April 1995.

95) Haizhou Li, Jian Su, Jean-Paul Haton, “Short-timed speech dynamics for speaker recognition”, IEE

Electronics Letters, vol. 31, no. 17, August 1995.

Conference Papers (since 2004)

2017

1) Xiong Xiao, Shengkui Zhao, Douglas L. Jones, Eng Siong Chng, Haizhou Li:On time-frequency mask

estimation for MVDR beamforming with application in robust speech recognition. ICASSP 2017:

3246-3250

2) Liping Chen, Kong-Aik Lee, Bin Ma, Long Ma, Haizhou Li, Li-Rong Dai:Adaptation of PLDA for

multi-source text-independent speaker verification. ICASSP 2017: 5380-5384

http://dblp.uni-trier.de/pers/hd/x/Xiao:Xiong

http://dblp.uni-trier.de/pers/hd/z/Zhao:Shengkui

http://dblp.uni-trier.de/pers/hd/j/Jones:Douglas_L=

http://dblp.uni-trier.de/pers/hd/c/Chng:Eng_Siong

http://dblp.uni-trier.de/db/conf/icassp/icassp2017.html#XiaoZJCL17

http://dblp.uni-trier.de/pers/hd/c/Chen:Liping

http://dblp.uni-trier.de/pers/hd/l/Lee:Kong=Aik

http://dblp.uni-trier.de/pers/hd/m/Ma:Bin

http://dblp.uni-trier.de/pers/hd/m/Ma:Long

http://dblp.uni-trier.de/pers/hd/d/Dai:Li=Rong

http://dblp.uni-trier.de/db/conf/icassp/icassp2017.html#ChenLMMLD17

3) Yougen Yuan, Cheung-Chi Leung, Lei Xie, Hongjie Chen, Bin Ma, Haizhou Li: Pairwise learning

using multi-lingual bottleneck features for low-resource query-by-example spoken term detection.

ICASSP 2017: 5645-5649

4) Shan Yang, Lei Xie, Xiao Chen, Xiaoyan Lou, Xuan Zhu, Dongyan Huang, Haizhou Li: Statistical

Parametric Speech Synthesis Using Generative Adversarial Networks Under A Multi-task Learning

Framework. CoRR abs/1707.01670 (2017)

5) D.-Y. Huang, Wan Ding, Mingyu Xu, Huaiping Ming, Minghui Dong, Xinguo Yu, Haizhou Li,

Multimodal Prediction of Affective Dimensions via Fusing Multiple Regression Techniques,

INTERSPEECH 2017

6) Kong Aik Lee, Haizhou Li , Gain Compensation for Fast i-Vector Extraction Over Short Duration,

INTERSPEECH 2017

7) Chenglin Xu, Xiong Xiao, Sining Sun, Wei Rao, Eng Siong Chng, Haizhou Li, Weighted Spatial

Covariance Matrix Estimation for MUSIC Based TDOA Estimation of Speech Source,

INTERSPEECH 2017

8) Saad Irtza, Vidhyasaharan Sethu, Eliathamby Ambikairajah, Haizhou Li , Investigating Scalability in

Hierarchical Language Identification System, INTERSPEECH 2017

9) Jie Wu, D.-Y. Huang, Lei Xie, Haizhou Li , Denoising Recurrent Neural Network for Deep

Bidirectional LSTM Based Voice Conversion, INTERSPEECH 2017

10) Berrak Sisman, Haizhou Li, Kay Chen Tan, Transformation of Prosody in Voice Conversion, APSIPA

ASC 2017

11) Chitralekha Gupta, Haizhou Li, Ye Wang, Perceptual Evaluation of Singing Quality, APSIPA ASC

2017

12) Berrak Sisman, Haizhou Li, Kay Chen Tan, Sparse Representation of Phonetic Features for Voice

Conversion with and without parallel data, ASRU 2017

13) Shan Yang, Lei Xie, Xiao Chen, Xiaoyan Lou, Xuan Zhu, Dongyan Huang, Haizhou Li, Statistical

Parametric Speech Synthesis using Generative Adversarial Networks under a Multi-task Learning

Framework, ASRU 2017

14) Hongjie Chen, Cheung-Chi Leung, Lei Xie, Bin Ma, Haizhou Li, Multilingual bottle-neck feature

learning from Untranscribed Speech, ASRU 2017

15) Yougen Yuan, Cheung-Chi Leung, Lei Xie, Hongjie Chen, Bin Ma, Haizhou Li, Extracting Bottleneck

Features and Word-like Pairs from Untranscribed Speech from Feature Representation, ASRU 2017

2016

16) Seokhwan Kim, Rafael E. Banchs, Haizhou Li, Exploring Convolutional and Recurrent Neural

Networks in Sequential Labelling for Dialogue Topic Tracking. ACL (1) 2016

17) Nancy F. Chen, Haizhou Li, “Computer-assisted pronunciation training: From pronunciation scoring

towards spoken language learning”, in Proceedings of APSIPA 2016, pp. 1-7

18) Xiaohai Tian, Xiong Xiao, Eng Siong Chng, Haizhou Li, “Spoofing speech detection using temporal

convolutional neural network”, in Proceedings of APSIPA 201, pp. 1-6.

19) Xiong Xiao, Shinji Watanabe, Eng Siong Chng, Haizhou Li, “Beamforming networks using spatial

covariance features for far-field speech recognition”, in Proceedings of APSIPA 2016, pp. 1-6.

20) Haihua Xu, Wei Rao, Xiong Xiao, Hao Huang, Eng Siong Chng, Haizhou Li, “I-vector based deep

neural network acoustic model adaptation using multilingual language resource”, in Proceedings of

APSIPA 2016, pp. 1-5.

21) Xiaohai Tian, Zhizheng Wu, Xiong Xiao, Eng Siong Chng, Haizhou Li, “Spoofing detection from a

feature representation perspective”, in Proceedings of ICASSP 2016, pp. 2119-2123.

22) Huaiping Ming, Dong-Yan Huang, Lei Xie, Shaofei Zhang, Minghui Dong, Haizhou Li, “Exemplar-

based sparse representation of timbre and prosody for voice conversion”, in Proceedings of ICASSP

2016, pp. 5175-5179.

23) Liping Chen, Kong-Aik Lee, Eng Siong Chng, Bin Ma, Haizhou Li, Li-Rong Dai, “Content-aware

local variability vector for speaker verification with short utterance”, in Proceedings of ICASSP 2016,

pp.5485-5489.

http://dblp.uni-trier.de/pers/hd/y/Yuan:Yougen

http://dblp.uni-trier.de/pers/hd/l/Leung:Cheung=Chi

http://dblp.uni-trier.de/pers/hd/x/Xie:Lei

http://dblp.uni-trier.de/pers/hd/c/Chen:Hongjie

http://dblp.uni-trier.de/pers/hd/m/Ma:Bin

http://dblp.uni-trier.de/db/conf/icassp/icassp2017.html#YuanLXCML17

http://dblp.uni-trier.de/pers/hd/y/Yang:Shan

http://dblp.uni-trier.de/pers/hd/x/Xie:Lei

http://dblp.uni-trier.de/pers/hd/c/Chen:Xiao

http://dblp.uni-trier.de/pers/hd/l/Lou:Xiaoyan

http://dblp.uni-trier.de/pers/hd/z/Zhu:Xuan

http://dblp.uni-trier.de/pers/hd/h/Huang:Dongyan

http://dblp.uni-trier.de/db/journals/corr/corr1707.html#YangXCLZHL17

24) Saad Irtza, Vidhyasaharan Sethu, Haris Bavattichalil, Eliathamby Ambikairajah, Haizhou Li, “A

hierarchical framework for language identification”, in Proceedings of ICASSP 2016, pp. 5820-5824.

25) Chongjia Ni, Cheung-Chi Leung, Lei Wang, Haibo Liu, Feng Rao, Li Lu, Nancy F. Chen, Bin Ma,

Haizhou Li, “Cross-lingual deep neural network based submodular unbiased data selection for low-

resource keyword search”, in Proceedings of ICASSP 2016, pp. 6015-6019.

26) Haihua Xu, Jingyong Hou, Xiong Xiao, Van Tung Pham, Cheung-Chi Leung, Lei Wang, Van Hai Do,

Hang Lv, Lei Xie, Bin Ma, Eng Siong Chng, Haizhou Li, “Approximate search of audio queries by

using DTW with phone time boundary and data augmentation”, in Proceedings of ICASSP 2016, pp.

6030-6034.

27) Van Tung Pham, Haihua Xu, Xiong Xiao, Nancy F. Chen, Eng Siong Chng, Haizhou Li, “Keyword

search using query expansion for graph-based rescoring of hypothesized detections”, in Proceedings of

ICASSP 2016, pp. 6035-6039.

28) Nancy F. Chen, Van Tung Pharri, Haihua Xu, Xiong Xiao, Van Hai Do, Chongjia Ni, I-Fan Chen,

Sunil Sivadas, Chin-Hui Lee, Eng Siong Chng, Bin Ma, Haizhou Li, “Exemplar-inspired strategies for

low-resource spoken keyword search in Swahili”, in Proceedings of ICASSP 2016, pp. 6040-6044.

29) Xiong Xiao, Shengkui Zhao, Thi Ngoc Tho Nguyen, Douglas L. Jones, Eng Siong Chng, Haizhou Li,

“An expectation-maximization eigenvector clustering approach to direction of arrival estimation of

multiple speech sources”, in Proceedings of ICASSP 2016, pp. 6330-6334.

30) Dong-Yan Huang, Minghui Dong, Haizhou Li, “Combining multiple kernel models for automatic

intelligibility detection of pathological speech”, in Proceedings of ICASSP 2016: 6485-6489.

31) Wan Ding, Mingyu Xu, Dong-Yan Huang, Weisi Lin, Minghui Dong, Xinguo Yu, Haizhou Li, “Audio

and face video emotion recognition in the wild using deep neural networks and small datasets. ”, in

Proceedings of ICMI 2016, pp. 506-513.

32) Yougen Yuan, Cheung-Chi Leung, Lei Xie, Bin Ma, Haizhou Li, “Learning Neural Network

Representations Using Cross-Lingual Bottleneck Features with Word-Pair Information”, in

Proceedings of INTERSPEECH 2016, pp. 788-792.

33) Hongjie Chen, Cheung-Chi Leung, Lei Xie, Bin Ma, Haizhou Li, “Unsupervised Bottleneck Features

for Low-Resource Query-by-Example Spoken Term Detection”, in Proceedings of INTERSPEECH

2016, pp. 923-927.

34) Van Tung Pham, Haihua Xu, Xiong Xiao, Nancy F. Chen, Eng Siong Chng, Haizhou Li, “Rescoring

Hypothesized Detections of Out-of-Vocabulary Keywords Using Subword Samples”, in Proceedings of

INTERSPEECH 2016, pp. 933-937.

35) Paul Yaozhu Chan, Minghui Dong, Grace Xue Hui Ho, Haizhou Li, “SERAPHIM: A Wavetable

Synthesis System with 3D Lip Animation for Real-Time Speech and Singing Applications on Mobile

Platforms”, in Proceedings of INTERSPEECH 2016, pp. 1225-1229.

36) Haihua Xu, Hang Su, Chongjia Ni, Xiong Xiao, Hao Huang, Eng Siong Chng, Haizhou Li, “Semi-

Supervised and Cross-Lingual Knowledge Transfer Learnings for DNN Hybrid Acoustic Models

Under Low-Resource Conditions”, in Proceedings of INTERSPEECH 2016, pp. 1315-1319.

37) Jia Yu, Xiong Xiao, Lei Xie, Eng Siong Chng, Haizhou Li, “A DNN-HMM Approach to Story

Segmentation”, in Proceedings of INTERSPEECH 2016, pp. 1527-1531.

38) Nancy F. Chen, Rong Tong, Darren Wee, Pei Xuan Lee, Bin Ma, Haizhou Li, “SingaKids-Mandarin:

Speech Corpus of Singaporean Children Speaking Mandarin Chinese”, in Proceedings of


39) Xiaohai Tian, Zhizheng Wu, Xiong Xiao, Eng Siong Chng, Haizhou Li, “An Investigation of Spoofing

Speech Detection Under Additive Noise and Reverberant Conditions”, in Proceedings of


40) Paul Yaozhu Chan, Minghui Dong, Grace Xue Hui Ho, Haizhou Li, “SERAPHIM Live! - Singing

Synthesis for the Performer, the Composer, and the 3D Game Developer”, in Proceedings of


41) Huaiping Ming, Dong-Yan Huang, Lei Xie, Jie Wu, Minghui Dong, Haizhou Li, “Deep Bidirectional

LSTM Modeling of Timbre and Prosody for Emotional Voice Conversion”, in Proceedings of


42) Rong Tong, Nancy F. Chen, Bin Ma, Haizhou Li, “Context Aware Mispronunciation Detection for

Mandarin Pronunciation Training”, in Proceedings of INTERSPEECH 2016, pp. 3112-3116.

43) Kong-Aik Lee, Haizhou Li, Li Deng, Ville Hautamäki, Wei Rao, Xiong Xiao, Anthony Larcher,

Hanwu Sun, Trung Hieu Nguyen, Guangsen Wang, Aleksandr Sizov, Jianshu Chen, Ivan Kukanov,

Amir Hossein Poorjam, Trung Ngo Trong, Chenglin Xu, Haihua Xu, Bin Ma, Eng Siong Chng, Sylvain

Meignier, “The 2015 NIST Language Recognition Evaluation: The Shared View of I2R, Fantastic4 and

SingaMS”, in Proceedings of INTERSPEECH 2016, pp. 3211-3215.

44) Saad Irtza, Vidhyasaharan Sethu, Sarith Fernando, Eliathamby Ambikairajah, Haizhou Li, “Out of Set

Language Modelling in Hierarchical Language Identification”, in Proceedings of INTERSPEECH 2016,

pp. 3270-3274.

45) Chongjia Ni, Lei Wang, Cheung-Chi Leung, Feng Rao, Li Lu, Bin Ma, Haizhou Li, “Rapid Update of

Multilingual Deep Neural Network for Low-Resource Keyword Search”, in Proceedings of


46) Cheung-Chi Leung, Lei Wang, Haihua Xu, Jingyong Hou, Van Tung Pham, Hang Lv, Lei Xie, Xiong

Xiao, Chongjia Ni, Bin Ma, Eng Siong Chng, Haizhou Li, “Toward High-Performance Language-

Independent Query-by-Example Spoken Term Detection for MediaEval 2015: Post-Evaluation

Analysis”, in Proceedings of INTERSPEECH 2016, pp. 3703-3707.

2015

47) Huaiping Ming, Dong-Yan Huang, Minghui Dong, Haizhou Li, Lei Xie, Shaofei Zhang “Fundamental

frequency modeling using wavelets for emotional voice conversion”, in Proceedings of ACII 2015, pp.

804-809.

48) Van Hai Do, Xiong Xiao, Eng Siong Chng, Haizhou Li “Distance metric learning for kernel density-

based acoustic model under limited training data conditions”, in Proceedings of APSIPA 2015, pp.

54-58.

49) Jia Yu, Lei Xie, Xiong Xiao, Eng Siong Chng, Haizhou Li, “A density peak clustering approach to

unsupervised acoustic subword units discovery”, in Proceedings of APSIPA 2015, pp. 178-183.

50) Shaofei Zhang, Dong-Yan Huang, Lei Xie, Eng Siong Chng, Haizhou Li, Minghui Dong, “Non-

negative matrix factorization using stable alternating direction method of multipliers for source

separation”, in Proceedings of APSIPA 2015, pp. 222-228.

51) Van Tung Pham, Haihua Xu, Van Hai Do, Tze Yuang Chong, Xiong Xiao, Eng Siong Chng, Haizhou

Li, “On the study of very low-resource language keyword search”, in Proceedings of APSIPA 2015, pp.

358-364.

52) Minghui Dong, Chenyu Yang, Yanfeng Lu, Jochen Walter Ehnes, Dong-Yan Huang, Huaiping Ming,

Rong Tong, Siu Wa Lee, Haizhou Li, “Mapping frames with DNN-HMM recognizer for non-parallel

voice conversion” in Proceedings of APSIPA 2015, pp. 488-494.

53) Van Hai Do, Xiong Xiao, Haihua Xu, Eng Siong Chng, Haizhou Li, “Multilingual exemplar-based

acoustic model for the NIST Open KWS 2015 evaluation”, in Proceedings of APSIPA 2015, pp. 594-

98.

54) Shengkui Zhao, Xiong Xiao, Zhaofeng Zhang, Thi Ngoc Tho Nguyen, Xionghu Zhong, Bo Ren,

Longbiao Wang, Douglas L. Jones, Engsiong Chng, Haizhou Li, “Robust speech recognition using

beamforming with adaptive microphone gains and multichannel noise reduction”, in Proceedings of

ASRU 2015, pp. 460-467.

55) Haihua Xu, Xiong Xiao, Engsiong Chng, Haizhou Li “On statistical machine translation method for

lexicon refinement in speech recognition”, in Proceedings of ChinaSIP 2015, pp. 25-29.

56) Xiaohai Tian, Steven Du, Xiong Xiao, Haihua Xu, Engsiong Chng, Haizhou Li, “Detecting synthetic

speech using long term magnitude and phase information”, in Proceedings of ChinaSIP 2015, pp.

611-615.

57) Seokhwan Kim, Rafael E. Banchs, Haizhou Li, “Wikification of Concept Mentions within Spoken

Dialogues Using Domain Constraints from Wikipedia”, in Proceedings of EMNLP 2015, pp. 2225-

2229.

58) Kui Wu, Xuancong Wang, Nina Zhou, AiTi Aw, Haizhou Li, “Joint Chinese word segmentation and

punctuation prediction using deep recurrent neural network for social media data”, in Proceedings of

IALP 2015, pp. 41-44.

59) Gillian Chua, Qian Ci Chang, Ye Won Park, Paul Yaozhu Chan, Minghui Dong, Haizhou Li, “The

expression of singing emotion - contradicting the constraints of song”, in Proceedings of IALP 2015,

pp. 98-102.

60) Yang Yu, Weisi Lin, Dong-Yan Huang, Minghui Dong, Haizhou Li, “Performance scoring of singing

voice”, in Proceedings of IALP 2015, pp. 119-122.

61) Ridong Jiang, Seokhwan Kim, Rafael E. Banchs, Haizhou Li, “Towards improving the performance of

Vector Space Model for Chinese Frequently Asked Question Answering”, in Proceedings of IALP

2015, pp. 136-139.

62) Miaolong Yuan, Bo Tian, Vui Ann Shim, Huajin Tang, and Haizhou Li, “An Entorhinal-Hippocampal

Model for Simultaneous Cognitive Map Building”, in Proceedings of AAAI-15, Austin Texas, USA,

2015, pp.586-592.

63) Jonathan Dennis, Tran Huy Dat, and Haizhou Li, “Combining Robust Spike Coding with Spiking

Neural Networks for Sound Event Classification”, in Proceedings of ICASSP 2015, Brisbane, Australia,

April 2015.

64) Xiong Xiao, Shengkui Zhao, Xionghu Zhong, Douglas L. Jones, Eng Siong Chng, and Haizhou Li, “A

Learning-based Approach to Direction of Arrival Estimation in Noisy and reverberant Environments”,

in Proceedings of ICASSP 2015, Brisbane, Australia, April 2015.

65) Sven Ewan Shepstone, Kong Aik Lee, Haizhou Li, Zheng-Hua Tan, and Søren Holdt Jensen ,

“Source-Specific Informative Prior for i-Vector Extraction”, in Proceedings of ICASSP 2015, Brisbane,

Australia, April 2015.

66) Haihua Xu, Peng Yang, Xiong Xiao, Lei Xie, Cheung-Chi Leung, Hongjie Chen, Jia Yu, Hang Lv, Lei

Wang, Su Jun Leow, Bin Ma, Eng Siong Chng, and Haizhou Li, “Language Independent Query-by-

Example Spoken Term Detection using N-Best Phone Sequences and Partial Matching”, in

Proceedings of ICASSP 2015, Brisbane, Australia, April 2015.

67) Liping Chen, Kong Aik Lee, Bin Ma, Wu Guo, Haizhou Li, and Li Rong Dai, “Channel Adaptation of

PLDA for Text-Independent Speaker Verification”, in Proceedings of ICASSP 2015, Brisbane,


68) Rong Tong, Nancy F. Chen, Boon Pang Lim, Bin Ma, and Haizhou Li, “Tokenizing Fundamental

Frequency Variation for Mandarin Tone Error Detection”, in Proceedings of ICASSP 2015, Brisbane,


69) Nancy F. Chen, Chongjia Ni, I-Fan Chen, Sunil Sivadas, Van Tung Pham, Haihua Xu, Xiong Xiao,

Tze Siong Lau, Su Jun Leow, Boon Pang Lim, Cheung-Chi Leung, Lei Wang, Chin-Hui Lee, Alvina

Goh, Eng Siong Chng, Bin Ma, and Haizhou Li, “Low-Resource Keyword Search Strategies for

Tamil”, in Proceedings of ICASSP 2015, Brisbane, Australia, April 2015.

70) Liping Chen, Kong-Aik Lee, Bin Ma, Wu Guo, Haizhou Li, Li-Rong Dai, “Phone-centric local

variability vector for text-constrained speaker verification”, in Proceedings of INTERSPEECH 2015,

pp. 229-233.

71) Nancy F. Chen, Rong Tong, Darren Wee, Pei Xuan Lee, Bin Ma, Haizhou Li, “iCALL corpus:

Mandarin Chinese spoken by non-native speakers of European descent” in Proceedings of


72) Rong Tong, Nancy F. Chen, Bin Ma, Haizhou Li, “Goodness of tone (GOT) for non-native Mandarin

tone recognition”, in Proceedings of INTERSPEECH 2015, pp. 801-805.

73) Saad Irtza, Vidhyasaharan Sethu, Phu Ngoc Le, Eliathamby Ambikairajah, Haizhou Li “Phonemes

frequency based PLLR dimensionality reduction for language recognition”, in Proceedings of


74) Longting Xu, Kong-Aik Lee, Haizhou Li, Zhen Yang, “Sparse coding of total variability matrix” in


75) Tze Yuang Chong, Rafael E. Banchs, Engsiong Chng, Haizhou Li, “TDTO language modeling with

feedforward neural networks” in Proceedings of INTERSPEECH 2015, pp. 1458-1462.

76) Shaofei Zhang, Dong-Yan Huang, Lei Xie, Engsiong Chng, Haizhou Li, Minghui Dong, “Regularized

non-negative matrix factorization using alternating direction method of multipliers and its application

to source separation.”, in Proceedings of INTERSPEECH 2015, pp. 1498-1502.

77) Jonathan William Dennis, Tran Huy Dat, Haizhou Li, “Spiking neural networks and the generalised

hough transform for speech pattern detection”, in Proceedings of INTERSPEECH 2015, pp. 1997-

2001.

78) Xiong Xiao, Xiaohai Tian, Steven Du, Haihua Xu, Engsiong Chng, Haizhou Li, “Spoofing speech

detection using high dimensional magnitude and phase features: the NTU approach for ASVspoof 2015

challenge”, in Proceedings of INTERSPEECH 2015, pp. 2052-2056.

79) Kong-Aik Lee, Guangsen Wang, Kam Pheng Ng, Hanwu Sun, Trung Hieu Nguyen, Ngoc Thuy Huong

Thai, Bin Ma, Haizhou Li, ”The reddots platform for mobile crowd-sourcing of speech data”, in


80) Dong-Yan Huang, Minghui Dong, Haizhou Li, ”A real-time variable-q non-stationary Gabor

transform for pitch shifting”, in Proceedings of INTERSPEECH 2015, pp. 2744-2748.

81) Kong-Aik Lee, Anthony Larcher, Guangsen Wang, Patrick Kenny, Niko Brümmer, David A. van

Leeuwen, Hagai Aronowitz, Marcel Kockmann, Carlos Vaquero, Bin Ma, Haizhou Li, Themos

Stafylakis, Md. Jahangir Alam, Albert Swart, Javier Perez, “The reddots data collection for speaker

recognition”, in Proceedings of INTERSPEECH 2015, pp. 2996-3000.

82) Hongjie Chen, Cheung-Chi Leung, Lei Xie, Bin Ma, Haizhou Li ,“Parallel inference of dirichlet

process Gaussian mixture models for unsupervised acoustic modeling: a feasibility study” in


83) Huaiping Ming, Dong-Yan Huang, Lei Xie, Haizhou Li, Minghui Dong, “An alternating optimization

approach for phase retrieval” in Proceedings of INTERSPEECH 2015, pp. 3426-3430.

84) Xiong Xiao, Shengkui Zhao, Xionghu Zhong, Douglas L. Jones, Engsiong Chng, Haizhou Li,

“Learning to estimate reverberation time in noisy and reverberant rooms”, in Proceedings of


85) Sheng Gao, Haizhou Li “Popular song summarization using chorus section detection from audio

signal”, in Proceedings of MMSP 2015, pp. 1-6.

86) Seokhwan Kim, Rafael E. Banchs, Haizhou Li, “Towards Improving Dialogue Topic Tracking

Performances with Wikification of Concept Mentions”, in Proceedings of SIGDIAL Conference 2015,

pp. 124-128.

2014

87) Seokhwan Kim, Rafael E. Banchs, and Haizhou Li, “A Composite Kernel Approach for Dialog Topic

Tracking with Structured Domain Knowledge from Wikipedia”, in Proceedings of ACL-2014, vol.2,

Baltimore, Maryland, USA, 2014, pp.19-13.

88) Dong-Yan Huang, Haizhou Li, and Minghui Dong, “Ensemble Nyström method for predicting conflict

level from speech”, in Proceedings of APSIPA ASC 2014, Cambodia, 2014.

89) Guangpu Huang, Chenglin Xu, Xiong Xiao, Lei Xie, Chng Eng Siong, and Haizhou Li, “Multi-view

features in a DNN-CRF model for improved sentence unit detection on English broadcast news”, in

Proceedings of APSIPA ASC 2014, Cambodia, 2014.

90) Shuojun Liu, Dong-Yan Huang, Weisi Lin, Minghui Dong, Haizhou Li, and Ee Ping Ong, “Emotional

facial expression transfer based on temporal restricted Boltzmann machines”, in Proceedings of

APSIPA ASC 2014, Cambodia, 2014.

91) Zhizheng Wu, Sheng Gao, Eng Siong Chng, and Haizhou Li, “A study on replay attack and anti-

spoofing for text-dependent speaker verification”, in Proceedings of APSIPA ASC 2014, Cambodia,

2014.

92) Haihua Xu, Van Tung Pham, Eng Siong Chng, and Haizhou Li, “Towards better keyword search

performance on Malay broadcast news data”, in Proceedings of APSIPA ASC 2014, Cambodia, 2014.

93) Seokhwan Kim, Rafael E. Banchs, and Haizhou Li, “Wikipedia-based Kernels for dialogue topic

tracking”, in Proceedings of ICASSP 2014, Florence, Italy, May 2014, pp.131-135.

94) Anthony Larcher, Kong-Aik Lee, Bin Ma, and Haizhou Li, “Modelling the alternative hypothesis for

text-dependent speaker verification”, in Proceedings of ICASSP 2014, Florence, Italy, May 2014,

pp.734-738.

95) Anthony Larcher, Kong-Aik Lee, Bin Ma, and Haizhou Li, “Imposture classification for text-

dependent speaker verification”, in Proceedings of ICASSP 2014, Florence, Italy, May 2014, pp.739-

743.

96) Xiong Xiao, Jinyu Li, Eng Siong Chng, and Haizhou Li, “Feature compensation using linear

combination of speaker and environment dependent correction vectors”, in Proceedings of ICASSP

2014, Florence, Italy, May 2014, pp.1720-1724.

97) Duc Hoang Ha Nguyen, Xiong Xiao, Eng Siong Chng, and Haizhou Li, “Generalization of temporal

filter and linear transformation for robust speech recognition”, in Proceedings of ICASSP 2014,

Florence, Italy, May 2014, pp.1730-1734.

98) Jonathan William Dennis, Tran Huy Dat, Haizhou Li, and Eng Siong Chng, “A discriminatively

trained Hough Transform for frame-level phoneme recognition”, in Proceedings of ICASSP 2014,


99) Dong-Yan Huang, Minghui Dong, and Haizhou Li, “Intelligibility detection of pathological speech

using asymmetric sparse kernel partial least squares classifier”, in Proceedings of ICASSP 2014,


100) Liping Chen, Kong-Aik Lee, Bin Ma, Wu Guo, Haizhou Li, and Li-Rong Dai, “Minimum

divergence estimation of speaker prior in multi-session PLDA scoring”, in Proceedings of ICASSP


101) Nancy F. Chen, Sunil Sivadas, Boon Pang Lim, Hoang Gia Ngo, Haihua Xu, Van Tung Pham,

Bin Ma, and Haizhou Li, “Strategies for Vietnamese keyword search”, in Proceedings of ICASSP


102) Tze Yuang Chong, Rafael E. Banchs, Eng Siong Chng, and Haizhou Li, “Improving language

modeling by using distance and co-occurrence information of word-pairs and its application to

LVCSR”, in Proceedings of ICASSP 2014, Florence, Italy, May 2014, pp.4883-4887.

103) Rong Tong, Boon Pang Lim, Nancy F. Chen, Bin Ma, and Haizhou Li, “Subspace Gaussian

mixture model for computer-assisted language learning”, in Proceedings of ICASSP 2014, Florence,

Italy, May 2014, pp.5347-5351.

104) Van Tung Pham, Haihua Xu, Nancy F. Chen, Sunil Sivadas, Boon Pang Lim, Eng Siong Chng,

and Haizhou Li, “Discriminative score normalization for keyword search decision”, in Proceedings of

ICASSP 2014, Florence, Italy, May 2014, pp.7078-7082.

105) Van Hai Do, Xiong Xiao, Eng Siong Chng, and Haizhou Li, “Kernel density-based acoustic

model with cross-lingual bottleneck features for resource limited LVCSR”, in Proceedings of

INTERSPEECH 2014, Singapore, September 2014, pp.6-10.

106) Haipeng Wang, Tan Lee, Cheung-Chi Leung, Bin Ma, and Haizhou Li, “A graph-based

Gaussian component clustering approach to unsupervised acoustic modeling”, in Proceedings of


107) Anthony Larcher, Kong-Aik Lee, Pablo Luis Sordo Martinez, Trung Hieu Nguyen, Bin Ma,

and Haizhou Li, “Extended RSR2015 for text-dependent speaker verification over VHF channel”, in

Proceedings of INTERSPEECH 2014, Singapore, September 2014, pp.1322-1326.

108) Hoang Gia Ngo, Nancy F. Chen, Sunil Sivadas, Bin Ma, and Haizhou Li, “A minimal-

resource transliteration framework for Vietnamese”, in Proceedings of INTERSPEECH 2014,

Singapore, September 2014, pp.1410-1414.

109) Peng Yang, Cheung-Chi Leung, Lei Xie, Bin Ma, and Haizhou Li, “Intrinsic spectral analysis

based on temporal context features for query-by-example spoken term detection”, in Proceedings of


110) Haihua Xu, Hang Su, Eng Siong Chng, and Haizhou Li, “Semi-supervised training for bottle-

neck feature based DNN-HMM hybrid systems”, in Proceedings of INTERSPEECH 2014, Singapore,

September 2014, pp.2078-2082.

111) Minghui Dong, Siu Wa Lee, Haizhou Li, Paul Y. Chan, Xuejian Peng, Jochen Walter Ehnes,

and Dong-Yan Huang, “I2R speech2singing perfects everyone's singing”, in Proceedings of


112) Siu Wa Lee, Zhizheng Wu, Minghui Dong, Xiaohai Tian, and Haizhou Li, “A comparative

study of spectral transformation techniques for singing voice synthesis”, in Proceedings of


113) Zhizheng Wu, Eng Siong Chng, and Haizhou Li, “Joint nonnegative matrix factorization for

exemplar-based voice conversion”, in Proceedings of INTERSPEECH 2014, Singapore, September

2014, pp.2509-2513.

114) Chenglin Xu, Lei Xie, Guangpu Huang, Xiong Xiao, Eng Siong Chng, and Haizhou Li, “A

deep neural network approach for sentence boundary detection in broadcast news”, in Proceedings of


115) Rong Tong, Bin Ma, and Haizhou Li, “Virtual example for phonotactic language recognition”,

in Proceedings of INTERSPEECH 2014, Singapore, September 2014, pp.3017-3021.

116) Vui Ann Shim, Bo Tian, Miaolong Yuan, Huajin Tang, and Haizhou Li, “Direction-driven

navigation using cognitive map for mobile robots”, in Proceedings of the IEEE/RSJ International

Conference on Intelligent Robots and Systems (IROS 2014), Chicago, Illinois, USA, pp.2639-2646.

117) Liping Chen, Kong-Aik Lee, Bin Ma, Wu Guo, Haizhou Li, and Li-Rong Dai, “Local

variability vector for text-independent speaker verification”, in Proceedings of ISCSLP 2014,

Singapore, September 2014, pp.54-58.

118) Yuma Ueda, Longbiao Wang, Atsuhiko Kai, Xiong Xiao, Eng Siong Chng, and Haizhou Li,

“Single-channel dereverberation for distant-talking speech recognition by combining denoising

autoencoder and temporal structure normalization”, in Proceedings of ISCSLP 2014, Singapore,

September 2014, pp.379-383.

119) Kelvin Poon-Feng, Dong-Yan Huang, Minghui Dong, and Haizhou Li, “Acoustic emotion

recognition based on fusion of multiple feature-dependent deep Boltzmann machines”, in Proceedings

of ISCSLP 2014, Singapore, September 2014, pp.584-588.

120) Nicole Mirnig, Yeow Kee Tan, Tai Wen Chang, Yuanwei Chua, Tran Anh Dung, Haizhou Li,

and Manfred Tscheligi, “Screen feedback in human-robot interaction: How to enhance robot

expressiveness”, in Proceedings of IEEE International Symposium on Robot and Human Interactive

Communication (RO-MAN 2014), Edinburgh, UK, 2014, pp.224-230.

121) Van Tung Pham, Nancy F. Chen, Sunil Sivadas, Haihua Xu, I-Fan Chen, Chongjia Ni, Eng

Siong Chng, and Haizhou Li, “System and keyword dependent fusion for spoken term detection”, in

Proceedings of IEEE Spoken Language Technology Workshop (SLT 2014), South Lake Tahoe, Nevada,

USA, 2014, pp.430-435.

122) Andreea I. Niculescu, Rafael E. Banchs, and Haizhou Li, “Why Industrial Robots Should

Become More Social - On the Design of a Natural Language Interface for an Interactive Robot Welder”,

in Proceedings of ICSR 2014, Sydney, Australia, 2014, pp.276-278.

2013

123) Zhizheng Wu and Haizhou Li, “Voice conversion and spoofing attack on speaker verification

systems”, in Proceedings of APSIPA ASC 2013, Kaohsiung, Taiwan, 2013. (Invited paper)

124) Duc Hoang Ha Nguyen, Aleem Mushtaq, Xiong Xiao, Eng Siong Chng, Haizhou Li, and

Chin Hui Lee, “A Particle Filter Compensation Approach to Robust LVCSR”, in Proceedings of

APSIPA ASC 2013, Kaohsiung, Taiwan, 2013.

125) Tze Yuang Chong, Rafael E. Banchs, Eng Siong Chng, and Haizhou Li, “Modeling of term-

distance and term-occurrence information for improving n-gram language model performance”, in

Proceedings of ACL-2013, Sofia, Bulgaria, 2013, pp.233-237.

126) Xiaoming Lu, Lei Xie, Cheung-Chi Leung, Bin Ma, and Haizhou Li, “Broadcast news story

segmentation using manifold learning on latent topic distributions”, in Proceedings of ACL-2013,

Sofia, Bulgaria, 2013, pp. 190-195.

127) Zhizheng Wu, Eng Siong Chng, and Haizhou Li, "Conditional restricted boltzmann machine

for voice conversion", in Proceedings of ChinaSIP 2013, Beijing, China, 2013.

128) Vidhyasaharan Sethu, Julien Epps, Eliathamby Ambikairajah, and Haizhou Li, “GMM Based

Speaker Variability Compensated System for Interspeech 2013 ComParE Emotion Challenge”, in

Proceedings of INTERSPEECH 2013, Lyon, France, August 2013.

129) Van Hai Do, Xiong Xiao, Eng Siong Chng, and Haizhou Li, “Context-Dependent Phone

Mapping for LVCSR of Under-Resourced Languages”, in Proceedings of INTERSPEECH 2013, Lyon,

France, August 2013.

130) Xiong Xiao, Eng Siong Chng, and Haizhou Li, “Attribute-Based Histogram Equalization

(HEQ) and its Adaptation for Robust Speech Recognition”, in Proceedings of INTERSPEECH 2013,

Lyon, France, August 2013.

131) Zhizheng Wu, Anthony Larcher, Kong Aik Lee, Eng Siong Chng, Tomi Kinnunen, and

Haizhou Li, “Vulnerability Evaluation of Speaker Verification Under Voice Conversion Spoofing:

The Effect of Text Constraints”, in Proceedings of INTERSPEECH 2013, Lyon, France, August 2013.

132) R. Saeidi, Kong Aik Lee, Tomi Kinnunen, Taufiq Hasan, Benoit Fauve, P.-M. Bousquet, Elie

Khoury, P.L. Sordo Martinez, J. M. K. Kua, Chang Huai You, Hanwu Sun, Anthony Larcher,

Padmanabhan Rajan, Ville Hautamäki, Cemal Hanilçi, B. Braithwaite, Rosa González Hautamäki,

Seyed Omid Sadjadi, Gang Liu, Hynek Boril, N. Shokouhi, D. Matrouf, L. El Shafey, Pejman

Mowlaee, Julien Epps, T. Thiruvaran, David A. van Leeuwen, Bin Ma, Haizhou Li, John H.L. Hansen,

and Jean-Francois Bonastre, “I4U Submission to NIST SRE 2012: A Large-Scale Collaborative Effort

for Noise-Robust Speaker Verification”, in Proceedings of INTERSPEECH 2013, Lyon, France,

August 2013.

133) Haipeng Wang, Tan Lee, Cheung-Chi Leung, Bin Ma, and Haizhou Li, “Unsupervised

Mining of Acoustic Subword Units with Segment-Level Gaussian Posteriorgrams”, in Proceedings of

INTERSPEECH 2013, Lyon, France, August 2013.

134) Nancy F. Chen, Vivaek Shivakumar, Mahesh Harikumar, Bin Ma, and Haizhou Li, “Large-

Scale Characterization of Mandarin Pronunciation Errors Made by Native Speakers of European

Languages”, in Proceedings of INTERSPEECH 2013, Lyon, France, August 2013.

135) Anthony Larcher, Jean-Francois Bonastre, Benoit Fauve, Kong Aik Lee, Christophe Lévy,

Haizhou Li, John S. D. Mason, and Jean-Yves Parfait, “ALIZE 3.0 — Open Source Toolkit for State-

of-the-Art Speaker Recognition”, in Proceedings of INTERSPEECH 2013, Lyon, France, August 2013.

136) Zhizheng Wu, Tuomas Virtanen, Tomi Kinnunen, Eng Siong Chng, and Haizhou Li,

“Exemplar-Based Unit Selection for Voice Conversion Utilizing Temporal Information”, in

Proceedings of INTERSPEECH 2013, Lyon, France, August 2013.

137) Kong Aik Lee, Anthony Larcher, Chang Huai You, Bin Ma, and Haizhou Li, “Multi-Session

PLDA Scoring of i-Vector for Partially Open-Set Speaker Detection”, in Proceedings of

INTERSPEECH 2013, Lyon, France, August 2013.

138) Zhizheng Wu, Xiong Xiao, Eng Siong Chng, and Haizhou Li, “Synthetic Speech Detection

using Temporal Modulation Feature”, in Proceedings of ICASSP 2013, Vancouver, Canada, May 2013.

139) Dau-Cheng Lyu, Eng-Siong Chng, and Haizhou Li, “Language Diarization for Code-Switch

Conversational Speech”, in Proceedings of ICASSP 2013, Vancouver, Canada, May 2013.

140) Nancy F. Chen, Bin Ma, and Haizhou Li, “Minimal-Resource Phonetic Language Models to

Summarize Untranscribed Speech”, in Proceedings of ICASSP 2013, Vancouver, Canada, May 2013.

141) Anthony Larcher, Kong Aik Lee, Bin Ma, and Haizhou Li, “Phonetically-Constrained PLDA

Modeling for Text-Dependent Speaker Verification with Multiple Short Utterances”, in Proceedings of

ICASSP 2013, Vancouver, Canada, May 2013.

142) Chang Huai You, Haizhou Li, Bin Ma, and Kong Aik Lee, “A Study on GMM-SVM with

Adaptive Relevance Factor and Its Comparison with i-Vector and JFA for Speaker Recognition”, in

Proceedings of ICASSP 2013, Vancouver, Canada, May 2013.

143) Heike Adel, Ngoc Thang Vu, Franziska Kraus, Tim Schlippe, Haizhou Li, and Tanja Schultz,

“Recurrent Neural Network Language Modeling for Code Switching Conversational Speech”, in


144) Xiaoming Lu, Cheung-Chi Leung, Lei Xie, Bin Ma, and Haizhou Li, “Broadcast News Story

Segmentation using Latent Topics on Data Manifold”, in Proceedings of ICASSP 2013, Vancouver,

Canada, May 2013.

145) Jonathan Dennis, Yu Qiang, Tang Huajin, Tran Huy Dat, and Li Haizhou, “Temporal Coding

of Local Spectrogram Features for Robust Sound Recognition”, in Proceedings of ICASSP 2013,

Vancouver, Canada, May 2013.

146) Xiong Xiao, Eng Siong Chng, and Haizhou Li, “Temporal Filter Design by Minimum KL

Divergence Criterion for Robust Speech Recognition”, in Proceedings of ICASSP 2013, Vancouver,

Canada, May 2013.

147) Haipeng Wang, Tan Lee, Cheung-Chi Leung, Bin Ma, and Haizhou Li, “Using Parallel

Tokenizers with DTW Matrix Combination for Low-Resource Spoken Term Detection”, in


148) Yanan Li, Keng Peng Tee, Shuzhi Sam Ge, and Haizhou Li, “Building Companionship

through Human-Robot Collaboration”, in Proceedings of ICSR 2013, Bristol, UK, October, 2013.

2012

149) Zhizheng Wu, Tomi Kinnunen, Eng Siong Chng, Haizhou Li, and Eliathamby Ambikairajah,

“A study on spoofing attack in state-of-the-art speaker verification: the telephone speech case”, in

Proceedings of APSIPA ASC 2012, California, USA, 2012. (Best Paper Award)

150) Tze Yuang Chong, Xiong Xiao, Tien-Ping Tan, Eng Siong Chng, and Haizhou Li,

“Collection and annotation of Malay conversational speech corpus”, in Proceedings of O-COCOSDA

2012, Macau, China, December 2012.

151) Deyi Xiong, Min Zhang, and Haizhou Li, “Modeling the Translation of Predicate-Argument

Structure for SMT”, in Proceedings of ACL-2012, Jeju, Korea, July 2012.

152) Wenliang Chen, Min Zhang, and Haizhou Li, “Utilizing Dependency Language Models for

Graph-based Dependency Parsing Models”, in Proceedings of ACL-2012, Jeju, Korea, July 2012.

153) Rafael E. Banchs and Haizhou Li, “IRIS: a Chat-oriented Dialogue System based on the

Vector Space Model”, in Proceedings of ACL-2012 (System Demonstrations), Jeju, Korea, July 2012.

154) Xiong Xiao, Jinyu Li, Eng Siong Chng, and Haizhou Li, “Lasso Environment Model

Combination for Robust Speech Recognition”, in Proceedings of ICASSP 2012, Kyoto, Japan, March

2012.

155) Xiong Xiao, Eng Siong Chng, and Haizhou Li, “Joint Spectral and Temporal Normalization

of Features for Robust Recognition of Noisy and Reverberated Speech”, in Proceedings of ICASSP

2012, Kyoto, Japan, March 2012.

156) Siu Wa Lee, Shen Ting Ang, Minghui Dong, and Haizhou Li, “Generalized F0 modelling

with absolute and relative pitch features for singing voice synthesis”, in Proceedings of ICASSP 2012,

Kyoto, Japan, March 2012.

157) Lilei Zheng, Cheung-Chi Leung, Lei Xie, Bin Ma, and Haizhou Li, “Acoustic texttiling for

story segmentation of spoken documents”, in Proceedings of ICASSP 2012, Kyoto, Japan, March 2012.

158) Haipeng Wang, Cheung-Chi Leung, Tan Lee, Bin Ma, and Haizhou Li, “An acoustic segment

modeling approach to query-by-example spoken term detection”, in Proceedings of ICASSP 2012,

Kyoto, Japan, March 2012.

159) Anthony Larcher, Pierre-Michel Bousquet, Kong Aik Lee, Driss Matrouf, Haizhou Li, and

Jean-Francois Bonastre, “I-vectors in the context of phonetically-constrained short utterances for

speaker verification”, in Proceedings of ICASSP 2012, Kyoto, Japan, March 2012.

160) Tomi Kinnunen, Zhi-Zheng Wu, Kong Aik Lee, Filip Sedlak, Eng Siong Chng, and Haizhou

Li, “Vulnerability of speaker verification systems against voice conversion spoofing attacks: the case

of telephone speech”, in Proceedings of ICASSP 2012, Kyoto, Japan, March 2012.

161) Ye Jiang, Kong Aik Lee, Zhenmin Tang, Bin Ma, Anthony Larcher, and Haizhou Li, “PLDA

Modeling in I-Vector and Supervector Space for Speaker Verification”, in Proceedings of

INTERSPEECH 2012, Portland, Oregon, September 2012.

162) Anthony Larcher, Kong Aik Lee, Bin Ma, and Haizhou Li, “RSR2015: Database for Text-

Dependent Speaker Verification using Multiple Pass-Phrases”, in Proceedings of INTERSPEECH 2012,

Portland, Oregon, September 2012.

163) You Changhuai, Li Haizhou, Ma Bin, and Lee Kong Aik, “Effect of Relevance Factor of

Maximum a posteriori Adaptation for GMM-SVM in Speaker and Language Recognition”, in

Proceedings of INTERSPEECH 2012, Portland, Oregon, September 2012.

164) Van Hai Do, Xiong Xiao, Engsiong Chng, and Haizhou Li, “Context dependant phone

mapping for cross-lingual acoustic modelling”, in Proceedings of ISCSLP 2012, Hong Kong,

December 2012, pp. 16-20.

165) Cheung-Chi Leung, Bin Ma, and Haizhou Li, “Phonotactic spoken language recognition:

Using diversely adapted acoustic models in parallel phone recognizers”, in Proceedings of ISCSLP

2012, Hong Kong, December 2012, pp. 108-111.

166) Duc Hoang Ha Nguyen, Xiong Xiao, Chng Eng Siong, and Haizhou Li, “An analysis of

vector Taylor series model compensation for non-stationary noise in speech recognition”, in

Proceedings of ISCSLP 2012, Hong Kong, December 2012, pp. 131-135.

167) Siu Wa Lee, Minghui Dong, and Haizhou Li, “A study of F0 modelling and generation with

lyrics and shape characterization for singing voice synthesis”, in Proceedings of ISCSLP 2012, Hong

Kong, December 2012, pp. 150-154.

168) Van Hai Do, Xiong Xiao, Engsiong Chng, and Haizhou Li, “A Phone Mapping Technique for

Acoustic Modeling of Under-Resourced Languages”, in Proceedings of the International Conference

on Asian Language Processing 2012 (IALP 2012), Hanoi, Vietnam, November 2012, pp. 233-236.

169) Liyuan Li, Xinguo Yu, Jun Li, Gang Wang, Ji Yu Shi, Yeow Kee Tan, and Haizhou Li,

“Vision-based attention estimation and selection for social robot to perform natural interaction in the

open world”, in Proceedings of the Seventh Annual Conference on Human-Robot Interaction (HRI

2012), Boston, Massachusetts, USA, March 2012, pp. 183-184.

170) Keng Peng Tee, Shuzhi Sam Ge, Rui Yan, and Haizhou Li, “Adaptive control for robot

manipulators under ellipsoidal task space constraints”, in Proceedings of the IEEE/RSJ International

Conference on Intelligent Robots and Systems (IROS 2012), Vilamoura, Algarve, Portugal, October

2012, pp. 1167-1172.

2011

171) Deyi Xiong, Min Zhang, and Haizhou Li, “Enhancing Language Models in Statistical

Machine Translation with Backward N-grams and Mutual Information Triggers”, in Proceedings of

ACL-2011: HLT, Portland, Oregon, June 2011.

172) Rafael E. Banchs and Haizhou Li, “AM-FM: A Semantic Framework for Translation Quality

Assessment”, in Proceedings of ACL-2011: HLT, Portland, Oregon, June 2011, pp. 153-158.

173) Wenliang Chen, Junichi Kazama, Min Zhang, Yoshimasa Tsuruoka, Yujie Zhang, Yiou Wang,

Kentaro Torisaws, and Haizhou Li, “SMT Helps Bitext Dependency Parsing”, in Proceedings of

EMNLP 2011, Edinburgh, UK, July 2011.

174) Zhenghua Li, Min Zhang, Wanxiang Che, Ting Liu, Wenliang Chen, and Haizhou Li, “Joint

Models for Chinese POS Tagging and Dependency Parsing”, in Proceedings of EMNLP 2011,

Edinburgh, UK, July 2011.

175) Min Zhang, Xiangyu Duan, Ming Liu, Yunqing Xia, and Haizhou Li, “Joint Alignment and

Artificial Data Generation: An Empirical Study of Pivot-based Machine Transliteration”, in

Proceedings of IJCNLP 2011, Chiang Mai, Thailand, November 2011.

176) Guoyu Tang, Yunqing Xia, Min Zhang, Haizhou Li, and Fang Zhang, “CLGVSM: Adapting

Generalized Vector Space Model to Cross-lingual Document Clustering”, in Proceedings of IJCNLP

2011, Chiang Mai, Thailand, November 2011.

177) Huy Dat Tran and Haizhou Li, “Probabilistic Distance SVM With Hellinger-Exponential

Kernel for Sound Event Classification”, in Proceedings of ICASSP 2011, Prague, Czech, May 2011.

178) Huy Dat Tran and Haizhou Li, “Jump Function Kolmogorov for Overlapping Audio Event

Classification”, in Proceedings of ICASSP 2011, Prague, Czech, May 2011.

179) Raymond W. M. Ng, Cheung-Chi Leung, Tan Lee, Bin Ma, and Haizhou Li, “Score Fusion

and Calibration in Multiple Language Detectors With Large Performance Variation”, in Proceedings of

ICASSP 2011, Prague, Czech, May 2011.

180) Filip Sedlak, Tomi Kinnunen, Ville Hautamäki, Kong Aik Lee, Haizhou Li, “Classifier

Subset Selection and Fusion for Speaker Verification”, in Proceedings of ICASSP 2011, Prague, Czech,

May 2011.

181) Eryu Wang, Kong Aik Lee, Bin Ma, Haizhou Li, Wu Guo, Li-Rong Dai, “Factored

Covariance Modeling for Text-Independent Speaker Verification”, in Proceedings of ICASSP 2011,

Prague, Czech, May 2011.

182) Xiong Xiao, Jinyu Li, Eng Siong Chng, Haizhou Li, “Maximum Likelihood Adaptation of

Histogram Equalization With Constraint for Robust Speech Recognition”, in Proceedings of ICASSP

2011, Prague, Czech, May 2011.

183) Kong Aik Lee, Chang Huai You, Ville Hautamäki, Anthony Larcher, and Haizhou Li,

“Spoken Language Recognition in the Latent Topic Simplex”, in Proceedings of INTERSPEECH 2011,

Florence, Italy, August 2011.

184) Chang Huai You, Haizhou Li, and Kong Aik Lee, “Study on the Relevance Factor of

Maximum a Posteriori with GMM for Language Recognition”, in Proceedings of INTERSPEECH 2011,


185) Rong Tong, Bin Ma, Haizhou Li, and Eng Siong Chng, “Target-aware Lattice Rescoring for

Dialect Recognition”, in Proceedings of INTERSPEECH 2011, Florence, Italy, August 2011.

186) Yiren Leng, Huy Dat Tran, Norihide Kitaoka, and Haizhou Li, “Alternative Frequency Scale

Cepstral Coefficient for Robust Sound Event Recognition”, in Proceedings of INTERSPEECH 2011,


187) Kong Aik Lee, Anthony Larcher, Helen Thai, Bin Ma, and Haizhou Li, “Joint Application of

Speech and Speaker Recognition for Automation and Security in Smart Home”, in Proceedings of

INTERSPEECH 2011, Florence, Italy, August 2011.

188) Chien-Lin Huang, Bin Ma, Haizhou Li, and Chung-Hsien Wu, “Speech Indexing Using

Semantic Context Inference”, in Proceedings of INTERSPEECH 2011, Florence, Italy, August 2011.

189) Xiong Xiao, Jinyu Li, Eng Siong Chng, and Haizhou Li, “Feature Normalization Using

Structured Full Transforms for Robust Speech Recognition”, in Proceedings of INTERSPEECH 2011,


190) Sethserey Sam, Xiong Xiao, Laurent Besacier, Eric Castelli, and Haizhou Li, and Eng Siong

Chng, “Speech Modulation Features for Robust Nonnative Speech Accent Detection”, in Proceedings

of INTERSPEECH 2011, Florence, Italy, August 2011.

191) Jonathan William Dennis, Huy Dat Tran, and Haizhou Li, “Image Representation of the

Subband Power Distribution for Robust Sound Classification”, in Proceedings of INTERSPEECH 2011,


192) Mimi Lu, Cheung-Chi Leung, Lei Xie, Bin Ma, and Haizhou Li, “Probabilistic Latent

Semantic Analysis for Broadcast News Story Segmentation”, in Proceedings of INTERSPEECH 2011,


2010

193) Min Zhang, Hui Zhang, and Haizhou Li, “Convolution Kernel over Packed Parse Forest”, in

Proceedings of ACL 2010, Uppsala, Sweden, July 2010. (Full paper)

194) Deyi Xiong, Min Zhang, and Haizhou Li, “Error Detection for Statistical Machine

Translation Using Linguistic Features”, in Proceedings of ACL 2010, Uppsala, Sweden, July 2010.

(Full paper)

195) Xiangyu Duan, Min Zhang, and Haizhou Li. “Pseudo-word for Phrase-based Machine

Translation”, in Proceedings of ACL 2010, Uppsala, Sweden, July 2010. (Full paper)

196) Deyi Xiong, Min Zhang, and Haizhou Li, “Learning Translation Boundaries for Phrase-

Based Decoding”, in Proceedings of NAACL-HLT 2010, Los Angeles, CA, June 2010.

197) Lianhau Lee, Aiti Aw, Min Zhang, and Haizhou Li, “EM-based Hybrid Model for Bilingual

Terminology Extraction from Comparable Corpora”, in Proceedings of COLING 2010, Beijing, China,

August 2010.

198) Vladimir Pervouchine, Min Zhang, Ming Liu, and Haizhou Li, “Improving Name Origin

Recognition with Context Features and Unlabelled Data”, in Proceedings of COLING 2010, Beijing,

China, August 2010.

199) Min Zhang, Xiangyu Duan, Vladimir Pervouchine, and Haizhou Li, “Machine Transliteration:

Leveraging on Third Languages”, in Proceedings of COLING 2010, Beijing, China, August 2010.

200) Raymond W. M. Ng, Cheung-Chi Leung, Ville Hautamaki, Tan Lee, Bin Ma, and Haizhou Li,

“Towards Long-Range Prosodic Attribute Modeling For Language Recognition”, in Proceedings of

INTERSPEECH 2010, Makuhari, Japan, September 2010.

201) Tin Lay Nwe, Hanwu Sun, Bin Ma, and Haizhou Li, “Speaker Diarization in Meeting Audio

for Single Distant Microphone”, in Proceedings of INTERSPEECH 2010, Makuhari, Japan, September

2010.

202) Rong Tong, Bin Ma, Haizhou Li, and Eng Siong Chng, “Selecting Phonotactic Features for

Language Recognition”, in Proceedings of INTERSPEECH 2010, Makuhari, Japan, September 2010.

203) Omid Dehzangi, Bin Ma, Eng Siong Chng, and Haizhou Li, “A Discriminative Performance

Metric for GMM-UBM Speaker Identification”, in Proceedings of INTERSPEECH 2010, Makuhari,

Japan, September 2010.

204) Cheung-Chi Leung, Donglai Zhu, Kong-Aik Lee, Bin Ma, and Haizhou Li, “Incorporating

MAP Estimation and Covariance Transform for SVM based Speaker Recognition”, in Proceedings of


205) Chien-Lin Huang, Hanwu Sun, Bin Ma, and Haizhou Li, “Speaker Characterization Using

Long-Term and Temporal Information”, in Proceedings of INTERSPEECH 2010, Makuhari, Japan,

September 2010.

206) Xiaoxuan Wang, Lei Xie, Bin Ma, Eng Siong Chng, and Haizhou Li, “Phoneme Lattice based

TextTiling towards Multilingual Story Segmentation”, in Proceedings of INTERSPEECH 2010,

Makuhari, Japan, September 2010.

207) Eryu Wang, Kong-Aik Lee, Bin Ma, Haizhou Li, Wu Guo, and Lirong Dai, “The Estimation

and Kernel Metric of Spectral Correlation for Text-Independent Speaker Verification”, in Proceedings

of INTERSPEECH 2010, Makuhari, Japan, September 2010.

208) Donglai Zhu, Bin Ma, Kong-Aik Lee, Cheung-Chi Leung, and Haizhou Li, “MAP Estimation

of Subspace Transform for Speaker Recognition”, in Proceedings of INTERSPEECH 2010, Makuhari,

Japan, September 2010.

209) Hanwu Sun, Bin Ma, Chien-Lin Huang, Trung Hieu Nguyen, and Haizhou Li, “The IIR NIST

SRE 2008 and 2010 Summed Channel Speaker Recognition Systems”, in Proceedings of


210) Ville Hautamaki, Tomi Kinnunen, Mohaddeseh Nosratighods, Kong-Aik Lee, Bin Ma, and

Haizhou Li, “Approaching Human Listener Accuracy with Modern Speaker Verification”, in

Proceedings of INTERSPEECH 2010, Makuhari, Japan, September 2010.

211) Minghui Dong, Paul Chan, Ling Cen, Haizhou Li, Jason Teo, and Ping Jen Kua, “Phonetic

Segmentation of Singing Voice using MIDI and Parallel Speech”, in Proceedings of INTERSPEECH

2010, Makuhari, Japan, September 2010.

212) You Changhuai, Li Haizhou, and Kong-Aik Lee, “A Hybrid Modeling Strategy for GMM-

SVM Speaker Recognition System with Adaptive Relevance factor”, in Proceedings of


213) Leng Yi Ren, Tran Huy Dat, Norihide Kitaoka, and Li Haizhou, “Selective Gammatone

Filterbank Feature for Robust Sound Event Recognition”, in Proceedings of INTERSPEECH 2010,


214) Zhi-Zheng Wu, Tomi Kinnunen, Eng Siong Chng, and Haizhou Li, “Text-Independent F0

Transformation with Non-Parallel Data for Voice Conversion”, in Proceedings of INTERSPEECH

2010, Makuhari, Japan, September 2010.

215) Dau-Cheng Lyu, Tien-Ping Tan, Eng-Siong Chng, and Haizhou Li, “SEAME: a Mandarin-

English Code-switching Speech Corpus in South-East Asia”, in Proceedings of INTERSPEECH 2010,


216) Dat Tran Huy, Yi Ren Leng, and Haizhou Li, “Feature Integration for Heart Sound

Biometrics”, in Proceedings of ICASSP 2010, Dallas, USA, March 2010.

217) Omid Dehzangi, Bin Ma, Eng Siong Chng, and Haizhou Li, “Error Corrective Classifier

Fusion for Spoken Language Recognition”, in Proceedings of ICASSP 2010, Dallas, USA, March

2010.

218) C. P. Santhosh Kumar, Haizhou Li, Rong Tong, Pavel Matejka, Lukas Burget, and Jan

Cernocky, “Tuning Phone Decoders for Language Identification”, in Proceedings of ICASSP 2010,

Dallas, USA, March 2010.

219) Hanwu Sun, Bin Ma, Swe Zin Kalayar Khine, and Haizhou Li, “Speaker Diarization System

for RT07 and RT09 Meeting Room Audio”, in Proceedings of ICASSP 2010, Dallas, USA, March

2010.

220) Yu Tsao, Hanwu Sun, Haizhou Li, and Chin-Hui Lee, “An Acoustic Segment Model

Approach to Incorporating Temporal Information into Speaker Modeling for Text-Independent Speaker

Recognition”, in Proceedings of ICASSP 2010, Dallas, USA, March 2010.

221) Donglai Zhu, Bin Ma, and Haizhou Li, “Soft Margin Estimation of Gaussian Mixture Model

Parameters for Spoken Language Recognition”, in Proceedings of ICASSP 2010, Dallas, USA, March

2010.

222) Shuanhu Bai, Chien-Lin Huang, Bin Ma, and Haizhou Li, “Semi-Supervised Learning of

Language Model using Unsupervised Topic Model”, in Proceedings of ICASSP 2010, Dallas, USA,

March 2010.

223) Raymond W. M. Ng, Cheung-Chi Leung, Tan Lee, Bin Ma, and Haizhou Li, “Prosodic

Attribute Model for Spoken Language Identification”, in Proceedings of ICASSP 2010, Dallas, USA,

March 2010.

2009

224) Vladimir Pervouchine, Haizhou Li, and Bo Lin, “Transliteration Alignment”, in Proceedings

of the 47th Annual Meeting of Association for Computational Linguistics and the 4th International

Joint Conference of Natural Language Processing (ACL-IJCNLP 2009), Singapore, August 2009. (Full

paper)

225) Deyi Xiong, Min Zhang, Aiti Aw and Haizhou Li, “A Syntax-Driven Bracketing Model for

Phrase-Based Translation”, in Proceedings of the 47th Annual Meeting of Association for

Computational Linguistics and the 4th International Joint Conference of Natural Language Processing

(ACL-IJCNLP 2009), Singapore, August 2009. (Full paper)

226) Hendra Setiawan, Min Yen Kan, Haizhou Li, and Philip Resnik, “Topological Ordering of

Function Words in Hierarchical Phrase-based Translation”, in Proceedings of the 47th Annual Meeting

of Association for Computational Linguistics and the 4th International Joint Conference of Natural

Language Processing (ACL-IJCNLP 2009), Singapore, August 2009. (Full paper)

227) Hui Zhang, Min Zhang, Haizhou Li, Aiti Aw, and Chew Lim Tan, “Forest-based Tree

Sequence to String Translation Model”, in Proceedings of the 47th Annual Meeting of Association for

Computational Linguistics and the 4th International Joint Conference of Natural Language Processing

(ACL-IJCNLP 2009), Singapore, August 2009. (Full paper)

228) Boxing Chen, Min Zhang, Haizhou Li, and Aiti Aw, “A Comparative Study of Hypothesis

Alignment and its Improvement for Machine Translation System Combination”, in Proceedings of the

47th Annual Meeting of Association for Computational Linguistics and the 4th International Joint

Conference of Natural Language Processing (ACL-IJCNLP 2009), Singapore, August 2009. (Full

paper)

229) Min Zhang and Haizhou Li, “Tree Kernel-based SVM with Structured Syntactic Knowledge

for BTG-based Phrase Reordering”, in Proceedings of EMNLP 2009, Singapore, August 2009.

230) Hui Zhang, Min Zhang, Haizhou Li, and Chew Lim Tan, “Fast Translation Rule Matching for

Syntax-based Statistical Machine Translation”, in Proceedings of EMNLP 2009, Singapore, August

2009.

231) Hui Zhang, Min Zhang, Chew Lim Tan, and Haizhou Li, “K-Best Combination of Syntactic

Parsers”, in Proceedings of EMNLP 2009, Singapore, August 2009.

232) Rong Tong, Bin Ma, Haizhou Li, Eng Siong Chng, and Kong-Aik Lee, “Target-Aware

Language Models for Spoken Language Recognition”, in Proceedings of INTERSPEECH 2009,

Brighton, UK, September 2009, pp. 200-203.

233) Hanwu Sun, Tin Lay Nwe, Bin Ma, and Haizhou Li, “Speaker Diarization for Meeting Room

Audio”, in Proceedings of INTERSPEECH 2009, Brighton, UK, September 2009, pp. 900-903.

234) Ling Cen, Minghui Dong, Paul Chan, and Haizhou Li, “Unit Selection Based Speech

Synthesis for Poor Channel Condition”, in Proceedings of INTERSPEECH 2009, Brighton, UK,

September 2009, pp. 2075-2078.

235) Donglai Zhu, Bin Ma, and Haizhou Li, “Large Margin Estimation of Gaussian Mixture

Model Parameters with Extended Baum-Welch for Spoken Language Recognition”, in Proceedings of

INTERSPEECH 2009, Brighton, UK, September 2009, pp. 2179-2182.

236) Omid Dehzangi, Bin Ma, Eng Siong Chng, and Haizhou Li, “Discriminative Feature

Transformation Using Output Coding for Speech Recognition”, in Proceedings of INTERSPEECH

2009, Brighton, UK, September 2009, pp. 2979-2982.

237) Khe Chai Sim and Haizhou Li, “Stream-Based Context-Sensitive Phone Mapping for Cross-

Lingual Speech Recognition”, in Proceedings of INTERSPEECH 2009, Brighton, UK, September 2009,

pp. 3019-3022.

238) Yanhua Long, Bin Ma, Haizhou Li, Wu Guo, Eng Siong Chng, and Lirong Dai, “Exploiting

Prosodic Information for Speaker Recognition”, in Proceedings of ICASSP 2009, Taipei, Taiwan, April

2009.

239) Chang Huai You, Kong Aik Lee, and Haizhou Li, “A GMM Supervector Kernel with the

Bhattacharyya Distance for SVM based Speaker Recognition”, in Proceedings of ICASSP 2009, Taipei,

Taiwan, April 2009.

240) Mohaddeseh Nosratighods, Tharmarajah Thiruvaran, Julien Epps, Eliathamby Ambikairajah,

Bin Ma, and Haizhou Li, “Evaluation of a Fused FM and Cepstral-Based Speaker Recognition System

on the NIST 2008 SRE”, in Proceedings of ICASSP 2009, Taipei, Taiwan, April 2009.

241) Hanwu Sun, Bin Ma, and Haizhou Li, “Cross-Validation of Multiple Language Recognition

Systems using Pseudo Keys”, in Proceedings of ICASSP 2009, Taipei, Taiwan, April 2009.

242) Haizhou Li, Bin Ma, Kong-Aik Lee, Hanwu Sun, Donglai Zhu, Khe Chai Sim, Changhuai

You, Rong Tong, Ismo Karkkainen, Chien-Lin Huang, Vladimir Pervouchine, Wu Guo, Yijie Li,

Lirong Dai, Mohaddeseh Nosratighods, Thiruvaran Tharmarajah, Julien Epps, Eliathamby

Ambikairajah, Eng-Siong Chng, Tanja Schultz, and Qin Jin, “The I4U System in NIST 2008 Speaker

Recognition Evaluation”, in Proceedings of ICASSP 2009, Taipei, Taiwan, April 2009.

243) Donglai Zhu, Bin Ma, and Haizhou Li, “Joint MAP Adaptation of Feature Transformation

and Gaussian Mixture Model for Speaker Recognition”, in Proceedings of ICASSP 2009, Taipei,

Taiwan, April 2009.

244) Tran Huy Dat and Haizhou Li, “Sound Event Classification based on Feature Integration,

Recursive Feature elimination and Structured Classification”, in Proceedings of ICASSP 2009, Taipei,

Taiwan, April 2009.

245) Trung Hieu Nguyen, Eng Siong Chng, and Haizhou Li, “Clustering Criterion Functions in

Spectral Subspace and Their Application in Speaker Clustering”, in Proceedings of ICASSP 2009,

Taipei, Taiwan, April 2009.

246) Tin Lay Nwe, Hanwu Sun, Haizhou Li, and Susanto Rahardja, “Speaker Diarization in

Meeting Audio”, in Proceedings of ICASSP 2009, Taipei, Taiwan, April 2009.

2008

247) Min Zhang, Hongfei Jiang, Aiti Aw, Haizhou Li, Chew Lim Tan, and Sheng Li, “A Tree

Sequence Alignment-based Tree-to-Tree Translation Model”, in Proceedings of ACL-08: HLT,

Columbus, Ohio, June 2008. (Full paper)

248) Deyi Xiong, Min Zhang Aiti Aw, and Haizhou Li, “A Linguistically Annotated Reordering

Model for BTG-based Statistical Machine Translation”, in Proceedings of ACL-08: HLT, Columbus,

Ohio, June 2008. (Short paper)

249) Boxing Chen, Min Zhang Aiti Aw, and Haizhou Li, “Exploiting N-best Hypotheses for SMT

Self-Enhancement”, in Proceedings of ACL-08: HLT, Columbus, Ohio, June 2008. (Short paper)

250) Jin-Shea Kuo and Haizhou Li, “Multi-View Co-Training of Transliteration Model”, in

Proceedings of IJCNLP 2008, Hyderabad, India, January 2008.

251) Min Zhang, Chengjie Sun, Haizhou Li, Aiti Aw, and Chew Lim Tan, “Name Origin

Recognition Using Maximum Entropy Model and Diverse Features”, in Proceedings of IJCNLP 2008,

Hyderabad, India, January 2008.

252) Jin-Shea Kuo, Haizhou Li, and Chih-Lung Lin, “Mining Transliterations from Web Query

Results: An Incremental Approach,” in Proceedings of the 6th SIGHAN Workshop, Hyderabad, India,

January 2008.

253) Min Zhang, Hongfei Jiang, Haizhou Li, Aiti Aw, and Sheng Li, “Grammar Comparison

Study for Translational Equivalence Modeling and Statistical Machine Translation”, in Proceedings of

COLING2008, Manchester, UK, August 2008.

254) Boxing Chen, Min Zhang, Aiti Aw, and Haizhou Li, “Regenerating Hypotheses for Statistical

Machine Translation”, in Proceedings of COLING2008, Manchester, UK, August 2008.

255) Deyi Xiong, Min Zhang, Aiti Aw, and Haizhou Li, “Linguistically Annotated BTG for

Statistical Machine Translation”, in Proceedings of COLING2008, Manchester, UK, August 2008.

256) Tee Kiah Chia, Khe Chai Sim, Haizhou Li, and Hwee Tou Ng, “A Lattice-Based Approach to

Query-by-Example Spoken Document Retrieval”, in Proceedings of the 31st Annual International

ACM SIGIR Conference on Research & Development on Information Retrieval, Singapore, July 2008.

(Full paper)

257) Rong Tong, Bin Ma, Haizhou Li, and Eng-Siong Chng, “Target-Oriented Phone Selection

from Universal Phone Set for Spoken Language Recognition”, in Proceedings of INTERSPEECH 2008,

Brisbane, Australia, September 2008.

258) Donglai Zhu, Bin Ma, and Haizhou Li, “Using MAP Estimation of Feature Transformation

For Speaker Recognition”, in Proceedings of INTERSPEECH 2008, Brisbane, Australia, September

2008.

259) Chien-Lin Huang, Bin Ma, Chung-Hsien Wu, Brian Mak, and Haizhou Li, “Robust Speaker

Verification Using Short-Time Frequency with Long-Time Window and Fusion of Multi-Resolutions”,

in Proceedings of INTERSPEECH 2008, Brisbane, Australia, September 2008.

260) Tin Lay Nwe, Minghui Dong, Swe Zin Kalayar Khine, and Haizhou Li, “Multi-Speaker

Meeting Audio Segmentation”, in Proceedings of INTERSPEECH 2008, Brisbane, Australia,

September 2008.

261) Swe Zin Kalayar Khine, Tin Lay Nwe, and Haizhou Li, “Speech/Laughter Classification in

Meeting Audio”, in Proceedings of INTERSPEECH 2008, Brisbane, Australia, September 2008.

262) Tran Huy Dat and Haizhou Li, “Speaker Identification in Noise Mismatch Conditions based

on Jump Function Kolmogorov Analysis in Wavelet Domain”, in Proceedings of INTERSPEECH 2008,

Brisbane, Australia, September 2008.

263) Kong-Aik Lee, Changhuai You, Haizhou Li, Tomi Kinnunen, and Donglai Zhu,

“Characterizing Speech Utterances for Speaker Verification with Sequence Kernel SVM”, in

Proceedings of INTERSPEECH 2008, Brisbane, Australia, September 2008.

264) Namunu Maddage and Haizhou Li, “Rhythm Based Music Segmentation and Octave Scale

Cepstral Features for Sung Language Recognition”, in Proceedings of INTERSPEECH 2008, Brisbane,

Australia, September 2008.

265) Tran Hieu Nguyen , Eng Siong Chng, and Haizhou Li, “T-Test Distance and Clustering

Criterion for Speaker Diarization”, in Proceedings of INTERSPEECH 2008, Brisbane, Australia,

September 2008.

266) Khe Chai Sim and Haizhou Li, “Context-sensitive Probabilistic Phone Mapping Model for

Cross-lingual Speech Recognition”, in Proceedings of INTERSPEECH 2008, Brisbane, Australia,

September 2008.

267) Rong Tong, Bin Ma, Haizhou Li, and Eng Siong Chng, “Target-Oriented Phone Tokenizers

For Spoken Language Recognition”, in Proceedings of ICASSP 2008, Las Vegas, Nevada, March-

April 2008.

268) Donglai Zhu, Haizhou Li, Bin Ma, and Chin-Hui Lee, “Discriminative Learning For

Optimizing Detection Performance In Spoken Language Recognition”, in Proceedings of ICASSP 2008,

Las Vegas, Nevada, March- April 2008.

269) Tin Lay Nwe and Haizhou Li, “On Fusion Of Timbre-Motivated Features For Singing Voice

Detection And Singer Identification”, in Proceedings of ICASSP 2008, Las Vegas, Nevada, March-

April 2008.

270) Swe Zin Kalayar Khine, Tin Lay Nwe, and Haizhou Li, “Singing Voice Detection In Pop

Songs Using Co-Training Algorithm”, in Proceedings of ICASSP 2008, Las Vegas, Nevada, March-

April 2008.

271) Khe Chai Sim and Haizhou Li, “Robust Phone Set Mapping Using Decision Tree Clustering

For Cross-Lingual Phone Recognition”, in Proceedings of ICASSP 2008, Las Vegas, Nevada, March-

April 2008.

272) Kong-Aik Lee, Changhuai You, and Haizhou Li, “Spoken Language Recognition Using

Support Vector Machines With Generative Front-End”, in Proceedings of ICASSP 2008, Las Vegas,

Nevada, March- April 2008.

273) Tran Huy Dat and Haizhou Li, “Jump Function Komogorov And Its Application For Audio

Stream”, in Proceedings of ICASSP 2008, Las Vegas, Nevada, March- April 2008.

274) Chien-Lin Huang, Chung-Hsien Wu, Chia-Hsin Hsieh, Haizhou Li, and Bin Ma,

“Unsupervised Pronunciation Grammar Growing using Knowledge-based and Data-Driven

Approaches”, in Proceedings of ICME 2008, Hannover, Germany, June 2008.

275) Chang Huai You, Susanto Rahardja, and Haizhou Li, “Speech Enhancement for Telephony

Name Speech Recognition”, in Proceedings of ICME 2008, Hannover, Germany, June 2008.

276) Boxing Chen, Deyi Xiong, Min Zhang, Aiti Aw, and Haizhou Li, “I2R Multi-Pass Machine

Translation System for IWSLT 2008”, in Proceedings of IWSLT 2008, Hawaii, USA, 2008, pp.46-51.

277) Omid Dehzangi, Bin Ma, Eng Siong Chng, and Haizhou Li, “Fuzzy Rule Selection using

Iterative Rule Learning for Speech Data Classification”, in Proceedings of the International

Conference on Pattern Recognition 2008 (ICPR 2008), Tampa, Florida, December 2008.

278) Eugene Chin Wei Koh, Hanwu Sun, Tin Lay Nwe, Trung Hieu Nguyen, Bin Ma, Eng-

Siong Chng, Haizhou Li, and Susanto Rahardja, “Speaker Diarization Using Direction of Arrival

Estimate and Acoustic Feature Information: The I2R-NTU Submission for the NIST RT 2007

Evaluation”, in Lecture Notes of Computer Science Vol. 4625, Multimodal Technologies for Perception

of Humans, Springer 2008, pp.484-496.

2007

279) Haizhou Li, Khe Chai Sim, Jin-Shea Kuo, and Minghui Dong, “Semantic Transliteration of

Personal Names”, in Proceedings of ACL 2007, Prague, Czech Republic, June 2007, pp. 120-127.

280) Hendra Setiawan, Min-Yen Kan, and Haizhou Li, “Ordering Phrases with Function Words”,

The in Proceedings of ACL 2007, Prague, Czech Republic, June 2007, pp. 712-719.

281) Tee Kiah Chia, Haizhou Li, and Hwee Tou Ng, “A Statistical Language Modeling Approach

to Lattice-based Spoken Document Retrieval”, in Proceedings of the Joint Meeting Conference on

Empirical Methods in Natural Language Processing, and Conference on Computational Natural

Language Learning(EMNLP-CoNLL 2007), Prague, Czech Republic, June 2007.

282) Bin Ma, Rong Tong, and Haizhou Li, “Discriminative Vector for Spoken Language

Recognition”, in Proceedings of ICASSP 2007, Hawaii, USA, April 2007.

283) Rong Tong, Haizhou Li, Bin Ma, and Eng Siong Chng, “Spoken Language Recognition with

Relevance Feedback”, in Proceedings of ICASSP 2007, Hawaii, USA, April 2007.

284) Donglai Zhu, Bin Ma, Haizhou Li, and Qiang Huo, “A Generalized Feature Transformation

Approach for Channel Robust Speaker Verification”, in Proceedings of ICASSP 2007, Hawaii, USA,

April 2007.

285) Xiong Xiao, Eng Siong Chng, and Haizhou Li, “Normalizing the Speech Modulation

Spectrum for Robust Speech Recognition”, in Proceedings of ICASSP 2007, Hawaii, USA, April 2007.

286) Kong Aik Kee, Changhuai You, Haizhou Li, and Tomi Kinnunen, “A GMM-based

Probabilistic Sequence Kernel for Speaker Verification”, in Proceedings of INTERSPEECH 2007,

Antwerp, Belgium, August 2007.

287) Eugene Chin Wei Koh, Hanwu Sun, Tin Lay Nwe, Trung Hieu Nguyen, Bin Ma, Eng-Siong

Chng, Haizhou Li, and Susanto Rahardja, “Using Direction of Arrival Estimate and Acoustic Feature

Information in Speaker Diarization”, in Proceedings of INTERSPEECH 2007, Antwerp, Belgium,

August 2007.

288) Khe Chai Sim and Haizhou Li, “Fusion of Contrastive Acoustic Models for Parallel

Phonotactic Spoken Language Identification”, in Proceedings of INTERSPEECH 2007, Antwerp,

Belgium, August 2007.

289) Xiong Xiao, Eng Siong Chng, and Haizhou Li, “Evaluating the Temporal Structure

Normalisation Technique on the Aurora-4 Task”, in Proceedings of INTERSPEECH 2007, Antwerp,

Belgium, August 2007.

290) Tin Lay Nwe and Haizhou Li, “Singing Voice Detection using Perceptually-Motivated

Features”, in Proceedings of ACM Multimedia Conference 2007, Augsburg, Germany, September 2007.

291) Lei Wang, Eng Siong Chng, and Haizhou Li, “A vector-based approach to broadcast audio

database indexing and retrieval”, in Proceedings of ICME 2007, Beijing, China, July 2007.

2006

292) Jin-Shea Kuo, Haizhou Li, and Ying-Kuei Yang, “Learning Transliteration Lexicons from the

Web”, in Proceedings of the 44th Annual Meeting of Association for Computational Linguistics

(COLING-ACL 2006), Sydney, Australia, July 2006, pp. 1129 – 1136.

293) Namunu Maddage, Haizhou Li, and Mohan Kankanhalli, “Music Structure based Vector

Space Retrieval”, in Proceedings of the 29th Annual International ACM SIGIR Conference on

Research & Development on Information Retrieval (SIGIR 2006), Seattle, Washington, August 2006,

pp. 67-74. (Full paper)

294) Shuanhu Bai and Haizhou Li, “Bayesian Learning of N-gram statistical Language Modeling”,

in Proceedings of ICASSP 2006, Toulouse, France, May 2006.

295) Haizhou Li and Tin Lay Nwe, “Vibrato-Motivated Acoustic Features for Singer

Identification”, in Proceedings of ICASSP 2006, Toulouse, France, May 2006.

296) Rong Tong, Bin Ma, Donglai Zhu, Haizhou Li, and Eng Siong Chng, “Integrating Acoustic,

Prosodic and Phonotactic features for Spoken language identification”, in Proceedings of ICASSP 2006,

Toulouse, France, May 2006.

297) Tin Lay Nwe, Haizhou Li, and Minghui Dong, “Analysis and Detection of Speech under

Sleep Deprivation”, in Proceedings of INTERSPEECH 2006, Pittsburgh, USA, September 2006.

298) Haizhou Li, Bin Ma, and Rong Tong, “Vector-Based Spoken Language Recognition using

Output Coding”, in Proceedings of INTERSPEECH 2006, Pittsburgh, USA, September 2006.

299) Minghui Dong, Haizhou Li, and Tin Lay Nwe, “Evaluating Prosody of Mandarin Speech for

Language Learning”, in Proceedings of INTERSPEECH 2006, Pittsburgh, USA, September 2006.

300) Ma Bin, Donglai Zhu, Rong Tong, and Haizhou Li, “Speaker Cluster based GMM

Tokenization for Speaker Recognition”, in Proceedings of INTERSPEECH 2006, Pittsburgh, USA,

September 2006.

301) Denny Iskandar, Ye Wang, Min -Yen Kan, and Haizhou Li, “Syllabic Level Automatic

Synchronization of Music Signals and Text Lyrics”, in Proceedings of the ACM Multimedia

Conference 2006, Santa Barbara, USA, October 2006.

302) Namunu C Maddage, Mohan S. Kankanhalli, and Haizhou Li, “A Hirarchical Approach for

Music Chord Modeling based on the Analysis of Tonal Characteristics”, in Proceedings of ICME 2006,

Toronto, Canada, July 2006.

303) Jinyu Li, Sibel Yaman, Chin-Hui Lee, Bin Ma, Rong Tong, Donglai Zhu, and Haizhou Li,

“Language Recognition Based on Score Distribution Feature Vectors and Discriminative Classifier

Fusion”, in Proceedings of the IEEE Odyssey 2006 - The Speaker and Language Recognition

Workshop, San Juan, Puerto Rico, June 2006.

2005

304) Min Zhang, Haizhou Li, Jian Su, and Hendra Setiawan, “A Phrase-based Context-dependent

Joint Probability”, in Proceedings of IJCNLP 2005, Jeju, South Korea, October 2005.

305) Hendra Setiawan, Haizhou Li, Min Zhang, and Beng Chin Ooi, “Phrase-based Statistical

Machine Translation: A Level of Detail Approach”, in Proceedings of IJCNLP 2005, Jeju, South Korea,

October 2005.

306) Haizhou Li and Bin Ma, “A Phonotactic Language Model for Spoken Language

Identification”, in Proceedings of ACL 2005, Ann Arbor, USA, June 2005, pp. 515-522.

307) Bin Ma and Haizhou Li, “A Phonotactic-Semantic Paradigm for Automatic Spoken

Document Classification”, in Proceedings of the 28th Annual International ACM SIGIR Conference

(SIGIR 2005), Salvador, Brazil, August 2005, pp. 369-376. (Full paper)

308) Tin Lay Nwe and Haizhou Li, “Broadcast News Segmentation by Audio Type Analysis”, in

Proceedings of ICASSP 2005, Philadelphia, PA, March 2005.

309) Boon Pang Lim, Haizhou Li, and Bin Ma, “Using Local and Global Phonotactical Features in

Chinese Dialect Identification”, in Proceedings of ICASSP 2005, Philadelphia, PA, March 2005.

310) Santhosh C. Kumar, V.P. Mohandas, and Haizhou Li, “Multilingual Speech Recognition: A

Unified Approach”, in Proceedings of INTERSPEECH 2005 - Eurospeech - 9th European Conference

on Speech Communication and Technology, Lisboa, Portugal, September 2005.

311) Tin Lay Nwe and Haizhou Li, “Identifying Singers of Popular Songs”, in Proceedings of

INTERSPEECH 2005 - Eurospeech - 9th European Conference on Speech Communication and

Technology, Lisboa, Portugal, September 2005.

312) Minghui Dong, Kim-Teng Lua, and Haizhou Li, “A Probabilistic Approach to Prosodic Word

Prediction for Mandarin Chinese TTS”, in Proceedings of INTERSPEECH 2005 - Eurospeech - 9th

European Conference on Speech Communication and Technology, Lisboa, Portugal, September 2005.

313) Sheng Gao, Bin Ma, Haizhou Li, and Chin-Hui Lee, “A Text Categorization Approach to

Automatic Language Identification”, in Proceedings of INTERSPEECH 2005 - Eurospeech - 9th


314) Bin Ma, Haizhou Li, and Chin-Hui Lee, “An Acoustic Segment Modeling Approach to

Automatic Language Identification”, in Proceedings of INTERSPEECH 2005 - Eurospeech - 9th


315) Minghui Dong, Kim Teng Lua, and Haizhou Li, “A Unit Selection based Speech Synthesis

Approach for Chinese Mandarin Text-to-Speech”, in Proceedings of the International Conference on

Chinese Computing 2005 (ICCC 2005), Singapore, March 2005.

316) Bin Ma and Haizhou Li, “Spoken Language Identification Using Bag-of-Sounds”, in

Proceedings of the International Conference on Chinese Computing 2005 (ICCC 2005), Singapore,

March 2005.

317) Manickam K and Haizhou Li, “Complexity Analysis of Normal and Deaf Infant Cry Acoustic

Waves”, in Proceedings of the 4th International Workshop on Model and Analysis of Vocal Emission

for Biomedical Applications (MAVEBA 2005), Florence, Italy, 2005.

318) Boon Pang Lim, Bin Ma, and Haizhou Li, “Using Semantic Context to Improve Voice

Keyword Mining”, in Proceedings of the International Conference on Chinese Computing 2005 (ICCC

2005), Singapore, March 2005.

2004

319) Haizhou Li, Min Zhang, and Jian Su, “A Joint Source-Channel Model for Machine

Transliteration”, in Proceedings of ACL 2004, Barcelona, Spain, July 2004, pp. 160-167.

320) Min Zhang, Haizhou Li, and Jian Su, “Direct Orthographical Mapping for Machine

Transliteration”, in Proceedings of the 20th International Conference on Computational Linguistics

(COLING2004), Geneva, Switzerland, August 2004.

321) Jun Xu, Guohong Fu, and Haizhou Li, “Grapheme-to-Phoneme Conversion for Chinese Text-

to-Speech Session Code”, in Proceedings of INTERSPEECH-ICSLP 2004, Jeju Island, Korea, October

2004.

322) Boon Pang Lim, Haizhou Li, and Yu Chen, “Language Identification through Large

Vocabulary Continuous Speech Recognition”, in Proceedings of ISCSLP 2004, Hong Kong, December

2004.

323) Yeow Kee Tan, Boon Seong Teoh, and Haizhou Li, “A Grapheme to Phoneme Conversion

for Standard Malay”, in Proceedings of ICSLT-O-COCOSDA 2004, New Delhi, India, November 2004.

324) C. S. Kumar and Haizhou Li, “Language identification System for Multilingual Speech

Recognition Systems”, in Proceedings of the 9th International Conference Speech and Computer

(SPECOM 2004), St. Petersburg, Russia, September 2004.

Documents

Recent Publications Haizhou Li - COLIPSeleliha/Publications - Haizhou Li.pdf · · 2017-12-31Recent Publications – Haizhou Li Patents 1) US Patent number 6311152, ... Marcello