Faculty

Wentao Gu Professor

Academic Area: 
Institute of Linguistic Science and Technology
Research Interests: 
  • Speech Sciences
  • Technologies
  • Speech Prosody
Bio: 

Education Background

PhD, Electronic Engineering, Shanghai Jiaotong University (1999)

MA, Electronic Engineering, Shanghai Jiaotong University (1996)

BA, Electronic Engineering, Shanghai Jiaotong University (1993)

 

Professional Experience

2013/06-2014:

Guest Researcher, The University of Tokyo, JAPAN

2008/08-Present:

Professor, Nanjing Normal University, CHINA

2006/08-2008/08:

Research Associate, The Chinese University of Hong Kong, HONG KONG

2004/07-2006/07:

JSPS Postdoctoral Research Fellow, The University of Tokyo, JAPAN

2001/10-2003/10:

Postdoctoral Researcher, The University of Tokyo, JAPAN

1999/08-2005/10:

Lecturer, Shanghai Jiaotong University, CHINA

1997/09-1998/03:

Visiting Researcher, Bell Laboratories, Lucent Technologies, NJ, USA

 

Services in Professional Societies

Associate Editor, Phonetica, 2014-

Guest Editor, Journal of Chinese Linguistics Monograph “Studies on Tonal Aspects of Languages”

General Chair, the Third International Symposium on Tonal Aspects of Languages (www.TAL2012.org), 2012

General Chair, National Workshop on Phonetic Laboratory Construction, 2011

 

Referee for International Journals

Phonetica

Frontiers in Behavioral Neuroscience

Journal of the Acoustical Society of America

Speech Communication

IEICE Transactions on Information and Systems

 

Referee for Grant Proposals

National Social Science Foundation of China

Humanity and Social Science Foundation, the Ministry of Education of China

China Postdoctoral Science Foundation

 

Research Grants

Research Grants as Principal Investigator

2013-2014:

Cross-linguistic analysis, modeling, and perception of attitudinal prosody in speech communication (NICT International Exchange Program, Japan)

2013-2018:

Cross-linguistic and cross-cultural study on the production and perception of speech conveying social affects (Major Program for the National Social Science Fund of China)

2010-2014:

Cross-linguistic comparison of speech prosody, and error analysis and automatic assessment of second language prosody

(Key project, Jiangsu Higher Institutions’ Key Research Base for Philosophy and Social Sciences)

2010-2013:

Prosodic comparison between Mandarin and Cantonese and prosodic analysis of their L2 speech

(National Social Science Fund of China)

2009-2012:

Prosodic analysis for non-native Chinese speech

(Jiangsu Province Social Science Fund)

2008-2012:

Prosodic variation in language contact and its applications in speech synthesis and computer-aided L2 learning

(Nanjing Normal University)

2008-2011:

Working platform for phonetic science and speech technology

(Sub-project of the National “211” Project for Key Disciplines at Nanjing Normal University)

2004-2006:

Analysis and formulation of Chinese F0 contours and development of their synthesis method

(Japan Society for the Promotion of Science, Japan)

2004-2006:

Analysis, modeling, and synthesis of fundamental frequency contours of Mandarin speech

(Ministry of Education of China)

 

Research Grants as Co-Investigator

2012-2014:

EOG and ERP study for syntactic priming mechanism in Chinese sentence comprehension

(National Natural Science Foundation of China)

2012-2015:

Corpus-based automatic assessment of spoken English by Chinese EFL learners

(National Social Science Fund of China)

2009-2012:

Pragmatic analysis of adverbs and the related phonetic study

(National Social Science Fund of China)

2011-2014:

Acoustic modeling and perceptual experiments for tones of Chinese dialects in Jiangsu Province

(Humanity and Social Science Foundation, the Ministry of Education of China)

2011-2014:

Corpus construction and study for spoken German by Chinese undergraduate learners

(Shanghai Social Science Fund)

2010-2013:

Study of tones for Chinese dialects in Jiangsu Province

(Jiangsu Province Social Science Fund)

2006-2008:

Information retrieval from mixed-language spoken documents

(Shun Hing Institute of Advanced Engineering, The Chinese University of Hong Kong, HK)

2006-2007:

Prosody analysis and modeling for Cantonese natural speech synthesis

(Hong Kong Research Grants Council, HK)

 

Publications and Presentations (before 2014)

Books

1. Gu, W. (2013). Experimental Analysis and Quantitative Modeling of Speech Prosody. Beijing: World Publishing Corporation. (ISBN: 978-7-5100-5652-9)

Edited Publications

2. Gu, W. (ed.) (2013). Journal of Chinese Linguistics Monograph “Studies on Tonal Aspects of Languages”, to be published soon.

3. Gu, W. (ed.) (2012). Proceedings of the Third International Symposium on Tonal Aspects of Languages. ISCA Online Archive: http://www.isca-speech.org/archive/tal_2012/.

Invited Talks

4. Gu, W. (2012). “Quantitative analysis and modeling of tonal and intonational variations on different layers.” Tutorial at the Third International Symposium on Tonal Aspects of Languages (TAL 2012), Nanjing, China.

Journal Articles

5. Lu, Y., Aubergé, V., Rilliard, A., and Gu, W. (2013). Perceptual study for Mandarin attitudinal speech. Journal of School of Chinese Language and Culture, Nanjing Normal University 3. (In Chinese)

6. Zhang, T. and Gu, W.* (2011). Corpus design and prosody study for Mandarin affective speech. Journal of School of Chinese Language and Culture, Nanjing Normal University 3: 33-44. (In Chinese)

7. Gu, W. and Lee, T. (2009). Effects of tone and emphatic focus on F0 contours of Cantonese speech: A comparison with Standard Chinese. Chinese Journal of Phonetics 2: 133-147.

8. Gu, W. and Fujisaki, H. (2008). Prosodic structure of spoken Mandarin: A comparison of perception-based prosodic boundaries and model-based analysis of phrasing. Chinese Journal of Phonetics 1: 188-195. (In Chinese)

9. Gu, W., Hirose, K., and Fujisaki, H. (2007). Analysis of tones in Cantonese speech based on the command-response model. Phonetica 64 (1): 29-62.

10. Gu, W., Hirose, K., and Fujisaki, H. (2006). Modeling the effects of emphasis and question on fundamental frequency contours of Cantonese utterances. IEEE Trans. Audio, Speech and Language Processing 14 (4): 1155-1170.

11. Fujisaki, H., Wang, C., Ohno, S., and Gu, W.* (2005). Analysis and synthesis of fundamental frequency contours of Standard Chinese using the command-response model. Speech Communication 47 (1-2): 59-70.

12. Gu, W., Hirose, K., and Fujisaki, H. (2004). Automatic extraction of tone command parameters for the model of F0 contour generation for Standard Chinese. IEICE Trans. Information and Systems E87-D (5): 1079-1085.

13. Gu, W. (1999). A modified greedy algorithm for optimal text selection. Journal of Shanghai Jiaotong University 33 (1): 96-100. (In Chinese)

14. Gu, W. and Zheng, Z. (1997). Associative memory and neural network approach for motion estimation in HDTV image coding. Journal of Shanghai Jiaotong University 31 (12): 24-28. (In Chinese)

Book Chapters

15. Gu, W. and Fujisaki, H. (2013). Data acquisition and prosodic analysis for Mandarin attitudinal speech. In: G. Peng and F. Shi (eds.), East Flows the Great River: Festschrift in Honor of Prof. William S-Y. Wang’s 80th Birthday, Hong Kong: City University of Hong Kong Press, pp. 483-500.

16. Fujisaki, H., Gu, W., and Ohno, S. (2007). Physiological and physical bases of the command-response model for generating fundamental frequency contours in tone languages: Implications for the phonology of tones. In: M.-J. Solé, P. Beddor, and M. Ohala (eds.), Experimental Approaches to Phonology, Oxford: Oxford University Press, pp. 228-245.

Peer-reviewed International Conference Papers

17. Luo, D., Gu, W., and Tsurutani, C. (2013). The roles of prosodic features in evaluating the naturalness of Mandarin L2 speech. Proc. the 16th Oriental COCOSDA, KIIT Gurgaon, India.

18. Gu, W., Wang, F., and Liang, D. (2013). Prosody of Mandarin affective speech by mentally retarded children. Proc. WASSS 2013, Grenoble, France.

19. Wang, T., Ding, H., and Gu, W. (2012). Perceptual study for emotional speech of Mandarin Chinese. Proc. Speech Prosody 2012, pp. 653-656, Shanghai, China.

20. Lu, Y., Aubergé, V., Rilliard, A., and Gu, W. (2012). Prosodic cross-linguistic perception of social affects in Mandarin Chinese by native, French and Vietnamese listeners. Proc. International Conference of Gruppo di Studio sulla Comunicazione Parlata 2012, pp. 141-145, Belo Horizonte, Brazil.

21. Gu, W., Zhang, T., and Fujisaki, H. (2011). Prosodic analysis and perception of Mandarin utterances conveying attitudes. Proc. INTERSPEECH 2011, pp. 1069-1072, Florence, Italy.

22. Gu, W., Lee, T., and Ching, P.C. (2008). Prosodic variation in Cantonese-English code-mixed speech. Proc. ISCSLP 2008, pp. 342-345, Kunming, China.

23. Gu, W., Ho, R. S.-C., and Lee, T. (2007). Modeling tones in Hakka on the basis of the command-response model. Proc. INTERSPEECH 2007, pp. 2633-2636, Antwerp, Belgium.

24. Hirano, H., Hirose, K., Kawai, G., Gu, W., and Minematsu, N. (2007). F0 models show Chinese speakers of Japanese insert intonational boundaries and drop pitch. Proc. INTERSPEECH 2007, pp. 1885-1888, Antwerp, Belgium.

25. Gu, W. and Lee, T. (2007). Quantitative analysis of F0 contours of emotional speech of Mandarin. Proc. 6th ISCA Speech Synthesis Workshop, pp. 228-233, Bonn, Germany.

26. Gu, W. and Lee, T. (2007). Effects of tonal context and focus on Cantonese F0. Proc. 16th ICPhS, pp. 1033-1036, Saarbrücken, Germany.

27. Gu, W. and Lee, T. (2007). Effects of focus on prosody of Cantonese speech: A comparison of surface feature analysis and model-based analysis. Proc. ParaLing’07, pp. 59-64, Saarbrücken, Germany.

28. Sun, Q., Hirose, K., Gu, W., and Minematsu, N. (2006). Analysis on the effects of tonal co-articulation at word and non-word syllable boundaries of Mandarin based on the tone nucleus model. ASA & ASJ Joint Meeting, Hawaii.

29. Gu, W., Hirose, K., and Fujisaki, H. (2006). Comparison of perceived prosodic boundaries and global characteristics of voice fundamental frequency contours in Mandarin speech. Proc. ISCSLP 2006 (Q. Huo et al., Eds.: LNAI 4274, Springer), pp. 31-42, Singapore.

30. Ng, R.W.-M., Lee, T., and Gu, W. (2006). Towards automatic parameter extraction of command-response model for Cantonese. Proc. INTERSPEECH 2006, pp. 2358-2361, Pittsburgh, PA.

31. Gu, W., Hirose, K., and Fujisaki, H. (2006). A general approach for automatic extraction of tone commands in the command-response model for tone languages. Proc. Speech Prosody 2006, pp. 153-156, Dresden, Germany.

32. Gu, W., Hirose, K., and Fujisaki, H. (2006). The effect of paralinguistic emphasis on F0 contours of Cantonese speech. Proc. Speech Prosody 2006, pp. 430-433, Dresden, Germany.

33. Wang, X., Gu, W., Hirose, K., Sun, Q., and Minematsu, N. (2006). Comparison of tonal co-articulation between intra- and inter-word disyllables in Mandarin. Proc. Speech Prosody 2006, pp. 157-160, Dresden, Germany.

34. Sun, Q., Hirose, K., Gu, W., and Minematsu, N. (2006). Rule-based generation of phrase components in two-step synthesis of fundamental frequency contours of Mandarin. Proc. Speech Prosody 2006, pp. 561-564, Dresden, Germany.

35. Fujisaki, H. and Gu, W. (2006). Phonological representation of tone systems of some tone languages based on the command-response model for F0 contour generation. Proc. TAL 2006, pp. 59-62, La Rochelle, France.

36. Gu, W., Hirose, K., and Fujisaki, H. (2006). A comparative study between intonation question and particle question in Cantonese on their realization of F0 contours. Proc. TAL 2006, pp. 63-66, La Rochelle, France.

37. Gu, W., Hirose, K., and Fujisaki, H. (2006). Modeling the tones in Suzhou and Wujiang dialects on the basis of the command-response model for the process of F0 contour generation. Proc. TAL 2006, pp. 67-70, La Rochelle, France.

38. Gu, W., Hirose, K., and Fujisaki, H. (2005). Analysis of the effects of word emphasis and echo question on F0 contours of Cantonese utterances. Proc. INTERSPEECH 2005, pp. 1825-1828, Lisbon, Portugal.

39. Sun, Q., Hirose, K., Gu, W., and Minematsu, N. (2005). Generation of fundamental frequency contours for Mandarin speech synthesis based on tone nucleus model. Proc. INTERSPEECH 2005, pp. 3265-3268, Lisbon, Portugal.

40. Gu, W., Hirose, K., and Fujisaki, H. (2005). Identification and synthesis of Cantonese tones based on the command-response model for F0 contour generation. Proc. ICASSP 2005, pp. 289-292, Philadelphia, PA.

41. Gu, W., Hirose, K., and Fujisaki, H. (2004). Analysis of Shanghainese F0 contours based on the command-response model. Proc. ISCSLP 2004, pp. 81-84, Hong Kong.

42. Gu, W., Hirose, K., and Fujisaki, H. (2004). Analysis and synthesis of Cantonese F0 contours based on the command-response model. Proc. ISCSLP 2004, pp. 185-188, Hong Kong.

43. Gu, W., Hirose, K., and Fujisaki, H. (2004). Analysis of F0 contours of Cantonese utterances based on the command-response model. Proc. INTERSPEECH 2004, pp. 781-784, Jeju Island, Korea.

44. Fujisaki, H., Gu, W., and Hirose, K. (2004). The command-response model for the generation of F0 contours of Cantonese utterances. Proc. ICSP 2004, pp. 655-658, Beijing, China.

45. Gu, W., Fujisaki, H., and Hirose, K. (2004). Analysis of fundamental frequency contours of Cantonese based on a command-response model. Proc. 5th ISCA Speech Synthesis Workshop, pp. 227-228, Pittsburgh, PA.

46. Fujisaki, H., Ohno, S., and Gu, W. (2004). Physiological and physical mechanisms for fundamental frequency control in some tone languages and a command-response model for generation of their F0 contours. Proc. TAL 2004, pp. 61-64, Beijing, China.

47. Gu, W., Hirose, K., and Fujisaki, H. (2004). A method for automatic tone command parameter extraction for the model of F0 contour generation for Mandarin. Proc. Speech Prosody 2004, pp. 435-438, Nara, Japan.

48. Gu, W., Hirose, K., and Fujisaki, H. (2003). A method for automatic extraction of F0 contour generation process model parameters for Mandarin. Proc. IEEE Workshop on Automatic Speech Recognition and Understanding 2003, St. Thomas, U.S. Virgin Islands.

49. Gu, W. and Hirose, K. (2003). Acoustic model selection and voice quality assessment for HMM-based Mandarin speech synthesis. Proc. INTERSPEECH 2003, pp. 2457-2460, Geneva, Switzerland.

50. Zhao, F., Raghavan, P., Gupta, S.K., Lu, Z., and Gu, W. (2000). Automatic speech recognition in Mandarin for embedded platforms. Proc. ICSLP 2000, vol. 2, pp. 815-818, Beijing, China.

51. Gu, W., Shih, C., and van Santen, J.P.H. (1999). An efficient speaker adaptation method for TTS duration model. Proc. Eurospeech 1999, pp. 1839-1842, Budapest, Hungary.

52. Shih, C., Gu, W., and van Santen, J.P.H. (1998). Efficient adaptation of TTS duration model to new speakers. Proc. ICSLP 1998, vol. 2, pp. 25-28, Sydney, Australia.

53. Shih, C., Gu, W., and van Santen, J.P.H. (1998). Efficient adaptation of TTS duration model to new speakers. Proc. ESCA/COCOSDA The 3rd International Workshop on Speech Synthesis, Jenolan Caves, Australia.

 

Publications and Presentations (since 2014)

Journal Articles

1. Gu, W. (2016). Errors in fundamental frequencies for L2 Mandarin speech by Cantonese and English learners. Journal of Tsinghua University (Science and Technology), accepted. (In Chinese)

2. Li, S. and Gu, W.* (2016). Acoustic characteristics of Mandarin affricates. Journal of Tsinghua University (Science and Technology), accepted. (In Chinese)

3. Chen, Q., Gu, W.*, and Scheepers, C. (2016). Effects of text segmentation on silent reading of Chinese regulated poems: Evidence from eye movements. Journal of Chinese Linguistics 44(2).

4. Gu, W. (2016). Tone and intonation in Mandarin utterances by HK Cantonese L2 learners. Journal of School of Chinese Language and Culture, Nanjing Normal University 2. (In Chinese)

5. Xu, B., Ci, J., and Gu, W. (2015). Analysis of tonal errors in Japanese by Chinese learners [Zhongguoren xuexi riyu de shengdiao pianwu fenxi]. Study on Japan 7: 90-96.

6. Gu, W., Liu, X., and Hirose, K. (2014). Acoustic study for rhythmic patterns of Mandarin as a second language. Journal of School of Chinese Language and Culture, Nanjing Normal University 3: 169-175. (In Chinese)

Peer-reviewed International Conference Papers

7. Li, X., Kager, R., and Gu, W. (2016). Surface vs. underlying listening strategies for cross-language listeners in the perception of sandhied tones in the Nanjing dialect. Proc. TAL 2016, Buffalo, NY.

8. Prom-on, S., Xu, Y., Gu, W., Arvaniti, A., Nam, H., and Whalen, D. H. (2016). The Common prosody platform (CPP) — where theories of prosody can be directly compared. Proc. Speech Prosody 2016, Boston, MA.

9. Tang, P., Liu, L., Li, S., and Gu, W. (2015). Cross-linguistic perception of Chinese attitudes praising and blaming. Proc. Oriental COCOSDA 2015, Shanghai, China.

10. Gu, W., Tang, P., Hirose, K., and Aubergé, V. (2015). Crosslinguistic comparison on the perception of Mandarin attitudinal speech. Proc. INTERSPEECH 2015, Dresden, Germany.

11. Li, S. and Gu, W. (2015). Acoustic analysis of Mandarin affricates. Proc. INTERSPEECH 2015, Dresden, Germany.

12. Gu, W. and Liu, L. (2015). Declarative and interrogative Mandarin intonation by native speakers and Cantonese L2 learners. Proc. SLaTE 2015, Leipzig, Germany, pp. 41-46.

13. Gu, W. (2015). Tone, intonation and emphatic stress in L2 Mandarin speech by English and Cantonese learners. Proc. ICPhS 2015, Glasgow, UK.

14. Tang, P. and Gu, W. (2015). Perceptual experiment and acoustic analysis of Chinese attitudes: A preliminary study. Proc. ICPhS 2015, Glasgow, UK.

15. Gu, W. (2014). Stress, tone and intonation in L2 Mandarin speech by English and Cantonese learners. Workshop on the Role of Prosody in Language Learning: Stress, Tone and Intonation, Sydney, Australia.

16. Gu, W., Zhang, T., and Tsurutani, C. (2014). Segmental and tonal errors in L2 Mandarin speech produced by Australian English learners. Proc. Australian International Speech Science and Technology Conference 2014, Christchurch, New Zealand, pp. 50-53.

17. Gu, W., Tsurutani, C., and Zhang, T. (2014). Prosodic characteristics in Mandarin polite speech by native and non-native speakers. AILA World Congress 2014, Brisbane, Australia.

18. Gu, W. and Hirose, K. (2014). Rhythmic patterns in native and nonnative Mandarin speech. Proc. Speech Prosody 2014, Dublin, Ireland, pp. 592-596.

19. Gu, W., Hirose, K., and Fujisaki, H. (2014). Prosodic patterns of Mandarin disyllabic words by Japanese learners. Proc. TAL 2014, Nijmegen, The Netherlands.

20. Chen, Q. and Gu, W. (2014). Effects of text segmentation on silent reading of Chinese. Proc. TAL 2014, Nijmegen, The Netherlands.

 

Tel./Fax: +86-25-8359-8624

Mobile: +86-18936872840

E-mails: wtgu@njnu.edu.cn, wentaogu@gmail.com