A first speech recognition system for Mandarin-English code-switch conversational speech

Institution : Nanyang Technological University, Singapore

Details

Authors : Vu, Ngoc Thang; Lyu, Dau-Cheng; Weiner, Jochen; Telaar, Dominic; Schlippe, Tim; Blaicher, Fabian; Chng, Eng Siong; Schultz, Tanja; Li, Haizhou
Keywords : DRNTU::Engineering::Computer science and engineering
Year of publication : 2012 (B.E. 2555)
Citation : Vu, N. T., Lyu, D.-C., Weiner, J., Telaar, D., Schlippe, T., Blaicher, F., et al. (2012). A first speech recognition system for Mandarin-English code-switch conversational speech. 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 4889-4892. http://hdl.handle.net/10220/13411, http://dx.doi.org/10.1109/ICASSP.2012.6289015
Abstract/Description :

This paper presents first steps toward a large vocabulary continuous speech recognition (LVCSR) system for conversational Mandarin-English code-switching (CS) speech. We applied state-of-the-art techniques such as speaker-adaptive training and discriminative training to build the first baseline system on the SEAME (South East Asia Mandarin-English) corpus [1]. For acoustic modeling, we applied different phone merging approaches based on the International Phonetic Alphabet (IPA) and the Bhattacharyya distance, in combination with discriminative training, to improve accuracy. At the language model level, we investigated statistical machine translation (SMT)-based text generation approaches for building code-switching language models. Furthermore, we integrated information provided by a language identification (LID) system into the decoding process using a multi-stream approach. Our best 2-pass system achieves a Mixed Error Rate (MER) of 36.6% on the SEAME development set.
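
The abstract reports its result as a Mixed Error Rate but does not define the metric. A minimal sketch follows, assuming the convention commonly used for Mandarin-English code-switching evaluation: errors are counted per word for English and per character for Mandarin, then divided by the total number of reference tokens. The tokenizer, the function names, and the example sentences are hypothetical illustrations, not the authors' scoring tool or data.

```python
import re

# Assumption: one token per CJK character, one token per run of Latin
# letters, digits, or apostrophes. Real scoring tools may normalize differently.
_TOKEN = re.compile(r"[\u4e00-\u9fff]|[A-Za-z0-9']+")


def mixed_tokens(text):
    """Split a code-switched string into Mandarin characters and English words."""
    return [tok.lower() for tok in _TOKEN.findall(text)]


def edit_distance(ref, hyp):
    """Levenshtein distance between two token lists (unit costs)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def mixed_error_rate(references, hypotheses):
    """Total edit errors divided by total reference tokens, over all utterances."""
    errors = 0
    total = 0
    for ref, hyp in zip(references, hypotheses):
        ref_tokens = mixed_tokens(ref)
        errors += edit_distance(ref_tokens, mixed_tokens(hyp))
        total += len(ref_tokens)
    if total == 0:
        raise ValueError("reference contains no tokens")
    return errors / total
```

For example, scoring the made-up hypothesis "我想 shopping more" against the reference "我想去 shopping mall" gives five reference tokens, one deletion, and one substitution, so `mixed_error_rate` returns 0.4.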
