
WU Wang-hui, XIE Xiang, JIAO Yi-shan, ZHANG Zheng, GAO Gao. Design and implementation of scenic-spot introduction-task-oriented 3D virtual human spoken dialogue system[J]. JOURNAL OF BEIJING INSTITUTE OF TECHNOLOGY, 2014, 23(3): 395-400.



Design and implementation of scenic-spot introduction-task-oriented 3D virtual human spoken dialogue system

Research Institute of Communication Technology, Beijing Institute of Technology, Beijing 100081, China

Received Date: 2013-03-23

Abstract


This paper introduces EasyGuide, a task-oriented 3D virtual human spoken dialogue system for introducing scenic spots. The system comprises five modules: natural language processing, a task-domain knowledge database, dialogue management, voice processing, and 3D virtual human text-to-visual speech synthesis. For the natural language processing module, dictionary construction, sentence analysis, and semantic representation are described in detail. A tree-structured knowledge database is designed for the task domain. A novel dialogue management framework based on keyword analysis and context constraints is proposed. For the voice processing module, a software development kit that performs speech recognition and synthesis is briefly introduced. For the last module, 3D viseme synthesis is explained with examples, and a text-driven facial animation system is presented. Evaluation results show that the system achieves satisfactory performance.
Keywords:
- spoken dialogue system
- speech recognition
- dialogue management
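
The abstract mentions a tree-structured knowledge database and a dialogue manager driven by keyword analysis and context constraints, but gives no implementation details. The following Python fragment is only an illustrative sketch of how such a combination might work, not the authors' design: the class names `Node` and `DialogueManager`, the helper `build_demo_tree`, the place names, and the rule "search the subtree of the most recently discussed node first" are all assumptions made for this example.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class Node:
    """One entry in a hypothetical tree-structured scenic-spot knowledge base."""
    name: str
    intro: str = ""
    children: Dict[str, "Node"] = field(default_factory=dict)

    def add(self, child: "Node") -> "Node":
        """Attach a child node and return it so calls can be chained."""
        self.children[child.name] = child
        return child

    def find(self, name: str) -> Optional["Node"]:
        """Depth-first search for a node called `name` in this subtree."""
        if self.name == name:
            return self
        for child in self.children.values():
            hit = child.find(name)
            if hit is not None:
                return hit
        return None


class DialogueManager:
    """Assumed policy: resolve keywords inside the current context first."""

    def __init__(self, root: Node) -> None:
        self.root = root
        self.context = root  # node discussed most recently

    def reply(self, keywords: List[str]) -> str:
        for kw in keywords:
            # Context constraint: prefer the current subtree, then the whole tree.
            node = self.context.find(kw) or self.root.find(kw)
            if node is not None:
                self.context = node
                return node.intro
        return "Sorry, I do not know about that."


def build_demo_tree() -> Node:
    """Build a tiny, made-up knowledge tree with an ambiguous name."""
    root = Node("Park", "Welcome to the park.")
    lake = root.add(Node("Lake Area", "The lake area."))
    lake.add(Node("Pavilion", "The lakeside pavilion."))
    hill = root.add(Node("Hill Area", "The hill area."))
    hill.add(Node("Pavilion", "The hilltop pavilion."))
    return root
```

In this sketch the keyword "Pavilion" is ambiguous across the whole tree. Asking about "Hill Area" first moves the context there, so a later "Pavilion" resolves to the hilltop entry rather than the lakeside one.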


