Welcome to Journal of Beijing Institute of Technology
Volume 23, Issue 3
WU Wang-hui, XIE Xiang, JIAO Yi-shan, ZHANG Zheng, GAO Gao. Design and implementation of scenic-spot introduction-task-oriented 3D virtual human spoken dialogue system[J]. JOURNAL OF BEIJING INSTITUTE OF TECHNOLOGY, 2014, 23(3): 395-400.

Design and implementation of scenic-spot introduction-task-oriented 3D virtual human spoken dialogue system

  • Received Date: 2013-03-23
  • EasyGuide, a 3D virtual human spoken dialogue system oriented to the task of introducing scenic spots, is presented. The system comprises five modules: natural language processing, a task-domain knowledge database, dialogue management, voice processing, and 3D virtual human text-to-visual speech synthesis. For the first module, dictionary construction, sentence analysis, and semantic representation are described in detail. A tree-structured knowledge database is designed for the task domain. A novel dialogue management framework based on keyword analysis and context constraints is proposed. For the voice processing module, a software development kit that performs speech recognition and synthesis is introduced briefly. For the last module, 3D viseme synthesis is explained with examples, and a text-driven facial animation system is presented. Evaluation results show that the system achieves satisfactory performance.
