Fujipress Home | Search | About FINDER

Paper:
Language: English:

Specification and Implementation of Open Source Software Suite for Realizing Communication Intelligence


Yosuke Matsusaka, Hideki Asoh, Isao Hara,
and Futoshi Asano


National Institute of Advanced Industrial Science and Technology (AIST), 1-1-1 Umezono, Tsukuba, Ibaraki 305-0045, Japan


Received: May 2, 2011

Accepted: July 27, 2011


Keywords: RT-middleware, human robot interaction, speech recognition, semantics

Journal ref: Journal of Robotics and Mechatronics, Vol.24, No.1 pp. 86-94, 2012

Abstract



We are presently developing a set of software called the Open Source Software Suite for Human Robot Interaction (OpenHRI). The OpenHRI has the following features: It is implemented on RT-Component, an Object Management Group (OMG) compliant robot technology component specification that can be easily integrated into any robot system. It can perform various functions, from audio signal processing to dialog management, in a uniform and reconfigurable manner. It not only implements each required function of components but also defines a meta-level specification to enable the developer to verify whether the structural design of components is correct. In this paper, we introduce the implementation of the OpenHRI, present the architectural design of the system, and provide examples of applications.
preview Preview (PDF)  full text Full Text (PDF 562KB)

Reference

[1] J. Valin, F. Michaud, and J. Rouat, “Robust Localization and Tracking of Simultaneous Moving Sound Sources Using Beamforming and Particle Filtering,” Robotics and Autonomous Systems J., Vol.55, No.3, pp. 216-228, 2007.

[2] K. Nakadai, H. Okuno, H. Nakajima, Y. Hasegawa, and H. Tsujino, “An Open Source Software System For Robot Audition HARK and Its Evaluation,” Proc. of the IEEE/RAS Int. Conf. on Humanoid Robots, pp. 561-566, 2008.

[3] T. Kawahara, A. Lee, T. Kobayashi, K. Takeda, N. Minematsu, S. Sagayama, K. Itou, A. Ito, M. Yamamoto, A. Yamada, T. Utsuro, and K. Shikano, “Free Software Toolkit for Japanese Large Vocabulary Continuous Speech Recognition,” Proc. of the Int. Conf. on Spoken Language Processing, Vol.4, pp. 476-479, 2000.

[4] I. Hara, F. Asano, H. Asoh, J. Ogata, N. Ichimura, Y. Kawai, F. Kanehiro, H. Hirukawa, and K. Yamamoto, “Robust SpeechInterface Based on Audio and Video Information Fusion for Humanoid HRP-2,” Proc. of the IEEE/RSJ Int. Conf. on Intelligent Robots and Systems, pp. 2404-2410, 2004.

[5] N. Ando, T. Suehiro, K. Kitagaki, T. Kotoku, and W. Yoon, “RTMiddleware: Distributed Component Middleware for RT (Robot Technology),” Proc. of the Int. Conf. on Intelligent Robots and Systems, pp. 3555-3560, 2005.

[6] Y.Matsusaka and I. Hara, “An Extensible Dialogue Script for Robot Based on Unification of State Transition Models,” Proc. of the 8th IEEE Int. Symp. on Computational Intelligence in Robotics and Automation, 2009.

[7] P. S. Rosenbloom, J. E. Laird, and A. Newell (Eds), “The Soar Papers,” Readings on Integrated Intelligence, MIT Press, 1993.

[8] Y. Matsusaka, H. Fujii, and I. Hara, “Health Exercise Demonstration Robot TAIZO and Effects of Using Voice Command in Robot-Human Collaborative Demonstration,” Proc. of the IEEE/RSJ Int. Symp. on Robot and Human Interactive Communication, 2009.

[9] S. Kawamoto et al., “Open-Source Software for Developing Anthropomorphic Spoken Dialog Agent,” Proc. of PRICAI-02, Int. Workshop on Lifelike Animated Agents, pp. 64-69, 2002.


Supporting Online Materials:

[a] Orca,
http://orca-robotics.sourceforge.net/

[b] ROS – Robot Open Source – Willow Garage,
http://www.willowgarage.com/pages/software/ros-platform

[c] NEDO IRTS Communication Intelligence SWG, “Communication Intelligence Common Specification version 1,”
http://www.openrtm.org/OpenRTM-aist/documents/IRT PJ/irt-speccommunication-ver1.pdf

[d] PortAudio – an Open-Source Cross-Platform Audio API,
http://www.portaudio.com/

[e] The Festival Speech Synthesis System,
http://www.cstr.ed.ac.uk/projects/festival/

[f] Open JTalk,
http://open-jtalk.sourceforge.net/

[g] Web Service Semantics – WSDL-S,
http://www.w3.org/Submission/WSDL-S/

[Notice]
* "Preview" is the first 2 pages of the article. You don't need the registration.
* To read the PDF file you will then need to download and install the Adobe Reader.
Adobe Reader is free and available for download here:

adobe reader

Terms and Conditions | Privacy Policy | Recruit | Advertising Information | Contact Us