Specification and Implementation of Open Source Software Suite for Realizing Communication Intelligence

Yosuke Matsusaka; Hideki Asoh; Isao Hara; Futoshi Asano

doi:10.20965/jrm.2012.p0086

single-rb.php

« previous

JRM Vol.24 No.1 pp. 86-94

doi: 10.20965/jrm.2012.p0086

(2012)

Paper:

Views over last 60 days: 831

Specification and Implementation of Open Source Software Suite for Realizing Communication Intelligence

Yosuke Matsusaka, Hideki Asoh, Isao Hara,
and Futoshi Asano

National Institute of Advanced Industrial Science and Technology (AIST), 1-1-1 Umezono, Tsukuba, Ibaraki 305-0045, Japan

Received:

May 2, 2011

Accepted:

July 27, 2011

Published:

February 20, 2012

Keywords:

RT-middleware, human robot interaction, speech recognition, semantics

Abstract

We are presently developing a set of software called the Open Source Software Suite for Human Robot Interaction (OpenHRI). The OpenHRI has the following features: It is implemented on RT-Component, an Object Management Group (OMG) compliant robot technology component specification that can be easily integrated into any robot system. It can perform various functions, from audio signal processing to dialog management, in a uniform and reconfigurable manner. It not only implements each required function of components but also defines a meta-level specification to enable the developer to verify whether the structural design of components is correct. In this paper, we introduce the implementation of the OpenHRI, present the architectural design of the system, and provide examples of applications.

Cite this article as:

Y. Matsusaka, H. Asoh, I. Hara, and F. Asano, “Specification and Implementation of Open Source Software Suite for Realizing Communication Intelligence,” J. Robot. Mechatron., Vol.24 No.1, pp. 86-94, 2012.

Data files:

References

[1] J. Valin, F. Michaud, and J. Rouat, “Robust Localization and Tracking of Simultaneous Moving Sound Sources Using Beamforming and Particle Filtering,” Robotics and Autonomous Systems J., Vol.55, No.3, pp. 216-228, 2007.
[2] K. Nakadai, H. Okuno, H. Nakajima, Y. Hasegawa, and H. Tsujino, “An Open Source Software System For Robot Audition HARK and Its Evaluation,” Proc. of the IEEE/RAS Int. Conf. on Humanoid Robots, pp. 561-566, 2008.
[3] T. Kawahara, A. Lee, T. Kobayashi, K. Takeda, N. Minematsu, S. Sagayama, K. Itou, A. Ito, M. Yamamoto, A. Yamada, T. Utsuro, and K. Shikano, “Free Software Toolkit for Japanese Large Vocabulary Continuous Speech Recognition,” Proc. of the Int. Conf. on Spoken Language Processing, Vol.4, pp. 476-479, 2000.
[4] I. Hara, F. Asano, H. Asoh, J. Ogata, N. Ichimura, Y. Kawai, F. Kanehiro, H. Hirukawa, and K. Yamamoto, “Robust SpeechInterface Based on Audio and Video Information Fusion for Humanoid HRP-2,” Proc. of the IEEE/RSJ Int. Conf. on Intelligent Robots and Systems, pp. 2404-2410, 2004.
[5] N. Ando, T. Suehiro, K. Kitagaki, T. Kotoku, and W. Yoon, “RTMiddleware: Distributed Component Middleware for RT (Robot Technology),” Proc. of the Int. Conf. on Intelligent Robots and Systems, pp. 3555-3560, 2005.
[6] Y.Matsusaka and I. Hara, “An Extensible Dialogue Script for Robot Based on Unification of State Transition Models,” Proc. of the 8th IEEE Int. Symp. on Computational Intelligence in Robotics and Automation, 2009.
[7] P. S. Rosenbloom, J. E. Laird, and A. Newell (Eds), “The Soar Papers,” Readings on Integrated Intelligence, MIT Press, 1993.
[8] Y. Matsusaka, H. Fujii, and I. Hara, “Health Exercise Demonstration Robot TAIZO and Effects of Using Voice Command in Robot-Human Collaborative Demonstration,” Proc. of the IEEE/RSJ Int. Symp. on Robot and Human Interactive Communication, 2009.
[9] S. Kawamoto et al., “Open-Source Software for Developing Anthropomorphic Spoken Dialog Agent,” Proc. of PRICAI-02, Int. Workshop on Lifelike Animated Agents, pp. 64-69, 2002.
[10]
Supporting Online Materials:[a] Orca,
http://orca-robotics.sourceforge.net/
[11] [b] ROS – Robot Open Source – Willow Garage,
http://www.willowgarage.com/pages/software/ros-platform
[12] [c] NEDO IRTS Communication Intelligence SWG, “Communication Intelligence Common Specification version 1,”
http://www.openrtm.org/OpenRTM-aist/documents/IRT PJ/irt-speccommunication-ver1.pdf
[13] [d] PortAudio – an Open-Source Cross-Platform Audio API,
http://www.portaudio.com/
[14] [e] The Festival Speech Synthesis System,
http://www.cstr.ed.ac.uk/projects/festival/
[15] [f] Open JTalk,
http://open-jtalk.sourceforge.net/
[16] [g] Web Service Semantics – WSDL-S,
http://www.w3.org/Submission/WSDL-S/

This article is published under a Creative Commons Attribution-NoDerivatives 4.0 Internationa License.

[1] [1] J. Valin, F. Michaud, and J. Rouat, “Robust Localization and Tracking of Simultaneous Moving Sound Sources Using Beamforming and Particle Filtering,” Robotics and Autonomous Systems J., Vol.55, No.3, pp. 216-228, 2007.

[2] [2] K. Nakadai, H. Okuno, H. Nakajima, Y. Hasegawa, and H. Tsujino, “An Open Source Software System For Robot Audition HARK and Its Evaluation,” Proc. of the IEEE/RAS Int. Conf. on Humanoid Robots, pp. 561-566, 2008.

[3] [3] T. Kawahara, A. Lee, T. Kobayashi, K. Takeda, N. Minematsu, S. Sagayama, K. Itou, A. Ito, M. Yamamoto, A. Yamada, T. Utsuro, and K. Shikano, “Free Software Toolkit for Japanese Large Vocabulary Continuous Speech Recognition,” Proc. of the Int. Conf. on Spoken Language Processing, Vol.4, pp. 476-479, 2000.

[4] [4] I. Hara, F. Asano, H. Asoh, J. Ogata, N. Ichimura, Y. Kawai, F. Kanehiro, H. Hirukawa, and K. Yamamoto, “Robust SpeechInterface Based on Audio and Video Information Fusion for Humanoid HRP-2,” Proc. of the IEEE/RSJ Int. Conf. on Intelligent Robots and Systems, pp. 2404-2410, 2004.

[5] [5] N. Ando, T. Suehiro, K. Kitagaki, T. Kotoku, and W. Yoon, “RTMiddleware: Distributed Component Middleware for RT (Robot Technology),” Proc. of the Int. Conf. on Intelligent Robots and Systems, pp. 3555-3560, 2005.

[6] [6] Y.Matsusaka and I. Hara, “An Extensible Dialogue Script for Robot Based on Unification of State Transition Models,” Proc. of the 8th IEEE Int. Symp. on Computational Intelligence in Robotics and Automation, 2009.

[7] [7] P. S. Rosenbloom, J. E. Laird, and A. Newell (Eds), “The Soar Papers,” Readings on Integrated Intelligence, MIT Press, 1993.

[8] [8] Y. Matsusaka, H. Fujii, and I. Hara, “Health Exercise Demonstration Robot TAIZO and Effects of Using Voice Command in Robot-Human Collaborative Demonstration,” Proc. of the IEEE/RSJ Int. Symp. on Robot and Human Interactive Communication, 2009.

[9] [9] S. Kawamoto et al., “Open-Source Software for Developing Anthropomorphic Spoken Dialog Agent,” Proc. of PRICAI-02, Int. Workshop on Lifelike Animated Agents, pp. 64-69, 2002.

[10] [10]
Supporting Online Materials:[a] Orca,
http://orca-robotics.sourceforge.net/

[11] [11] [b] ROS – Robot Open Source – Willow Garage,
http://www.willowgarage.com/pages/software/ros-platform

[12] [12] [c] NEDO IRTS Communication Intelligence SWG, “Communication Intelligence Common Specification version 1,”
http://www.openrtm.org/OpenRTM-aist/documents/IRT PJ/irt-speccommunication-ver1.pdf

[13] [13] [d] PortAudio – an Open-Source Cross-Platform Audio API,
http://www.portaudio.com/

[14] [14] [e] The Festival Speech Synthesis System,
http://www.cstr.ed.ac.uk/projects/festival/

[15] [15] [f] Open JTalk,
http://open-jtalk.sourceforge.net/

[16] [16] [g] Web Service Semantics – WSDL-S,
http://www.w3.org/Submission/WSDL-S/

Specification and Implementation of Open Source Software Suite for Realizing Communication Intelligence

Yosuke Matsusaka, Hideki Asoh, Isao Hara, and Futoshi Asano

Yosuke Matsusaka, Hideki Asoh, Isao Hara,
and Futoshi Asano