HMM-based Temporal Difference Learning with State Transition Updating for Tracking Human Communicational Behaviors

Minh Anh T. Ho; Yoji Yamada; Yoji Umetani

doi:10.20965/jrm.2003.p0271

single-rb.php

« previous

JRM Vol.15 No.3 pp. 271-277

(2003)

doi: 10.20965/jrm.2003.p0271

Paper:

Views over last 60 days: 558

HMM-based Temporal Difference Learning with State Transition Updating for Tracking Human Communicational Behaviors

Minh Anh T. Ho, Yoji Yamada, and Yoji Umetani

Intelligent Systems Laboratory, Graduate School of Toyota Technological Institute, 2-12 Hisakata, Tenpa-ku, Nagoya, 468-2511 Japan

Received:

November 11, 2002

Accepted:

March 4, 2003

Published:

June 20, 2003

Keywords:

visual tracking, intended gestures, hidden Markov model, reinforcement learning, state transition update

Abstract

In our original system, we used hidden Markov models (HMMs) to model rough gesture patterns. We later utilized temporal difference (TD) learning to adjust the action model of the tracker for its behavior in the tracking task. We integrated the above two methods into an algorithm by assigning state transition probability in HMMs as a reward in TD learning. Identification of the sign gesture context through wavelet analysis autonomously provides a reward value for optimizing the attentive visual attentive tracker's AVAT's action patterns. A bound of state value functions as a constraint factor for the updating procedure in TD models has been determined to recognize whether predictive models need to be updated according with action models. Experimental results of extracting an operator's hand sign sequence during natural walking demonstrates AVAT development in the perceptual organization framework.

Cite this article as:

M. Ho, Y. Yamada, and Y. Umetani, “HMM-based Temporal Difference Learning with State Transition Updating for Tracking Human Communicational Behaviors,” J. Robot. Mechatron., Vol.15 No.3, pp. 271-277, 2003.

Data files:

This article is published under a Creative Commons Attribution-NoDerivatives 4.0 Internationa License.