Hajime Murao and Shinzo Kitamura
In this paper, we propose actor-critic learning with adaptive state space construction. The gaussian radial basis function neural network is employed for both the actor and the critic modules, where each hidden neuron covers a subspace of the sensor space and so the hidden layer corresponds to the state space. In the proposed algorithm, a robot starts without any states and a new state is generated incrementally by adding a new hidden neuron. One clear advantage of the proposed algorithm to others is the performance improvement by the minimal training after adding a new state, i.e. the adjustment of the connective strength between the new neuron and others is only required after adding a new hidden neuron. This provides an efficient method to construct the state space during the learning in the real world. Resulting state space represents aspects of the environment in which the robot works and the characteristic of the robot itself. In this sense, the obtained state space is said to represent the embodiment of the robot.
Keywords: Actor-Critic learning, Reinforcement learning, State space construction, Embodiment, Mobile robot