Walking gait of four-legged robot obtained through Q-learning
Abstract eng: This contribution discusses a possible method for obtaining a walking policy for a four-legged robot through Q-learning. Q-learning is implemented on an architecture represented by a non-deterministic state machine that defines both the possible discrete states and the admissible transitions between them. A discrete state is a vector of indicators of goal achievement by the individual, simultaneously activated instances of two basic controllers. Only simultaneous activations that guarantee the static stability of the robot are admissible, even when individual activations could not achieve their goals. The controllers attempt to achieve their goals using an on-line minimization process. Q-learning successively improves the estimate of the future benefit of using each admissible simultaneous activation in each discrete state. The walking policy is generated by selecting the activations with the highest estimated future benefit.
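The learning scheme summarized in the abstract can be illustrated with a minimal tabular Q-learning sketch. It is not the authors' implementation: the three-entry transition table, the activation names "lift" and "push", the reward values, and the parameters alpha, gamma and epsilon are all hypothetical, chosen only to show how the search is restricted to admissible activations and how a policy is read off from the highest estimates of future benefit.

```python
import random
from collections import defaultdict

# Hypothetical toy model: a state is a vector of goal-achievement indicators,
# and only the activations listed for a state are admissible in it.
ADMISSIBLE = {
    (0, 0): ["lift", "push"],
    (1, 0): ["push"],
}

# Non-deterministic transitions: each (state, activation) pair may lead to
# several (next_state, reward) outcomes. All values here are assumptions.
TRANSITIONS = {
    ((0, 0), "lift"): [((1, 0), 0.0), ((0, 0), 0.0)],  # goal may not be reached
    ((0, 0), "push"): [((0, 0), -1.0)],                # penalized, no progress
    ((1, 0), "push"): [((0, 0), 1.0)],                 # rewarded forward progress
}


def step(state, action, rng):
    """Sample one outcome of applying an admissible activation."""
    return rng.choice(TRANSITIONS[(state, action)])


def q_learning(admissible, start, episodes=200, horizon=20,
               alpha=0.5, gamma=0.9, epsilon=0.2, seed=0):
    """Tabular Q-learning restricted to the admissible activations of each state."""
    rng = random.Random(seed)
    q = defaultdict(float)  # (state, activation) -> estimated future benefit
    for _ in range(episodes):
        state = start
        for _ in range(horizon):
            actions = admissible[state]
            if rng.random() < epsilon:
                action = rng.choice(actions)  # explore
            else:
                action = max(actions, key=lambda a: q[(state, a)])  # exploit
            next_state, reward = step(state, action, rng)
            best_next = max(q[(next_state, a)] for a in admissible[next_state])
            # Move the estimate toward the one-step bootstrapped target.
            q[(state, action)] += alpha * (
                reward + gamma * best_next - q[(state, action)]
            )
            state = next_state
    return q


def greedy_policy(q, admissible):
    """Pick, in every state, the activation with the highest estimate."""
    return {s: max(acts, key=lambda a: q[(s, a)]) for s, acts in admissible.items()}


q = q_learning(ADMISSIBLE, start=(0, 0))
policy = greedy_policy(q, ADMISSIBLE)
```

In this sketch, admissibility is enforced simply by never offering an inadmissible activation to the learner. A real robot would presumably need a static-stability check to build the admissible sets, which is outside the scope of this illustration.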
Contributors:
Publisher:
Institute of Theoretical and Applied Mechanics AS CR, Prague
Conference Title:
Engineering Mechanics 2003
Conference Venue:
Svratka (CZ)
Conference Dates:
2003-05-12 / 2003-05-15
Rights:
The text is protected under Copyright Act No. 121/2000 Coll.
Record appears in:
Record created 2014-10-24, last modified 2014-11-18