STOCHASTlC POLlCY IN Q-LEARNING USED FOR CONTROL OF AMB


Abstract cze:
Abstrakt: A great intention is lately focused on Reinforcement Learning (RL) methods. The article is focused on improving model free RL method known as Q-Iearning algorithm used on active magnetic bearing (AMB) model. Stochastic strategy and adaptive integration step increased the speed of learning approximately hundred times. Impossibility of using proposed improvement online is the only drawback, however it might be used for pretraining on simulation model and further fined online.

Abstract eng:
Abstract: A great intention is lately focused on Reinforcement Learning (RL) methods. The article is focused on improving model free RL method known as Q-Iearning algorithm used on active magnetic bearing (AMB) model. Stochastic strategy and adaptive integration step increased the speed of learning approximately hundred times. Impossibility of using proposed improvement online is the only drawback, however it might be used for pretraining on simulation model and further fined online.

Contributors:
Publisher:
Institute of Mechanics and Solids, FME, TU Brno
Conference Title:
Conference Title:
Engineering Mechanics 2002
Conference Venue:
Svratka (CZ)
Conference Dates:
2002-05-13 / 2002-05-16
Rights:
Text je chráněný podle autorského zákona č. 121/2000 Sb.



Record appears in:

 Record created 2014-10-23, last modified 2014-11-18


Original version of the author's contribution as presented on CD, . :
Download fulltext
PDF

Rate this document:

Rate this document:
1
2
3
 
(Not yet reviewed)