Using Expert Traces to Reduce Training Time in Reinforcement Learning
Monday, April 21, 2008, 11:15am
325b ITE
Reinforcement learning algorithms are typically used to solve Markov
decision problems. Using reinforcement learning to find an optimal or
sub-optimal solution is a slow process. The use of prior knowledge can
shorten the length of this task. In this thesis, we explore the use of
expert traces to speed up learning. An expert trace is a record of the
states visited and actions taken by an expert in a given domain, which is
then used to reduce the training time of reinforcement learning. We
demonstrate in a variety of domains significant improvements in learning
when complementing reinforcement learning with expert traces.