Reinforcement Learning describes the situation of a machine learning system, where the only training signal provided by the environment is that of success or failure of the agent, after the system has acted over a sequence of decision cycles. This learning problem can be formulated as a Markov Decision Process (MDP) within the framework of Dynamic Programming. The main motivation behind the Brainstormers' effort in the soccer domain is to investigate Reinforcement Learning (RL) methods in complex domains and to develop new variants and practical algprithmus. We consider it important that we not only demonstrate the principal feasibility of RL, but actually do apply learned behavior in our competition team. Our long term goal is a team of learning agent, where we only plug in 'Win the match' - and our agents learn to generate the appropriate behavior.
Below you'll find a mini
tutorial, made up of five lections, on the use of Reinforcement Learning methods in the context of
Robotic Soccer.
[Note
that Shockwave Flash is required for the tutorial's animations.]
For
the purpose of illustration this tutorial is kept quite
straightforward. The first page includes an explanation of
Reinforcement Learning and wants to equip you with the mandatory notion
used throughout the tutorial.
During the subsequent lections you
will get to know how a soccer-playing agent can manage to autonomously
acquire some specific capabilities, such as intercepting a ball or
dribbling. Finally, a multi agent setting is considered, where two
players must cooperatively learn to shoot a goal.
Lection |
Setting |
Topic |
|
I |
Basics |
Explanation of Reinforcement Learning |
|
II |
One Agent |
Intercept the Ball |
|
III |
One Agent |
Goalshot |
|
IV |
One Agent |
Dribble |
|
V |
Multi Agent |
Pass and Goalshot |