Reinforcement Learning for the Soccer Dribbling Task Arthur Carvalho Renato Oliveira

Introduction RoboCup soccer simulation Scoring A Data Mining Approach to Solve the Goal Scoring Problem Passing A New Passing Strategy Based on Q-Learning Algorithm in RoboCup Dribbling ?

Soccer Dribbling Task

Outline The soccer dribbling task as a RL problem RL solution Experiments Conclusion

The Soccer Dribbling Task as a RL Problem Coach Setting positions Dribbler is placed in the center-left region together with the ball Adversary is placed in a random position Manage the play Adversary wins when he gains possession or when the ball goes out of the field Dribbler wins when he crosses the field with the ball

The Soccer Dribbling Task as a RL Problem When an episode ends, the coach starts a new one RoboCup soccer simulator operates in discrete time steps Episodic reinforcement-learning framework

The Soccer Dribbling Task as a RL Problem Actions HoldBall() Dribble(, k) Dribble(30, 5), Dribble(330, 5), Dribble(0, 5), Dribble(0, 10) The dribbler can kick the ball forward (strongly and weakly), diagonally upward, and diagonally downward.

The Soccer Dribbling Task as a RL Problem State VariableMeaning posY (dribbler) Vertical position of the dribbler ang(dribbler)Global angle of the dribbler ang(dribbler; adversary) The relative angle between the dribbler and the adversary ang(ball; adversary) The relative angle between the ball and the adversary dist(ball; adversary) Distance between the ball and the adversary

RL Solution

CMAC Partitioning the state space into several receptive fields (hyper-rectangles) Each one is associated with a weight Multiple partitions of the state space (layers) are usually used The CMACs response to a given input is equal to the sum of the weights of the excited receptive fields

RL Solution

Experiments

Adversary Fixed policy It computes a near-optimal interception point (UvA Trilearn 2003 team) Two phases Training Testing

Experiments Training Phase: 5 independent runs, each one lasting 50,000 episodes 53%

Experiments Qualitatively Rule #1

Experiments Qualitatively Rule #2

Experiments

Dribble Soccer dribbling task Reinforcement learning solution Benchmark Start point for dribbling tasks in other sports games E.g., hockey, basketball, and football

Thank you! Source code available at: http://sites.google.com/site/soccerdribbling Arthur Carvalho Renato Oliveira a3carval@cs.uwaterloo.ca rmo@cin.ufpe.br

Reinforcement Learning for the Soccer Dribbling Task Arthur Carvalho Renato Oliveira

Documents

Dribbling Skill of Junior Basketballers

Isis Didier Lins1*, Paulo Renato Alves Firmino2, Diogo de Carvalho

Renato sued

Scanned Document - Inicial — UFRGS · Plantas Forrageiras — Renato Borges de Medeiros e Paulo César de Fáccio Carvalho; e o representante discente Felipe Jochims para apreciar

Carvalho veto

Renato Constantino

Dribbling drills

REDDeX Carvalho

Volunteer Proﬁle - Renato Mauro Preda Baseline Newsletter... · 2016. 7. 11. · Image: Renato Mauro Preda Volunteer Proﬁle - Renato Mauro Preda Renato Mauro Preda is the founder

Shawn Farmer: A Guide to Basketball Dribbling

Warm Up: Dribbling

dribbling book.pdf

Collective Reaction and Dribbling

Soccer Dribbling and Juggling Warm-up

Concerning Renato

Trieste, 26 novembre © 2005 – Renato Lukač Renato@spark.si Using OSS in Slovenian High Schools doc. dr. Renato Lukač Renato@spark.si LinuxDay 2005 - Trieste

Meteorology applied to urban air pollution problems...vador, Mill an Mill an, Erik Berge, Leiv H avard Sl˝rdal, Renato Carvalho, Victor Prior, Antonio Viseu, Francois Bompay, Alexis

Reinforcement Learning for the Soccer Dribbling Task

Renato Recio

Lesson Plan Topics - Amazon Web Services...Page | 1 Lesson Plan Topics INDEX Page Module 1 Dribbling Week 1 Dribbling and Turning Page 2 Week 2 Dribbling to beat an Opponent Page 3