<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Reinforcement-Learning on Galliard7</title><link>https://galliard7.github.io/tags/reinforcement-learning/</link><description>Recent content in Reinforcement-Learning on Galliard7</description><generator>Hugo</generator><language>en-us</language><lastBuildDate>Tue, 15 Sep 2020 00:00:00 +0000</lastBuildDate><atom:link href="https://galliard7.github.io/tags/reinforcement-learning/index.xml" rel="self" type="application/rss+xml"/><item><title>Google Research Football</title><link>https://galliard7.github.io/projects/google-research-football/</link><pubDate>Tue, 15 Sep 2020 00:00:00 +0000</pubDate><guid>https://galliard7.github.io/projects/google-research-football/</guid><description>&lt;h2 id="overview"&gt;Overview&lt;/h2&gt;
&lt;p&gt;Building AI agents that play 11v11 simulated football. Agents receive game observations (player positions, ball state, game mode) and return actions, competing head-to-head on Kaggle&amp;rsquo;s evaluation servers with Elo-style rating.&lt;/p&gt;
&lt;h2 id="approach"&gt;Approach&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Rule-based tactical foundation&lt;/strong&gt; — &amp;ldquo;marauding wingers&amp;rdquo; formation: wide players sprint down flanks and deliver crosses into the box&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Zone-based decision architecture&lt;/strong&gt; — field divided into zones (defensive third, wing corridors, crossing range, shooting range) with different behaviors per zone&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Opponent-aware mechanics&lt;/strong&gt; — proximity detection for context-sensitive decisions: sprint in open space, dribble under pressure, pass when crowded&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Goalkeeper exploitation&lt;/strong&gt; — specific logic to detect when the opposing keeper is out of position and trigger long-range shots&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Sprint/dribble state machine&lt;/strong&gt; — manages action mode based on field position and opponent proximity&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="result"&gt;Result&lt;/h2&gt;
&lt;p&gt;61/1138 🥉&lt;/p&gt;</description></item></channel></rss>