We trained a single AI for the past 10 months, something we haven't seen before in reinforcement learning.
OpenAI Five at The International was 1.5 months old.
OpenAI Five at Finals was 10 months old.
*Huge* difference in performance. And the curves still haven't leveled off.
19
99
21
513
(The x-axis is total amount of compute consumed, and comes with labels!)
Apr 16, 2019 · 5:16 AM UTC
3
7




