Why does so much brainpower and computation solve irrelevant problems?
Here's my view:
It's embarrassing to admit we lack simulators for meaningful problems. And closing the gap to worthwhile problems is a different kind of "hard problem" than the one today's practitioners enjoy.
We've been training a single AI for the past 10 months, something we haven't seen before in reinforcement learning.
OpenAI Five at The International was 1.5 months old.
OpenAI Five at Finals was 10 months old.
*Huge* difference in performance. And the curves still haven't leveled off.
Dota is a convenient testbed for pushing the limits of general-purpose deep RL technology.
Here's a physical robotics problem we solved using the learning system we wrote for Dota: blog.openai.com/learning-dex…
Apr 15, 2019 · 10:52 PM UTC



