From a student's perspective: when getting more performance from a model increase like this, one usually worries about overfitting. Is that a problem for you? How do you manage it?
This is one of the things I find most remarkable about OpenAI Five: somehow, the bot and human strategy spaces overlap significantly. We know that for minigame environments we needed to add randomizations to make that happen; it's less certain now that the env has gotten harder.
Sep 7, 2018 · 7:45 PM UTC



