How large neural networks are trained across increasingly massive clusters — cleverly slicing the computation on a wide variety of axes, rematerializing intermediate results, and much more:
Techniques for training large neural networks, by @lilianweng and @gdb: openai.com/blog/techniques-f…
Jun 9, 2022 · 4:27 PM UTC
2
4
53


