These days The Matrix really is everywhere.
When designing a deep learning model, whenever a decision must be made, the answer is simple: multiply your data by a weight matrix. For example, in an LSTM, need to forget some data? Just multiply by a weight matrix to get the forget gate. The system will learn what you meant.
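For the curious, here is roughly what "multiply by a weight matrix to get the forget gate" looks like in code. This is a minimal sketch, not anyone's reference implementation: it assumes the standard LSTM formulation, where the gate is a sigmoid of a weight matrix times the concatenated previous hidden state and current input, plus a bias. The names `forget_gate`, `W_f`, `b_f`, `h_prev`, `x`, and `c_prev` are illustrative, and the weights are hand-picked toy values rather than learned ones.

```python
import math


def sigmoid(z):
    """Squash a real number into the open interval (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))


def forget_gate(W_f, b_f, h_prev, x):
    """Compute f = sigmoid(W_f @ [h_prev; x] + b_f).

    W_f is a list of rows (one row per cell-state entry), b_f is a list of
    biases, h_prev and x are plain lists of floats.
    """
    concat = h_prev + x  # list concatenation: [h_prev; x]
    return [
        sigmoid(sum(w * v for w, v in zip(row, concat)) + b)
        for row, b in zip(W_f, b_f)
    ]


# Toy values (assumed for illustration, not learned).
h_prev = [0.5, -0.5]          # previous hidden state
x = [1.0]                     # current input
W_f = [
    [0.0, 0.0, 0.0],          # pre-activation 0  -> gate of exactly 0.5
    [10.0, 10.0, 10.0],       # pre-activation 10 -> gate close to 1
]
b_f = [0.0, 0.0]
c_prev = [2.0, 2.0]           # previous cell state

f = forget_gate(W_f, b_f, h_prev, x)

# The gate scales the old cell state elementwise: near 0 forgets, near 1 keeps.
c_kept = [f_i * c_i for f_i, c_i in zip(f, c_prev)]
```

With these toy weights the first cell-state entry is halved and the second is kept almost intact. A full LSTM step would also add the input gate's contribution to the cell state, which this sketch leaves out.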

Dec 6, 2018 · 5:31 PM UTC

Replying to @gdb
Simulacra and Simulation. Small probability we’re living in it, so the fact it’s “everywhere” makes sense.