When designing a deep learning model, whenever a decision must be made, the answer is simple: multiply your data by a weight matrix. For example, in an LSTM, need to forget some data? Just multiply by a weight matrix to get the forget gate. The system will learn what you meant.
Gives us insight into the multidimensional nature of human cognition; We suffered dearly under the unidimensionalism of behaviorism. Symbol processing mindset was an improvement but still prayed at the church of mechanistic thinking. Now we can move into #holisticai
it's remarkable that many problems are amenable - historically it is probably not uncommon that the complexity of a problem is overestimated until a solution is found