Super interesting paper just out in Nature Human Behaviour!
Do humans learn like transformers?
In a smart experiment, the authors trained humans and transformer networks on the same rule-learning task, manipulating only one thing: the distribution of training examples, from fully diverse (every example unique) to highly redundant (the same items repeated).
The first results are already interesting:
Diverse examples lead both human and artificial systems to generalise rules to novel situations.
Redundant examples lead both humans and artificial systems to memorize examples.
Additionally, the switching between these two strategies appear at similar tradeoffs.
So, do humans and transformers learn in the same way? Not quite! And it’s here that things get super interesting:
If you show diverse examples first, humans learn to generalize without losing the ability to memorize later. Transformers, by contrast, do not show the same benefit: when training shifts toward memorization, earlier generalization does not reliably carry over.
Humans can accumulate learning strategies more flexibly than transformers.
Paper in the first reply