Grokking: When Neural Networks Suddenly Understand
Series: Mechanistic Interpretability | Part: 4 of 9
In 2021, researchers at OpenAI noticed something strange. They trained a small transformer on simple modular arithmetic—the kind of problem you learn in middle school. At first, the network memorized the training examples perfectly but failed completely on new ones. Standard overfitting