A Minimal Model of Representation Collapse
We build a minimal dynamical model directly in representation space, abstracting away the details of network architecture and parameters. We use the concept of frustration from statistical physics to describe the core mechanism behind representation collapse, and analyze how Stop-Gradient can break the symmetry and open up a non-collapsing subspace that preserves geometric separation between classes.
- Authors: Louie Hong Yao*, Yuhao Li*, Shengchao Liu
- Preprint: arXiv:2604.09979
- Detailed Introduction in English: My Blog
- Detailed Introduction in Chinese: WeChat Article
