Blog Post··11 min read
Representation Geometry: How Neural Networks Encode Meaning
The linear representation hypothesis, superposition, polysemanticity, and why transformer activations are more structured than they look.
The linear representation hypothesis, superposition, polysemanticity, and why transformer activations are more structured than they look.