Blog Post··3 min read
The Architecture Playground: What a Transformer Config Actually Buys You
An interactive research blog. Drag the config of a decoder-only transformer — hidden size, head counts, FFN type — and watch the parameter count, KV cache, and mixture-of-experts routing recompute live.