Blog Post··5 min read
Masked Autoencoders: Learning by Filling in the Blanks
Mask 75% of an image's patches. Train a model to reconstruct them. The result is a rich visual representation — and the recipe works because pixels are redundant and structure is not.