Blog Post··4 min read
I-JEPA: Self-Supervised Vision at Scale
I-JEPA applies the JEPA idea to images: predict the representations of target patches from a context region, without any view-level augmentations. The result transfers better to semantic tasks than pixel-level methods.