Tag: jepa

Blog Post·2024-06-19·4 min read

I-JEPA: Self-Supervised Vision at Scale

I-JEPA applies the JEPA idea to images: predict the representations of target patches from a context region, without any view-level augmentations. The result transfers better to semantic tasks than pixel-level methods.

ssl jepa i-jepa vision-transformers

Blog Post·2024-06-19·5 min read

JEPA: Predicting in Representation Space

MAE predicts pixels. Contrastive methods match views. JEPA predicts representations of target regions from context regions — in an abstract space where irrelevant details have already been discarded.

ssl jepa representation-learning world-models

Blog Post·2024-06-19·4 min read

V-JEPA: Predicting the Future in Representation Space

V-JEPA extends JEPA to video: predict the representations of future or masked frames from context frames. No pixel reconstruction, no contrastive loss — just abstract prediction across time.

ssl jepa v-jepa video world-models

Blog Post·2024-06-19·5 min read

World Models: The Bigger Picture Behind JEPA

JEPA is a learning architecture. World models are the goal it points toward — internal simulators that can predict the consequences of actions and support planning without interacting with the real world.

ssl world-models jepa embodied-ai planning