Multimodal Monday #43: Stop Looking, Start Seeing
Week of Jan 26 - Feb 1: World models dominate as EgoWM simulates robot actions from single images, Drive-JEPA predicts what matters for driving, Project Genie hallucinates playable games, and Google's Agentic Vision turns image analysis into active exploration.