V-JEPA: AI Model Learning Physical Intuition from Videos