今日从 arXiv 订阅中筛选 8 篇论文。
⚡ Emergent Semantic Representations in World Models through Physical Interaction without Linguistic Supervision
⚡ VisualThink-VLA: Visual Intermediate Reasoning for Effective and Low-Latency VLA Policies

⚡ minWM: A Full-Stack Open-Source Framework for Real-Time Interactive Video World Models

⚡ Mitigating State Aliasing in VLA Models via Inverse Dynamics Learning

⚡ BitTP: The Lightweight Trajectory Prediction Model with BitLLM for Edge-Devices
⚡ YoCausal: How Far is Video Generation from World Model? A Causality Perspective

自动生成于 2026-05-30 · 基于 arXiv Daily Digest

