今日从 arXiv 订阅中筛选 8 篇论文。
⚡ Agentic World Modeling Foundations, Capabilities, Laws, and Beyond

⚡ OccDirector Language-Guided Behavior and Interaction Generation in 4D Occupancy Space
⚡ Towards Safe Mobility A Unified Transportation Foundation Model enabled by Open-Ended Vision-Language Dataset

⚡ SpaMEM Benchmarking Dynamic Spatial Reasoning via Perception-Memory Integration in Embodied Environments

⚡ Cross-Stage Coherence in Hierarchical Driving VQA Explicit Baselines and Learned Gated Context Projectors
⚡ GenMatter Perceiving Physical Objects with Generative Matter Models

⚡ CGC Compositional Grounded Contrast for Fine-Grained Multi-Image Understanding

⚡ Towards Temporal Compositional Reasoning in Long-Form Sports Videos
自动生成于 2026-04-28 · 基于 arXiv Daily Digest