Today's selection: 8 papers filtered from the arXiv subscription feed.

⚡ Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond

⚡ OccDirector: Language-Guided Behavior and Interaction Generation in 4D Occupancy Space

⚡ Towards Safe Mobility: A Unified Transportation Foundation Model Enabled by Open-Ended Vision-Language Dataset

⚡ SpaMEM: Benchmarking Dynamic Spatial Reasoning via Perception-Memory Integration in Embodied Environments

⚡ Cross-Stage Coherence in Hierarchical Driving VQA: Explicit Baselines and Learned Gated Context Projectors

⚡ GenMatter: Perceiving Physical Objects with Generative Matter Models

⚡ CGC: Compositional Grounded Contrast for Fine-Grained Multi-Image Understanding

⚡ Towards Temporal Compositional Reasoning in Long-Form Sports Videos


Auto-generated on 2026-04-28 · Based on arXiv Daily Digest