今日从 arXiv 订阅中筛选 8 篇论文。
⚡ EgoDyn-Bench Evaluating Ego-Motion Understanding in Vision-Centric Foundation Models for Autonomous Driving
⚡ PhysNote Self-Knowledge Notes for Evolvable Physical Reasoning in Vision-Language Model

⚡ SceneSelect Selective Learning for Trajectory Scene Classification and Expert Scheduling

⚡ ESIA An Energy-Based Spatiotemporal Interaction-Aware Framework for Pedestrian Intention Prediction

⚡ CF-VLA Efficient Coarse-to-Fine Action Generation for Vision-Language-Action Policies

⚡ Towards Lawful Autonomous Driving Deriving Scenario-Aware Driving Requirements from Traffic Laws and Regulations

⚡ Global Context or Local Detail Adaptive Visual Grounding for Hallucination Mitigation

⚡ UpstreamQA A Modular Framework for Explicit Reasoning on Video Question Answering Tasks

自动生成于 2026-04-29 · 基于 arXiv Daily Digest