今日从 arXiv 订阅中筛选 8 篇论文。
⚡ AD4AD Benchmarking Visual Anomaly Detection Models for Safer Autonomous Driving
⚡ RAD-2 Scaling Reinforcement Learning in a Generator-Discriminator Framework

⚡ Chain-of-Glimpse Search-Guided Progressive Object-Grounded Reasoning for Video Understanding

⚡ UniDoc-RL Coarse-to-Fine Visual RAG with Hierarchical Actions and Dense Rewards

⚡ Reasoning Dynamics and the Limits of Monitoring Modality Reliance in Vision-Language Models

⚡ ADAPT Benchmarking Commonsense Planning under Unspecified Affordance Constraints

⚡ HY-World 2.0 A Multi-Modal World Model for Reconstructing, Generating, and Simulating 3D Worlds

自动生成于 2026-04-18 · 基于 arXiv Daily Digest
