Today's selection: 8 papers from the arXiv subscription feed.

⚡ OmniVTG: A Large-Scale Dataset and Training Paradigm for Open-World Video Temporal Grounding

⚡ Instruction-Evidence Contrastive Dual-Stream Decoding for Grounded Vision-Language Reasoning

⚡ Leveraging Previous-Traversal Point Cloud Map Priors for Camera-Based 3D Object Detection and Tracking

⚡ Biased Dreams: Limitations to Epistemic Uncertainty Quantification in Latent Space Models

⚡ Prefill-Time Intervention for Mitigating Hallucination in Large Vision-Language Models

⚡ SIEVES: Selective Prediction Generalizes through Visual Evidence Scoring

⚡ Interactive Episodic Memory with User Feedback

⚡ Control Your Queries: Heterogeneous Query Interaction for Camera-Radar Fusion

Auto-generated on 2026-04-30 · Based on arXiv Daily Digest