EventVAD: Training-Free Event-Aware Video Anomaly Detection Paper • 2504.13092 • Published Apr 17, 2025
RoadSceneVQA: Benchmarking Visual Question Answering in Roadside Perception Systems for Intelligent Transportation System Paper • 2511.18286 • Published Dec 26, 2025
AirCopBench: A Benchmark for Multi-drone Collaborative Embodied Perception and Reasoning Paper • 2511.11025 • Published Nov 22, 2025
Demystifying Hidden-State Recurrence: Switchable Latent Reasoning with On-Policy Reinforcement Learning Paper • 2606.13106 • Published 3 days ago • 18
OralGPT-Omni: A Versatile Dental Multimodal Large Language Model Paper • 2511.22055 • Published Nov 27, 2025 • 9
ACE: Attribution-Controlled Knowledge Editing for Multi-hop Factual Recall Paper • 2510.07896 • Published Oct 9, 2025 • 11
MOSS-ChatV: Reinforcement Learning with Process Reasoning Reward for Video Temporal Reasoning Paper • 2509.21113 • Published Sep 25, 2025 • 6
Hallucination at a Glance: Controlled Visual Edits and Fine-Grained Multimodal Learning Paper • 2506.07227 • Published Jun 8, 2025
How to Enable LLM with 3D Capacity? A Survey of Spatial Reasoning in LLM Paper • 2504.05786 • Published Apr 8, 2025
Towards Better Dental AI: A Multimodal Benchmark and Instruction Dataset for Panoramic X-ray Analysis Paper • 2509.09254 • Published Sep 11, 2025 • 6