Zhongyuan Wang

When the Tool Decides: LLM Agents Defer Blindly to Graph Neural Network Tools, and Stronger Backbones Defer More

Jun 12, 2026
Via arXiv

ComAct: Reframing Professional Software Manipulation via COM-as-Action Paradigm

Jun 11, 2026
Via arXiv

StoryVideoQA: Scaling Deep Video Understanding with a Large-Scale, Multi-Genre and Auto-Generated Dataset

Jun 04, 2026
Via arXiv

Divide and Conquer: Reliable Multi-View Evidential Learning for Deepfake Detection

Jun 01, 2026
Via arXiv

OmniUMI: Towards Physically Grounded Robot Learning via Human-Aligned Multimodal Interaction

Apr 12, 2026
Via arXiv

BAAI Cardiac Agent: An intelligent multimodal agent for automated reasoning and diagnosis of cardiovascular diseases from cardiac magnetic resonance imaging

Apr 05, 2026
Via arXiv

ClawKeeper: Comprehensive Safety Protection for OpenClaw Agents Through Skills, Plugins, and Watchers

Mar 25, 2026
Via arXiv

Tutor-Student Reinforcement Learning: A Dynamic Curriculum for Robust Deepfake Detection

Mar 25, 2026
Via arXiv

PRM-as-a-Judge: A Dense Evaluation Paradigm for Fine-Grained Robotic Auditing

Mar 23, 2026
Via arXiv

SaPaVe: Towards Active Perception and Manipulation in Vision-Language-Action Models for Robotics

Mar 12, 2026
Via arXiv