Picture for Chunhui Zhang

Chunhui Zhang

PhyAVBench: A Challenging Audio Physics-Sensitivity Benchmark for Physically Grounded Text-to-Audio-Video Generation

Add code
Dec 30, 2025
Viaarxiv icon

How Far are Modern Trackers from UAV-Anti-UAV? A Million-Scale Benchmark and New Baseline

Add code
Dec 08, 2025
Viaarxiv icon

What Makes a Good Curriculum? Disentangling the Effects of Data Ordering on LLM Mathematical Reasoning

Add code
Oct 21, 2025
Viaarxiv icon

Mind the Gap: The Divergence Between Human and LLM-Generated Tasks

Add code
Aug 01, 2025
Viaarxiv icon

SoundMind: RL-Incentivized Logic Reasoning for Audio-Language Models

Add code
Jun 15, 2025
Viaarxiv icon

Music's Multimodal Complexity in AVQA: Why We Need More than General Multimodal LLMs

Add code
May 27, 2025
Viaarxiv icon

Systematic Bias in Large Language Models: Discrepant Response Patterns in Binary vs. Continuous Judgment Tasks

Add code
Apr 28, 2025
Viaarxiv icon

COST: Contrastive One-Stage Transformer for Vision-Language Small Object Tracking

Add code
Apr 02, 2025
Viaarxiv icon

ToM-RL: Reinforcement Learning Unlocks Theory of Mind in Small LLMs

Add code
Apr 02, 2025
Viaarxiv icon

Scaled Supervision is an Implicit Lipschitz Regularizer

Add code
Mar 19, 2025
Viaarxiv icon