Picture for Haoqian Wang

Haoqian Wang

Learning to Search: A Decision-Based Agent for Knowledge-Based Visual Question Answering

Add code
Apr 09, 2026
Viaarxiv icon

Stabilizing Unsupervised Self-Evolution of MLLMs via Continuous Softened Retracing reSampling

Add code
Apr 04, 2026
Viaarxiv icon

When Models Judge Themselves: Unsupervised Self-Evolution for Multimodal Reasoning

Add code
Mar 22, 2026
Viaarxiv icon

Efficient Long-Horizon GUI Agents via Training-Free KV Cache Compression

Add code
Feb 27, 2026
Viaarxiv icon

R3G: A Reasoning--Retrieval--Reranking Framework for Vision-Centric Answer Generation

Add code
Jan 25, 2026
Viaarxiv icon

Language-Guided and Motion-Aware Gait Representation for Generalizable Recognition

Add code
Jan 17, 2026
Viaarxiv icon

Taming Hallucinations: Boosting MLLMs' Video Understanding via Counterfactual Video Generation

Add code
Dec 30, 2025
Viaarxiv icon

EchoMotion: Unified Human Video and Motion Generation via Dual-Modality Diffusion Transformer

Add code
Dec 21, 2025
Viaarxiv icon

Measuring and Controlling the Spectral Bias for Self-Supervised Image Denoising

Add code
Oct 01, 2025
Viaarxiv icon

DreamSwapV: Mask-guided Subject Swapping for Any Customized Video Editing

Add code
Aug 20, 2025
Viaarxiv icon