Picture for Keze Wang

Keze Wang

Why Keep Your Doubts to Yourself? Trading Visual Uncertainties in Multi-Agent Bandit Systems

Add code
Jan 26, 2026
Viaarxiv icon

ResAgent: Entropy-based Prior Point Discovery and Visual Reasoning for Referring Expression Segmentation

Add code
Jan 23, 2026
Viaarxiv icon

Weather-R1: Logically Consistent Reinforcement Fine-Tuning for Multimodal Reasoning in Meteorology

Add code
Jan 20, 2026
Viaarxiv icon

Exploring Talking Head Models With Adjacent Frame Prior for Speech-Preserving Facial Expression Manipulation

Add code
Jan 19, 2026
Viaarxiv icon

Stable Language Guidance for Vision-Language-Action Models

Add code
Jan 07, 2026
Viaarxiv icon

3D-Agent:Tri-Modal Multi-Agent Collaboration for Scalable 3D Object Annotation

Add code
Jan 07, 2026
Viaarxiv icon

Robust Egocentric Referring Video Object Segmentation via Dual-Modal Causal Intervention

Add code
Dec 30, 2025
Viaarxiv icon

CoAgent: Collaborative Planning and Consistency Agent for Coherent Video Generation

Add code
Dec 27, 2025
Viaarxiv icon

Self-Rewarded Multimodal Coherent Reasoning Across Diverse Visual Domains

Add code
Dec 27, 2025
Viaarxiv icon

RevFFN: Memory-Efficient Full-Parameter Fine-Tuning of Mixture-of-Experts LLMs with Reversible Blocks

Add code
Dec 24, 2025
Viaarxiv icon