Picture for Wei Yang

Wei Yang

Huazhong University of Science and Technology, Wuhan, China

Food-R1: A Unified Multi-Task Food Vision-Language Model with Reinforcement Learning

Add code
Jun 03, 2026
Viaarxiv icon

Memory Retrieval for Changing Preferences

Add code
Jun 02, 2026
Viaarxiv icon

PRISM: Synergizing Vision Foundation Models via Self-organized Expert Specialization

Add code
Jun 02, 2026
Viaarxiv icon

UnityMAS-O: A General RL Optimization Framework for LLM-Based Multi-Agent Systems

Add code
May 26, 2026
Viaarxiv icon

Tournament-GRPO: Group-Wise Tournament Rewards for Reinforcement Learning in Open-Ended Long-Form Generation

Add code
May 26, 2026
Viaarxiv icon

Evaluating the Expressive Appropriateness of Speech in Rich Contexts

Add code
May 10, 2026
Viaarxiv icon

FORTIS: Benchmarking Over-Privilege in Agent Skills

Add code
May 09, 2026
Viaarxiv icon

COEVO: Co-Evolutionary Framework for Joint Functional Correctness and PPA Optimization in LLM-Based RTL Generation

Add code
Apr 16, 2026
Viaarxiv icon

Hypergraph-State Collaborative Reasoning for Multi-Object Tracking

Add code
Apr 14, 2026
Viaarxiv icon

PRAISE: Prefix-Based Rollout Reuse in Agentic Search Training

Add code
Apr 04, 2026
Viaarxiv icon