Picture for Weiming Lu

Weiming Lu

ClawGUI: A Unified Framework for Training, Evaluating, and Deploying GUI Agents

Add code
Apr 13, 2026
Viaarxiv icon

KnowU-Bench: Towards Interactive, Proactive, and Personalized Mobile Agent Evaluation

Add code
Apr 09, 2026
Viaarxiv icon

Seeing but Not Thinking: Routing Distraction in Multimodal Mixture-of-Experts

Add code
Apr 09, 2026
Viaarxiv icon

SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization

Add code
Apr 02, 2026
Viaarxiv icon

CoVerRL: Breaking the Consensus Trap in Label-Free Reasoning via Generator-Verifier Co-Evolution

Add code
Mar 18, 2026
Viaarxiv icon

Code-A1: Adversarial Evolving of Code LLM and Test LLM via Reinforcement Learning

Add code
Mar 16, 2026
Viaarxiv icon

Active Confusion Expression in Large Language Models: Leveraging World Models toward Better Social Reasoning

Add code
Oct 09, 2025
Viaarxiv icon

Cooper: Co-Optimizing Policy and Reward Models in Reinforcement Learning for Large Language Models

Add code
Aug 07, 2025
Figure 1 for Cooper: Co-Optimizing Policy and Reward Models in Reinforcement Learning for Large Language Models
Figure 2 for Cooper: Co-Optimizing Policy and Reward Models in Reinforcement Learning for Large Language Models
Figure 3 for Cooper: Co-Optimizing Policy and Reward Models in Reinforcement Learning for Large Language Models
Figure 4 for Cooper: Co-Optimizing Policy and Reward Models in Reinforcement Learning for Large Language Models
Viaarxiv icon

OmniEAR: Benchmarking Agent Reasoning in Embodied Tasks

Add code
Aug 07, 2025
Viaarxiv icon

Test-Time Reinforcement Learning for GUI Grounding via Region Consistency

Add code
Aug 07, 2025
Viaarxiv icon