Picture for Weiwen Liu

Weiwen Liu

MuonRec: Shifting the Optimizer Paradigm Beyond Adam in Scalable Generative Recommendation

Add code
Feb 28, 2026
Viaarxiv icon

Plan-MCTS: Plan Exploration for Action Exploitation in Web Navigation

Add code
Feb 15, 2026
Viaarxiv icon

LogitsCoder: Towards Efficient Chain-of-Thought Path Search via Logits Preference Decoding for Code Generation

Add code
Feb 15, 2026
Viaarxiv icon

Adaptive Milestone Reward for GUI Agents

Add code
Feb 12, 2026
Viaarxiv icon

ReMiT: RL-Guided Mid-Training for Iterative LLM Evolution

Add code
Feb 03, 2026
Viaarxiv icon

ARTIS: Agentic Risk-Aware Test-Time Scaling via Iterative Simulation

Add code
Feb 03, 2026
Viaarxiv icon

ToolACE-MCP: Generalizing History-Aware Routing from MCP Tools to the Agent Web

Add code
Jan 13, 2026
Viaarxiv icon

Agent-Dice: Disentangling Knowledge Updates via Geometric Consensus for Agent Continual Learning

Add code
Jan 08, 2026
Viaarxiv icon

LoopTool: Closing the Data-Training Loop for Robust LLM Tool Calls

Add code
Nov 18, 2025
Figure 1 for LoopTool: Closing the Data-Training Loop for Robust LLM Tool Calls
Figure 2 for LoopTool: Closing the Data-Training Loop for Robust LLM Tool Calls
Figure 3 for LoopTool: Closing the Data-Training Loop for Robust LLM Tool Calls
Figure 4 for LoopTool: Closing the Data-Training Loop for Robust LLM Tool Calls
Viaarxiv icon

Fints: Efficient Inference-Time Personalization for LLMs with Fine-Grained Instance-Tailored Steering

Add code
Oct 31, 2025
Viaarxiv icon