Picture for Qi Liu

Qi Liu

Xidian University

TabClaw: An Interactive and Self-Evolving Agent for Spreadsheet Manipulation and Table Reasoning

Add code
Jun 09, 2026
Viaarxiv icon

Claw-R1: A Step-Level Data Middleware System for Agentic Reinforcement Learning

Add code
Jun 08, 2026
Viaarxiv icon

TIDE: Task-Isolated Diffusion for Unified Video Editing and Generation

Add code
Jun 06, 2026
Viaarxiv icon

SocraticPO: Policy Optimization via Interactive Guidance

Add code
Jun 03, 2026
Viaarxiv icon

The Flip Side of RLHF: On-Policy Feedback for Reward Model Self-Supervised Improvement

Add code
May 29, 2026
Viaarxiv icon

Entropy-KL Divergence-based Token Masking: A Novel Approach for Selective Fine-tuning of Large Language Models

Add code
May 28, 2026
Viaarxiv icon

UnityMAS-O: A General RL Optimization Framework for LLM-Based Multi-Agent Systems

Add code
May 26, 2026
Viaarxiv icon

What Really Improves Mathematical Reasoning: Structured Reasoning Signals Beyond Pure Code

Add code
May 19, 2026
Viaarxiv icon

More Edits, More Stable: Understanding the Lifelong Normalization in Sequential Model Editing

Add code
May 12, 2026
Viaarxiv icon

Polyphonia: Zero-Shot Timbre Transfer in Polyphonic Music with Acoustic-Informed Attention Calibration

Add code
May 11, 2026
Viaarxiv icon