Picture for Xuebo Liu

Xuebo Liu

Loong: A Human-Like Long Document Translation Agent with Observe-and-Act Adaptive Context Selection

Add code
May 28, 2026
Viaarxiv icon

Mitigating Context-Memory Conflicts in LLMs through Dynamic Cognitive Reconciliation Decoding

Add code
May 12, 2026
Viaarxiv icon

MASPO: Joint Prompt Optimization for LLM-based Multi-Agent Systems

Add code
May 07, 2026
Viaarxiv icon

OGER: A Robust Offline-Guided Exploration Reward for Hybrid Reinforcement Learning

Add code
Apr 20, 2026
Viaarxiv icon

NoveltyAgent: Autonomous Novelty Reporting Agent with Point-wise Novelty Analysis and Self-Validation

Add code
Mar 21, 2026
Viaarxiv icon

RouterKGQA: Specialized--General Model Routing for Constraint-Aware Knowledge Graph Question Answering

Add code
Mar 20, 2026
Viaarxiv icon

AgentDropoutV2: Optimizing Information Flow in Multi-Agent Systems via Test-Time Rectify-or-Reject Pruning

Add code
Feb 26, 2026
Viaarxiv icon

PACE: Defying the Scaling Hypothesis of Exploration in Iterative Alignment for Mathematical Reasoning

Add code
Feb 05, 2026
Viaarxiv icon

Stop Rewarding Hallucinated Steps: Faithfulness-Aware Step-Level Reinforcement Learning for Small Reasoning Models

Add code
Feb 05, 2026
Viaarxiv icon

Think Dense, Not Long: Dynamic Decoupled Conditional Advantage for Efficient Reasoning

Add code
Feb 02, 2026
Viaarxiv icon