Xipeng Qiu

R3-RAG: Learning Step-by-Step Reasoning and Retrieval for LLMs via Reinforcement Learning

May 26, 2025
Via arXiv

REARANK: Reasoning Re-ranking Agent via Reinforcement Learning

May 26, 2025
Via arXiv

ConvSearch-R1: Enhancing Query Reformulation for Conversational Search with Reasoning via Reinforcement Learning

May 21, 2025
Via arXiv

Code2Logic: Game-Code-Driven Data Synthesis for Enhancing VLMs General Reasoning

May 20, 2025
Via arXiv

Teach2Eval: An Indirect Evaluation Method for LLM by Judging How It Teaches

May 18, 2025
Via arXiv

Reinforced Interactive Continual Learning via Real-time Noisy Human Feedback

May 15, 2025
Via arXiv

Task-Core Memory Management and Consolidation for Long-term Continual Learning

May 15, 2025
Via arXiv

Towards Understanding the Nature of Attention with Low-Rank Sparse Decomposition

Apr 29, 2025
Via arXiv

VisuoThink: Empowering LVLM Reasoning with Multimodal Tree Search

Apr 12, 2025
Via arXiv

Revisiting LLM Evaluation through Mechanism Interpretability: a New Metric and Model Utility Law

Apr 10, 2025
Via arXiv