Qianguo Sun

Demystifying Reinforcement Learning for Long-Horizon Tool-Using Agents: A Comprehensive Recipe

Mar 23, 2026
Via arXiv

VSearcher: Long-Horizon Multimodal Search Agent via Reinforcement Learning

Mar 03, 2026
Via arXiv

Linear Preference Optimization: Decoupled Gradient Control via Absolute Regularization

Aug 20, 2025
Via arXiv

UniTTS: An end-to-end TTS system without decoupling of acoustic and semantic information

May 23, 2025
Via arXiv

Never Lost in the Middle: Improving Large Language Models via Attention Strengthening Question Answering

Nov 15, 2023
Via arXiv