Picture for Xian Li

Xian Li

Agentic Conversational Search with Contextualized Reasoning via Reinforcement Learning

Add code
Jan 19, 2026
Viaarxiv icon

The Llama 4 Herd: Architecture, Training, Evaluation, and Deployment Notes

Add code
Jan 15, 2026
Viaarxiv icon

VAR RL Done Right: Tackling Asynchronous Policy Conflicts in Visual Autoregressive Generation

Add code
Jan 05, 2026
Viaarxiv icon

NextFlow: Unified Sequential Modeling Activates Multimodal Understanding and Generation

Add code
Jan 05, 2026
Viaarxiv icon

Seedance 1.5 pro: A Native Audio-Visual Joint Generation Foundation Model

Add code
Dec 23, 2025
Viaarxiv icon

Scaling Agent Learning via Experience Synthesis

Add code
Nov 10, 2025
Figure 1 for Scaling Agent Learning via Experience Synthesis
Figure 2 for Scaling Agent Learning via Experience Synthesis
Figure 3 for Scaling Agent Learning via Experience Synthesis
Figure 4 for Scaling Agent Learning via Experience Synthesis
Viaarxiv icon

SoccerNet 2025 Challenges Results

Add code
Aug 26, 2025
Viaarxiv icon

UniConv: Unifying Retrieval and Response Generation for Large Language Models in Conversations

Add code
Jul 09, 2025
Figure 1 for UniConv: Unifying Retrieval and Response Generation for Large Language Models in Conversations
Figure 2 for UniConv: Unifying Retrieval and Response Generation for Large Language Models in Conversations
Figure 3 for UniConv: Unifying Retrieval and Response Generation for Large Language Models in Conversations
Figure 4 for UniConv: Unifying Retrieval and Response Generation for Large Language Models in Conversations
Viaarxiv icon

NaturalThoughts: Selecting and Distilling Reasoning Traces for General Reasoning Tasks

Add code
Jul 02, 2025
Figure 1 for NaturalThoughts: Selecting and Distilling Reasoning Traces for General Reasoning Tasks
Figure 2 for NaturalThoughts: Selecting and Distilling Reasoning Traces for General Reasoning Tasks
Figure 3 for NaturalThoughts: Selecting and Distilling Reasoning Traces for General Reasoning Tasks
Figure 4 for NaturalThoughts: Selecting and Distilling Reasoning Traces for General Reasoning Tasks
Viaarxiv icon

Bridging Offline and Online Reinforcement Learning for LLMs

Add code
Jun 26, 2025
Viaarxiv icon