Picture for Xiangyu Xi

Xiangyu Xi

Enhancing Efficiency and Exploration in Reinforcement Learning for LLMs

Add code
May 24, 2025
Viaarxiv icon

Rethinking the Sampling Criteria in Reinforcement Learning for LLM Reasoning: A Competence-Difficulty Alignment Perspective

Add code
May 23, 2025
Viaarxiv icon

SampleMix: A Sample-wise Pre-training Data Mixing Strategey by Coordinating Data Quality and Diversity

Add code
Mar 03, 2025
Viaarxiv icon

Dialog-to-Actions: Building Task-Oriented Dialogue System via Action-Level Generation

Add code
Apr 03, 2023
Figure 1 for Dialog-to-Actions: Building Task-Oriented Dialogue System via Action-Level Generation
Figure 2 for Dialog-to-Actions: Building Task-Oriented Dialogue System via Action-Level Generation
Figure 3 for Dialog-to-Actions: Building Task-Oriented Dialogue System via Action-Level Generation
Figure 4 for Dialog-to-Actions: Building Task-Oriented Dialogue System via Action-Level Generation
Viaarxiv icon

MUSIED: A Benchmark for Event Detection from Multi-Source Heterogeneous Informal Texts

Add code
Nov 25, 2022
Figure 1 for MUSIED: A Benchmark for Event Detection from Multi-Source Heterogeneous Informal Texts
Figure 2 for MUSIED: A Benchmark for Event Detection from Multi-Source Heterogeneous Informal Texts
Figure 3 for MUSIED: A Benchmark for Event Detection from Multi-Source Heterogeneous Informal Texts
Figure 4 for MUSIED: A Benchmark for Event Detection from Multi-Source Heterogeneous Informal Texts
Viaarxiv icon

A Low-Cost, Controllable and Interpretable Task-Oriented Chatbot: With Real-World After-Sale Services as Example

Add code
May 13, 2022
Figure 1 for A Low-Cost, Controllable and Interpretable Task-Oriented Chatbot: With Real-World After-Sale Services as Example
Figure 2 for A Low-Cost, Controllable and Interpretable Task-Oriented Chatbot: With Real-World After-Sale Services as Example
Figure 3 for A Low-Cost, Controllable and Interpretable Task-Oriented Chatbot: With Real-World After-Sale Services as Example
Figure 4 for A Low-Cost, Controllable and Interpretable Task-Oriented Chatbot: With Real-World After-Sale Services as Example
Viaarxiv icon

Capturing Event Argument Interaction via A Bi-Directional Entity-Level Recurrent Decoder

Add code
Jul 01, 2021
Figure 1 for Capturing Event Argument Interaction via A Bi-Directional Entity-Level Recurrent Decoder
Figure 2 for Capturing Event Argument Interaction via A Bi-Directional Entity-Level Recurrent Decoder
Figure 3 for Capturing Event Argument Interaction via A Bi-Directional Entity-Level Recurrent Decoder
Figure 4 for Capturing Event Argument Interaction via A Bi-Directional Entity-Level Recurrent Decoder
Viaarxiv icon