Picture for Xihuai Wang

Xihuai Wang

SRFT: A Single-Stage Method with Supervised and Reinforcement Fine-Tuning for Reasoning

Add code
Jun 24, 2025
Viaarxiv icon

Audio Turing Test: Benchmarking the Human-likeness of Large Language Model-based Text-to-Speech Systems in Chinese

Add code
May 16, 2025
Viaarxiv icon

PMAT: Optimizing Action Generation Order in Multi-Agent Reinforcement Learning

Add code
Feb 23, 2025
Viaarxiv icon

Leveraging Dual Process Theory in Language Agent Framework for Real-time Simultaneous Human-AI Collaboration

Add code
Feb 17, 2025
Viaarxiv icon

HammerBench: Fine-Grained Function-Calling Evaluation in Real Mobile Device Scenarios

Add code
Dec 21, 2024
Viaarxiv icon

Computing Ex Ante Equilibrium in Heterogeneous Zero-Sum Team Games

Add code
Oct 02, 2024
Viaarxiv icon

Mutual Theory of Mind in Human-AI Collaboration: An Empirical Study with LLM-driven AI Agents in a Real-time Shared Workspace Task

Add code
Sep 13, 2024
Viaarxiv icon

Quantifying Zero-shot Coordination Capability with Behavior Preferring Partners

Add code
Oct 08, 2023
Viaarxiv icon

Order Matters: Agent-by-agent Policy Optimization

Add code
Feb 26, 2023
Figure 1 for Order Matters: Agent-by-agent Policy Optimization
Figure 2 for Order Matters: Agent-by-agent Policy Optimization
Figure 3 for Order Matters: Agent-by-agent Policy Optimization
Figure 4 for Order Matters: Agent-by-agent Policy Optimization
Viaarxiv icon

Model-based Multi-agent Reinforcement Learning: Recent Progress and Prospects

Add code
Mar 20, 2022
Figure 1 for Model-based Multi-agent Reinforcement Learning: Recent Progress and Prospects
Viaarxiv icon