Picture for An Yang

An Yang

additional authors not shown

RMTBench: Benchmarking LLMs Through Multi-Turn User-Centric Role-Playing

Add code
Jul 27, 2025
Viaarxiv icon

Group Sequence Policy Optimization

Add code
Jul 24, 2025
Viaarxiv icon

Qwen3 Embedding: Advancing Text Embedding and Reranking Through Foundation Models

Add code
Jun 05, 2025
Viaarxiv icon

Rationales Are Not Silver Bullets: Measuring the Impact of Rationales on Model Performance and Reliability

Add code
May 30, 2025
Viaarxiv icon

WorldPM: Scaling Human Preference Modeling

Add code
May 15, 2025
Viaarxiv icon

Qwen3 Technical Report

Add code
May 14, 2025
Viaarxiv icon

Qwen2.5-1M Technical Report

Add code
Jan 26, 2025
Viaarxiv icon

CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings

Add code
Jan 03, 2025
Viaarxiv icon

Qwen2.5 Technical Report

Add code
Dec 19, 2024
Viaarxiv icon

Language Models can Self-Lengthen to Generate Long Texts

Add code
Oct 31, 2024
Figure 1 for Language Models can Self-Lengthen to Generate Long Texts
Figure 2 for Language Models can Self-Lengthen to Generate Long Texts
Figure 3 for Language Models can Self-Lengthen to Generate Long Texts
Figure 4 for Language Models can Self-Lengthen to Generate Long Texts
Viaarxiv icon