Dayiheng Liu

Teaching LLM to Reason: Reinforcement Learning from Algorithmic Problems without Code

Jul 10, 2025
Via arXiv

CoRT: Code-integrated Reasoning within Thinking

Jun 12, 2025
Via arXiv

Qwen3 Embedding: Advancing Text Embedding and Reranking Through Foundation Models

Jun 05, 2025
Via arXiv

MTR-Bench: A Comprehensive Benchmark for Multi-Turn Reasoning Evaluation

May 26, 2025
Via arXiv

WorldPM: Scaling Human Preference Modeling

May 15, 2025
Via arXiv

Parallel Scaling Law for Language Models

May 15, 2025
Via arXiv

Qwen3 Technical Report

May 14, 2025
Via arXiv

Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free

May 10, 2025
Via arXiv

START: Self-taught Reasoner with Tools

Mar 07, 2025
Via arXiv

DataMan: Data Manager for Pre-training Large Language Models

Feb 26, 2025
Via arXiv