Wayne Xin Zhao

AVC-DPO: Aligned Video Captioning via Direct Preference Optimization

Jul 02, 2025
Via arXiv

Reasoning with Exploration: An Entropy Perspective

Jun 17, 2025
Via arXiv

ICPC-Eval: Probing the Frontiers of LLM Reasoning with Competitive Programming Contests

Jun 05, 2025
Via arXiv

Towards Effective Code-Integrated Reasoning

May 30, 2025
Via arXiv

Reinforced Informativeness Optimization for Long-Form Retrieval-Augmented Generation

May 27, 2025
Via arXiv

MMATH: A Multilingual Benchmark for Mathematical Reasoning

May 25, 2025
Via arXiv

ManuSearch: Democratizing Deep Search in Large Language Models with a Transparent and Open Multi-Agent Framework

May 23, 2025
Via arXiv

R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning

May 22, 2025
Via arXiv

DeepRec: Towards a Deep Dive Into the Item Space with Large Language Model Based Recommendation

May 22, 2025
Via arXiv

Incentivizing Dual Process Thinking for Efficient Large Language Model Reasoning

May 22, 2025
Via arXiv