Wanli Ouyang

School of Electrical and Information Engineering, The University of Sydney, Australia

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

May 28, 2025

LabUtopia: High-Fidelity Simulation and Hierarchical Benchmark for Scientific Embodied Agents

May 28, 2025

The Avengers: A Simple Recipe for Uniting Smaller Language Models to Challenge Proprietary Giants

May 26, 2025

GraphGen: Enhancing Supervised Fine-Tuning for LLMs with Knowledge-Driven Synthetic Data Generation

May 26, 2025

MOOSE-Chem2: Exploring LLM Limits in Fine-Grained Scientific Hypothesis Discovery via Hierarchical Search

May 25, 2025

MOOSE-Chem3: Toward Experiment-Guided Hypothesis Ranking via Simulated Experimental Feedback

May 23, 2025

ChemMLLM: Chemical Multimodal Large Language Model

May 22, 2025

CPRet: A Dataset, Benchmark, and Model for Retrieval in Competitive Programming

May 19, 2025

AutoMat: Enabling Automated Crystal Structure Reconstruction from Microscopy via Agentic Tool Use

May 19, 2025

CompBench: Benchmarking Complex Instruction-guided Image Editing

May 18, 2025