Picture for Dongrui Liu

Dongrui Liu

ExGRPO: Learning to Reason from Experience

Add code
Oct 02, 2025
Figure 1 for ExGRPO: Learning to Reason from Experience
Figure 2 for ExGRPO: Learning to Reason from Experience
Figure 3 for ExGRPO: Learning to Reason from Experience
Figure 4 for ExGRPO: Learning to Reason from Experience
Viaarxiv icon

Reasoning over Boundaries: Enhancing Specification Alignment via Test-time Delibration

Add code
Sep 18, 2025
Viaarxiv icon

The LLM Already Knows: Estimating LLM-Perceived Question Difficulty via Hidden Representations

Add code
Sep 16, 2025
Viaarxiv icon

A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence

Add code
Jul 28, 2025
Viaarxiv icon

SafeWork-R1: Coevolving Safety and Intelligence under the AI-45$^{\circ}$ Law

Add code
Jul 24, 2025
Figure 1 for SafeWork-R1: Coevolving Safety and Intelligence under the AI-45$^{\circ}$ Law
Figure 2 for SafeWork-R1: Coevolving Safety and Intelligence under the AI-45$^{\circ}$ Law
Figure 3 for SafeWork-R1: Coevolving Safety and Intelligence under the AI-45$^{\circ}$ Law
Figure 4 for SafeWork-R1: Coevolving Safety and Intelligence under the AI-45$^{\circ}$ Law
Viaarxiv icon

Frontier AI Risk Management Framework in Practice: A Risk Analysis Technical Report

Add code
Jul 22, 2025
Figure 1 for Frontier AI Risk Management Framework in Practice: A Risk Analysis Technical Report
Figure 2 for Frontier AI Risk Management Framework in Practice: A Risk Analysis Technical Report
Figure 3 for Frontier AI Risk Management Framework in Practice: A Risk Analysis Technical Report
Figure 4 for Frontier AI Risk Management Framework in Practice: A Risk Analysis Technical Report
Viaarxiv icon

Accelerating Diffusion Large Language Models with SlowFast: The Three Golden Principles

Add code
Jun 12, 2025
Viaarxiv icon

Alita: Generalist Agent Enabling Scalable Agentic Reasoning with Minimal Predefinition and Maximal Self-Evolution

Add code
May 26, 2025
Viaarxiv icon

Benchmarking Multimodal Knowledge Conflict for Large Multimodal Models

Add code
May 26, 2025
Viaarxiv icon

A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond

Add code
Mar 27, 2025
Viaarxiv icon