Picture for Dehua Ma

Dehua Ma

Omni-RRM: Advancing Omni Reward Modeling via Automatic Rubric-Grounded Preference Synthesis

Add code
Jan 31, 2026
Viaarxiv icon

Kwai Keye-VL Technical Report

Add code
Jul 02, 2025
Viaarxiv icon

Scaling Test-time Compute for LLM Agents

Add code
Jun 15, 2025
Viaarxiv icon

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Add code
Feb 20, 2025
Viaarxiv icon