Picture for Kangrui Mao

Kangrui Mao

ORBIT -- Open Recommendation Benchmark for Reproducible Research with Hidden Tests

Add code
Oct 30, 2025
Viaarxiv icon

DeepResearchGym: A Free, Transparent, and Reproducible Evaluation Sandbox for Deep Research

Add code
May 25, 2025
Figure 1 for DeepResearchGym: A Free, Transparent, and Reproducible Evaluation Sandbox for Deep Research
Figure 2 for DeepResearchGym: A Free, Transparent, and Reproducible Evaluation Sandbox for Deep Research
Figure 3 for DeepResearchGym: A Free, Transparent, and Reproducible Evaluation Sandbox for Deep Research
Figure 4 for DeepResearchGym: A Free, Transparent, and Reproducible Evaluation Sandbox for Deep Research
Viaarxiv icon

MMBench-Video: A Long-Form Multi-Shot Benchmark for Holistic Video Understanding

Add code
Jun 20, 2024
Viaarxiv icon

OAKINK2: A Dataset of Bimanual Hands-Object Manipulation in Complex Task Completion

Add code
Mar 28, 2024
Figure 1 for OAKINK2: A Dataset of Bimanual Hands-Object Manipulation in Complex Task Completion
Figure 2 for OAKINK2: A Dataset of Bimanual Hands-Object Manipulation in Complex Task Completion
Figure 3 for OAKINK2: A Dataset of Bimanual Hands-Object Manipulation in Complex Task Completion
Figure 4 for OAKINK2: A Dataset of Bimanual Hands-Object Manipulation in Complex Task Completion
Viaarxiv icon