Picture for Zhi Li

Zhi Li

Agents' Last Exam

Add code
Jun 03, 2026
Viaarxiv icon

PolySpeech-100: A Large-Scale Benchmark for Speech Understanding Across 100+ Languages and Dialects

Add code
May 31, 2026
Viaarxiv icon

Fisher-Preserving Guidance: Training-Free Manifold Constraints for Safe Diffusion Control

Add code
May 28, 2026
Viaarxiv icon

MUSE: Benchmarking Manufacturable, Functional, and Assemblable Text-to-CAD Generation

Add code
May 27, 2026
Viaarxiv icon

Random Walk on Point Clouds for Feature Detection

Add code
Apr 22, 2026
Viaarxiv icon

WebWorld: A Large-Scale World Model for Web Agent Training

Add code
Feb 16, 2026
Viaarxiv icon

ERNIE 5.0 Technical Report

Add code
Feb 04, 2026
Viaarxiv icon

Human-in-the-Loop Failure Recovery with Adaptive Task Allocation

Add code
Feb 03, 2026
Viaarxiv icon

Not All Negative Samples Are Equal: LLMs Learn Better from Plausible Reasoning

Add code
Feb 03, 2026
Viaarxiv icon

A Hitchhiker's Guide to Poisson Gradient Estimation

Add code
Feb 03, 2026
Viaarxiv icon