Picture for Wangbo Zhao

Wangbo Zhao

K-Forcing: Joint Next-K-Token Decoding via Push-Forward Language Modeling

Add code
Jun 09, 2026
Viaarxiv icon

WorldOlympiad: Can Your World Model Survive a Triathlon?

Add code
Jun 09, 2026
Viaarxiv icon

GroupToM-Bench: Benchmarking Group Theory of Mind and Nonlinear Social Emergence in MLLMs

Add code
Jun 02, 2026
Viaarxiv icon

SkillDAG: Self-Evolving Typed Skill Graphs for LLM Skill Selection at Scale

Add code
Jun 02, 2026
Viaarxiv icon

HumanNOVA: Photorealistic, Universal and Rapid 3D Human Avatar Modeling from a Single Image

Add code
Jun 01, 2026
Viaarxiv icon

K-Sort Eval: Efficient Preference Evaluation for Visual Generation via Corrected VLM-as-a-Judge

Add code
Feb 10, 2026
Viaarxiv icon

FARTrack: Fast Autoregressive Visual Tracking with High Performance

Add code
Feb 03, 2026
Viaarxiv icon

CaveAgent: Transforming LLMs into Stateful Runtime Operators

Add code
Jan 04, 2026
Viaarxiv icon

RAPID^3: Tri-Level Reinforced Acceleration Policies for Diffusion Transformer

Add code
Sep 26, 2025
Viaarxiv icon

MDK12-Bench: A Comprehensive Evaluation of Multimodal Large Language Models on Multidisciplinary Exams

Add code
Aug 09, 2025
Viaarxiv icon