Picture for Jiahao Zhang

Jiahao Zhang

Towards High-Order Mean Flow Generative Models: Feasibility, Expressivity, and Provably Efficient Criteria

Add code
Aug 09, 2025
Viaarxiv icon

T2VWorldBench: A Benchmark for Evaluating World Knowledge in Text-to-Video Generation

Add code
Jul 24, 2025
Viaarxiv icon

Thinking Isn't an Illusion: Overcoming the Limitations of Reasoning Models via Tool Augmentations

Add code
Jul 23, 2025
Viaarxiv icon

Causal-aware Large Language Models: Enhancing Decision-Making Through Learning, Adapting and Acting

Add code
May 30, 2025
Viaarxiv icon

T2VTextBench: A Human Evaluation Benchmark for Textual Control in Video Generation Models

Add code
May 08, 2025
Viaarxiv icon

T2VPhysBench: A First-Principles Benchmark for Physical Consistency in Text-to-Video Generation

Add code
May 01, 2025
Viaarxiv icon

E-InMeMo: Enhanced Prompting for Visual In-Context Learning

Add code
Apr 25, 2025
Viaarxiv icon

Dimension-Free Decision Calibration for Nonlinear Loss Functions

Add code
Apr 22, 2025
Viaarxiv icon

Provable Failure of Language Models in Learning Majority Boolean Logic via Gradient Descent

Add code
Apr 07, 2025
Viaarxiv icon

Can You Count to Nine? A Human Evaluation Benchmark for Counting Limits in Modern Text-to-Video Models

Add code
Apr 05, 2025
Viaarxiv icon