Picture for Jiahao Zhang

Jiahao Zhang

Causal-aware Large Language Models: Enhancing Decision-Making Through Learning, Adapting and Acting

Add code
May 30, 2025
Viaarxiv icon

T2VTextBench: A Human Evaluation Benchmark for Textual Control in Video Generation Models

Add code
May 08, 2025
Viaarxiv icon

T2VPhysBench: A First-Principles Benchmark for Physical Consistency in Text-to-Video Generation

Add code
May 01, 2025
Viaarxiv icon

E-InMeMo: Enhanced Prompting for Visual In-Context Learning

Add code
Apr 25, 2025
Viaarxiv icon

Dimension-Free Decision Calibration for Nonlinear Loss Functions

Add code
Apr 22, 2025
Viaarxiv icon

Provable Failure of Language Models in Learning Majority Boolean Logic via Gradient Descent

Add code
Apr 07, 2025
Viaarxiv icon

Can You Count to Nine? A Human Evaluation Benchmark for Counting Limits in Modern Text-to-Video Models

Add code
Apr 05, 2025
Viaarxiv icon

MEET: A Million-Scale Dataset for Fine-Grained Geospatial Scene Classification with Zoom-Free Remote Sensing Imagery

Add code
Mar 14, 2025
Viaarxiv icon

Text-to-Image Diffusion Models Cannot Count, and Prompt Refinement Cannot Help

Add code
Mar 10, 2025
Viaarxiv icon

Integrated Computation and Communication with Fiber-optic Transmissions

Add code
Mar 04, 2025
Viaarxiv icon