Picture for Jiayan Huo

Jiayan Huo

T2VTextBench: A Human Evaluation Benchmark for Textual Control in Video Generation Models

Add code
May 08, 2025
Viaarxiv icon

T2VPhysBench: A First-Principles Benchmark for Physical Consistency in Text-to-Video Generation

Add code
May 01, 2025
Viaarxiv icon

Can You Count to Nine? A Human Evaluation Benchmark for Counting Limits in Modern Text-to-Video Models

Add code
Apr 05, 2025
Viaarxiv icon

Text-to-Image Diffusion Models Cannot Count, and Prompt Refinement Cannot Help

Add code
Mar 10, 2025
Figure 1 for Text-to-Image Diffusion Models Cannot Count, and Prompt Refinement Cannot Help
Figure 2 for Text-to-Image Diffusion Models Cannot Count, and Prompt Refinement Cannot Help
Figure 3 for Text-to-Image Diffusion Models Cannot Count, and Prompt Refinement Cannot Help
Figure 4 for Text-to-Image Diffusion Models Cannot Count, and Prompt Refinement Cannot Help
Viaarxiv icon

Fast Gradient Computation for RoPE Attention in Almost Linear Time

Add code
Dec 23, 2024
Viaarxiv icon