
Zekai Huang

Your Vision-Language Model Can't Even Count to 20: Exposing the Failures of VLMs in Compositional Counting

Oct 06, 2025
Via arXiv

Can You Count to Nine? A Human Evaluation Benchmark for Counting Limits in Modern Text-to-Video Models

Apr 05, 2025
Via arXiv