Picture for Zhao Song

Zhao Song

Only Large Weights (And Not Skip Connections) Can Prevent the Perils of Rank Collapse

Add code
May 22, 2025
Viaarxiv icon

Fast RoPE Attention: Combining the Polynomial Method and Fast Fourier Transform

Add code
May 17, 2025
Viaarxiv icon

T2VTextBench: A Human Evaluation Benchmark for Textual Control in Video Generation Models

Add code
May 08, 2025
Viaarxiv icon

T2VPhysBench: A First-Principles Benchmark for Physical Consistency in Text-to-Video Generation

Add code
May 01, 2025
Viaarxiv icon

Attention Mechanism, Max-Affine Partition, and Universal Approximation

Add code
Apr 28, 2025
Viaarxiv icon

Discriminator-Free Direct Preference Optimization for Video Diffusion

Add code
Apr 11, 2025
Viaarxiv icon

Do LLMs trust AI regulation? Emerging behaviour of game-theoretic LLM agents

Add code
Apr 11, 2025
Figure 1 for Do LLMs trust AI regulation? Emerging behaviour of game-theoretic LLM agents
Figure 2 for Do LLMs trust AI regulation? Emerging behaviour of game-theoretic LLM agents
Figure 3 for Do LLMs trust AI regulation? Emerging behaviour of game-theoretic LLM agents
Figure 4 for Do LLMs trust AI regulation? Emerging behaviour of game-theoretic LLM agents
Viaarxiv icon

Provable Failure of Language Models in Learning Majority Boolean Logic via Gradient Descent

Add code
Apr 07, 2025
Viaarxiv icon

Can You Count to Nine? A Human Evaluation Benchmark for Counting Limits in Modern Text-to-Video Models

Add code
Apr 05, 2025
Viaarxiv icon

Exploring the Limits of KV Cache Compression in Visual Autoregressive Transformers

Add code
Mar 19, 2025
Figure 1 for Exploring the Limits of KV Cache Compression in Visual Autoregressive Transformers
Figure 2 for Exploring the Limits of KV Cache Compression in Visual Autoregressive Transformers
Viaarxiv icon