Picture for Zhao Song

Zhao Song

RichSpace: Enriching Text-to-Video Prompt Space via Text Embedding Interpolation

Add code
Jan 17, 2025
Viaarxiv icon

On the Computational Capability of Graph Neural Networks: A Circuit Complexity Bound Perspective

Add code
Jan 11, 2025
Viaarxiv icon

Text-to-Edit: Controllable End-to-End Video Ad Creation via Multimodal LLMs

Add code
Jan 10, 2025
Viaarxiv icon

Circuit Complexity Bounds for Visual Autoregressive Model

Add code
Jan 08, 2025
Viaarxiv icon

On Computational Limits and Provably Efficient Criteria of Visual Autoregressive Models: A Fine-Grained Complexity Analysis

Add code
Jan 08, 2025
Figure 1 for On Computational Limits and Provably Efficient Criteria of Visual Autoregressive Models: A Fine-Grained Complexity Analysis
Viaarxiv icon

Fast Gradient Computation for RoPE Attention in Almost Linear Time

Add code
Dec 23, 2024
Viaarxiv icon

Theoretical Constraints on the Expressive Power of $\mathsf{RoPE}$-based Tensor Attention Transformers

Add code
Dec 23, 2024
Viaarxiv icon

Grams: Gradient Descent with Adaptive Momentum Scaling

Add code
Dec 22, 2024
Figure 1 for Grams: Gradient Descent with Adaptive Momentum Scaling
Figure 2 for Grams: Gradient Descent with Adaptive Momentum Scaling
Figure 3 for Grams: Gradient Descent with Adaptive Momentum Scaling
Figure 4 for Grams: Gradient Descent with Adaptive Momentum Scaling
Viaarxiv icon

Numerical Pruning for Efficient Autoregressive Models

Add code
Dec 17, 2024
Figure 1 for Numerical Pruning for Efficient Autoregressive Models
Figure 2 for Numerical Pruning for Efficient Autoregressive Models
Figure 3 for Numerical Pruning for Efficient Autoregressive Models
Figure 4 for Numerical Pruning for Efficient Autoregressive Models
Viaarxiv icon

LazyDiT: Lazy Learning for the Acceleration of Diffusion Transformers

Add code
Dec 17, 2024
Viaarxiv icon