Picture for Yuzhang Shang

Yuzhang Shang

ButterflyQuant: Ultra-low-bit LLM Quantization through Learnable Orthogonal Butterfly Transforms

Add code
Sep 11, 2025
Viaarxiv icon

When Tokens Talk Too Much: A Survey of Multimodal Long-Context Token Compression across Images, Videos, and Audios

Add code
Jul 27, 2025
Figure 1 for When Tokens Talk Too Much: A Survey of Multimodal Long-Context Token Compression across Images, Videos, and Audios
Figure 2 for When Tokens Talk Too Much: A Survey of Multimodal Long-Context Token Compression across Images, Videos, and Audios
Figure 3 for When Tokens Talk Too Much: A Survey of Multimodal Long-Context Token Compression across Images, Videos, and Audios
Figure 4 for When Tokens Talk Too Much: A Survey of Multimodal Long-Context Token Compression across Images, Videos, and Audios
Viaarxiv icon

DD-Ranking: Rethinking the Evaluation of Dataset Distillation

Add code
May 19, 2025
Figure 1 for DD-Ranking: Rethinking the Evaluation of Dataset Distillation
Figure 2 for DD-Ranking: Rethinking the Evaluation of Dataset Distillation
Figure 3 for DD-Ranking: Rethinking the Evaluation of Dataset Distillation
Figure 4 for DD-Ranking: Rethinking the Evaluation of Dataset Distillation
Viaarxiv icon

VGDFR: Diffusion-based Video Generation with Dynamic Latent Frame Rate

Add code
Apr 16, 2025
Figure 1 for VGDFR: Diffusion-based Video Generation with Dynamic Latent Frame Rate
Figure 2 for VGDFR: Diffusion-based Video Generation with Dynamic Latent Frame Rate
Figure 3 for VGDFR: Diffusion-based Video Generation with Dynamic Latent Frame Rate
Figure 4 for VGDFR: Diffusion-based Video Generation with Dynamic Latent Frame Rate
Viaarxiv icon

GSQ-Tuning: Group-Shared Exponents Integer in Fully Quantized Training for LLMs On-Device Fine-tuning

Add code
Feb 18, 2025
Figure 1 for GSQ-Tuning: Group-Shared Exponents Integer in Fully Quantized Training for LLMs On-Device Fine-tuning
Figure 2 for GSQ-Tuning: Group-Shared Exponents Integer in Fully Quantized Training for LLMs On-Device Fine-tuning
Figure 3 for GSQ-Tuning: Group-Shared Exponents Integer in Fully Quantized Training for LLMs On-Device Fine-tuning
Figure 4 for GSQ-Tuning: Group-Shared Exponents Integer in Fully Quantized Training for LLMs On-Device Fine-tuning
Viaarxiv icon

Benchmarking Post-Training Quantization in LLMs: Comprehensive Taxonomy, Unified Evaluation, and Comparative Analysis

Add code
Feb 18, 2025
Figure 1 for Benchmarking Post-Training Quantization in LLMs: Comprehensive Taxonomy, Unified Evaluation, and Comparative Analysis
Figure 2 for Benchmarking Post-Training Quantization in LLMs: Comprehensive Taxonomy, Unified Evaluation, and Comparative Analysis
Figure 3 for Benchmarking Post-Training Quantization in LLMs: Comprehensive Taxonomy, Unified Evaluation, and Comparative Analysis
Figure 4 for Benchmarking Post-Training Quantization in LLMs: Comprehensive Taxonomy, Unified Evaluation, and Comparative Analysis
Viaarxiv icon

PTQ1.61: Push the Real Limit of Extremely Low-Bit Post-Training Quantization Methods for Large Language Models

Add code
Feb 18, 2025
Figure 1 for PTQ1.61: Push the Real Limit of Extremely Low-Bit Post-Training Quantization Methods for Large Language Models
Figure 2 for PTQ1.61: Push the Real Limit of Extremely Low-Bit Post-Training Quantization Methods for Large Language Models
Figure 3 for PTQ1.61: Push the Real Limit of Extremely Low-Bit Post-Training Quantization Methods for Large Language Models
Figure 4 for PTQ1.61: Push the Real Limit of Extremely Low-Bit Post-Training Quantization Methods for Large Language Models
Viaarxiv icon

DLFR-VAE: Dynamic Latent Frame Rate VAE for Video Generation

Add code
Feb 17, 2025
Figure 1 for DLFR-VAE: Dynamic Latent Frame Rate VAE for Video Generation
Figure 2 for DLFR-VAE: Dynamic Latent Frame Rate VAE for Video Generation
Figure 3 for DLFR-VAE: Dynamic Latent Frame Rate VAE for Video Generation
Figure 4 for DLFR-VAE: Dynamic Latent Frame Rate VAE for Video Generation
Viaarxiv icon

E-CAR: Efficient Continuous Autoregressive Image Generation via Multistage Modeling

Add code
Dec 19, 2024
Figure 1 for E-CAR: Efficient Continuous Autoregressive Image Generation via Multistage Modeling
Figure 2 for E-CAR: Efficient Continuous Autoregressive Image Generation via Multistage Modeling
Figure 3 for E-CAR: Efficient Continuous Autoregressive Image Generation via Multistage Modeling
Figure 4 for E-CAR: Efficient Continuous Autoregressive Image Generation via Multistage Modeling
Viaarxiv icon

freePruner: A Training-free Approach for Large Multimodal Model Acceleration

Add code
Nov 23, 2024
Figure 1 for freePruner: A Training-free Approach for Large Multimodal Model Acceleration
Figure 2 for freePruner: A Training-free Approach for Large Multimodal Model Acceleration
Figure 3 for freePruner: A Training-free Approach for Large Multimodal Model Acceleration
Figure 4 for freePruner: A Training-free Approach for Large Multimodal Model Acceleration
Viaarxiv icon