Guohao Dai

VGDFR: Diffusion-based Video Generation with Dynamic Latent Frame Rate

Apr 16, 2025
Via arXiv

SpecEE: Accelerating Large Language Model Inference with Speculative Early Exiting

Apr 11, 2025
Via arXiv

DiTFastAttnV2: Head-wise Attention Compression for Multi-Modality Diffusion Transformers

Mar 28, 2025
Via arXiv

Megrez-Omni Technical Report

Feb 19, 2025
Via arXiv

DLFR-VAE: Dynamic Latent Frame Rate VAE for Video Generation

Feb 17, 2025
Via arXiv

FrameFusion: Combining Similarity and Importance for Video Token Reduction on Large Visual Language Models

Dec 30, 2024
Via arXiv

MBQ: Modality-Balanced Quantization for Large Vision-Language Models

Dec 27, 2024
Via arXiv

E-CAR: Efficient Continuous Autoregressive Image Generation via Multistage Modeling

Dec 19, 2024
Via arXiv

Automating Energy-Efficient GPU Kernel Generation: A Fast Search-Based Compilation Approach

Nov 28, 2024
Via arXiv

SoftmAP: Software-Hardware Co-design for Integer-Only Softmax on Associative Processors

Nov 26, 2024
Via arXiv