Picture for Ruihao Gong

Ruihao Gong

MoDES: Accelerating Mixture-of-Experts Multimodal Large Language Models via Dynamic Expert Skipping

Add code
Nov 19, 2025
Figure 1 for MoDES: Accelerating Mixture-of-Experts Multimodal Large Language Models via Dynamic Expert Skipping
Figure 2 for MoDES: Accelerating Mixture-of-Experts Multimodal Large Language Models via Dynamic Expert Skipping
Figure 3 for MoDES: Accelerating Mixture-of-Experts Multimodal Large Language Models via Dynamic Expert Skipping
Figure 4 for MoDES: Accelerating Mixture-of-Experts Multimodal Large Language Models via Dynamic Expert Skipping
Viaarxiv icon

Phased DMD: Few-step Distribution Matching Distillation via Score Matching within Subintervals

Add code
Oct 31, 2025
Figure 1 for Phased DMD: Few-step Distribution Matching Distillation via Score Matching within Subintervals
Figure 2 for Phased DMD: Few-step Distribution Matching Distillation via Score Matching within Subintervals
Figure 3 for Phased DMD: Few-step Distribution Matching Distillation via Score Matching within Subintervals
Figure 4 for Phased DMD: Few-step Distribution Matching Distillation via Score Matching within Subintervals
Viaarxiv icon

LLMC+: Benchmarking Vision-Language Model Compression with a Plug-and-play Toolkit

Add code
Aug 13, 2025
Viaarxiv icon

Post-Training Quantization for Video Matting

Add code
Jun 12, 2025
Viaarxiv icon

Pre$^3$: Enabling Deterministic Pushdown Automata for Faster Structured LLM Generation

Add code
Jun 04, 2025
Viaarxiv icon

QVGen: Pushing the Limit of Quantized Video Generative Models

Add code
May 16, 2025
Viaarxiv icon

Hierarchical Balance Packing: Towards Efficient Supervised Fine-tuning for Long-Context LLM

Add code
Mar 10, 2025
Viaarxiv icon

PTSBench: A Comprehensive Post-Training Sparsity Benchmark Towards Algorithms and Models

Add code
Dec 10, 2024
Viaarxiv icon

HarmoniCa: Harmonizing Training and Inference for Better Feature Cache in Diffusion Transformer Acceleration

Add code
Oct 02, 2024
Viaarxiv icon

A Survey of Low-bit Large Language Models: Basics, Systems, and Algorithms

Add code
Sep 25, 2024
Figure 1 for A Survey of Low-bit Large Language Models: Basics, Systems, and Algorithms
Figure 2 for A Survey of Low-bit Large Language Models: Basics, Systems, and Algorithms
Figure 3 for A Survey of Low-bit Large Language Models: Basics, Systems, and Algorithms
Figure 4 for A Survey of Low-bit Large Language Models: Basics, Systems, and Algorithms
Viaarxiv icon