Picture for Ruihao Gong

Ruihao Gong

Selective Focus: Investigating Semantics Sensitivity in Post-training Quantization for Lane Detection

Add code
May 10, 2024
Viaarxiv icon

Fast and Controllable Post-training Sparsity: Learning Optimal Sparsity Allocation with Global Constraint in Minutes

Add code
May 09, 2024
Viaarxiv icon

LLM-QBench: A Benchmark Towards the Best Practice for Post-training Quantization of Large Language Models

Add code
May 09, 2024
Viaarxiv icon

2023 Low-Power Computer Vision Challenge (LPCVC) Summary

Add code
Mar 11, 2024
Figure 1 for 2023 Low-Power Computer Vision Challenge (LPCVC) Summary
Figure 2 for 2023 Low-Power Computer Vision Challenge (LPCVC) Summary
Figure 3 for 2023 Low-Power Computer Vision Challenge (LPCVC) Summary
Figure 4 for 2023 Low-Power Computer Vision Challenge (LPCVC) Summary
Viaarxiv icon

ProPD: Dynamic Token Tree Pruning and Generation for LLM Parallel Decoding

Add code
Feb 21, 2024
Figure 1 for ProPD: Dynamic Token Tree Pruning and Generation for LLM Parallel Decoding
Figure 2 for ProPD: Dynamic Token Tree Pruning and Generation for LLM Parallel Decoding
Figure 3 for ProPD: Dynamic Token Tree Pruning and Generation for LLM Parallel Decoding
Figure 4 for ProPD: Dynamic Token Tree Pruning and Generation for LLM Parallel Decoding
Viaarxiv icon

TFMQ-DM: Temporal Feature Maintenance Quantization for Diffusion Models

Add code
Nov 27, 2023
Viaarxiv icon

QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Models

Add code
Oct 12, 2023
Viaarxiv icon

Lossy and Lossless Post-training Model Size Compression

Add code
Aug 08, 2023
Viaarxiv icon

SysNoise: Exploring and Benchmarking Training-Deployment System Inconsistency

Add code
Jul 01, 2023
Viaarxiv icon

Outlier Suppression+: Accurate quantization of large language models by equivalent and optimal shifting and scaling

Add code
Apr 18, 2023
Figure 1 for Outlier Suppression+: Accurate quantization of large language models by equivalent and optimal shifting and scaling
Figure 2 for Outlier Suppression+: Accurate quantization of large language models by equivalent and optimal shifting and scaling
Figure 3 for Outlier Suppression+: Accurate quantization of large language models by equivalent and optimal shifting and scaling
Figure 4 for Outlier Suppression+: Accurate quantization of large language models by equivalent and optimal shifting and scaling
Viaarxiv icon