Sihyeong Park

A Survey on Inference Engines for Large Language Models: Perspectives on Optimization and Efficiency

May 03, 2025
Via arXiv

Elastic-DETR: Making Image Resolution Learnable with Content-Specific Network Prediction

Dec 09, 2024
Via arXiv

A Comprehensive Evaluation of Quantized Instruction-Tuned Large Language Models: An Experimental Analysis up to 405B

Sep 17, 2024
Via arXiv

Mixed Non-linear Quantization for Vision Transformers

Jul 26, 2024
Via arXiv