Picture for Jemin Lee

Jemin Lee

A Survey on Inference Engines for Large Language Models: Perspectives on Optimization and Efficiency

Add code
May 03, 2025
Figure 1 for A Survey on Inference Engines for Large Language Models: Perspectives on Optimization and Efficiency
Figure 2 for A Survey on Inference Engines for Large Language Models: Perspectives on Optimization and Efficiency
Figure 3 for A Survey on Inference Engines for Large Language Models: Perspectives on Optimization and Efficiency
Figure 4 for A Survey on Inference Engines for Large Language Models: Perspectives on Optimization and Efficiency
Viaarxiv icon

LLM-Guided Open RAN: Empowering Hierarchical RAN Intelligent Control

Add code
Apr 25, 2025
Figure 1 for LLM-Guided Open RAN: Empowering Hierarchical RAN Intelligent Control
Figure 2 for LLM-Guided Open RAN: Empowering Hierarchical RAN Intelligent Control
Figure 3 for LLM-Guided Open RAN: Empowering Hierarchical RAN Intelligent Control
Figure 4 for LLM-Guided Open RAN: Empowering Hierarchical RAN Intelligent Control
Viaarxiv icon

QuantuneV2: Compiler-Based Local Metric-Driven Mixed Precision Quantization for Practical Embedded AI Applications

Add code
Jan 13, 2025
Figure 1 for QuantuneV2: Compiler-Based Local Metric-Driven Mixed Precision Quantization for Practical Embedded AI Applications
Figure 2 for QuantuneV2: Compiler-Based Local Metric-Driven Mixed Precision Quantization for Practical Embedded AI Applications
Figure 3 for QuantuneV2: Compiler-Based Local Metric-Driven Mixed Precision Quantization for Practical Embedded AI Applications
Figure 4 for QuantuneV2: Compiler-Based Local Metric-Driven Mixed Precision Quantization for Practical Embedded AI Applications
Viaarxiv icon

ML$^2$Tuner: Efficient Code Tuning via Multi-Level Machine Learning Models

Add code
Nov 16, 2024
Viaarxiv icon

A Comprehensive Evaluation of Quantized Instruction-Tuned Large Language Models: An Experimental Analysis up to 405B

Add code
Sep 17, 2024
Figure 1 for A Comprehensive Evaluation of Quantized Instruction-Tuned Large Language Models: An Experimental Analysis up to 405B
Figure 2 for A Comprehensive Evaluation of Quantized Instruction-Tuned Large Language Models: An Experimental Analysis up to 405B
Figure 3 for A Comprehensive Evaluation of Quantized Instruction-Tuned Large Language Models: An Experimental Analysis up to 405B
Figure 4 for A Comprehensive Evaluation of Quantized Instruction-Tuned Large Language Models: An Experimental Analysis up to 405B
Viaarxiv icon

Mixed Non-linear Quantization for Vision Transformers

Add code
Jul 26, 2024
Figure 1 for Mixed Non-linear Quantization for Vision Transformers
Figure 2 for Mixed Non-linear Quantization for Vision Transformers
Figure 3 for Mixed Non-linear Quantization for Vision Transformers
Figure 4 for Mixed Non-linear Quantization for Vision Transformers
Viaarxiv icon

Visual Preference Inference: An Image Sequence-Based Preference Reasoning in Tabletop Object Manipulation

Add code
Mar 18, 2024
Figure 1 for Visual Preference Inference: An Image Sequence-Based Preference Reasoning in Tabletop Object Manipulation
Figure 2 for Visual Preference Inference: An Image Sequence-Based Preference Reasoning in Tabletop Object Manipulation
Figure 3 for Visual Preference Inference: An Image Sequence-Based Preference Reasoning in Tabletop Object Manipulation
Figure 4 for Visual Preference Inference: An Image Sequence-Based Preference Reasoning in Tabletop Object Manipulation
Viaarxiv icon

Positioning Using Wireless Networks: Applications, Recent Progress and Future Challenges

Add code
Mar 18, 2024
Figure 1 for Positioning Using Wireless Networks: Applications, Recent Progress and Future Challenges
Figure 2 for Positioning Using Wireless Networks: Applications, Recent Progress and Future Challenges
Figure 3 for Positioning Using Wireless Networks: Applications, Recent Progress and Future Challenges
Figure 4 for Positioning Using Wireless Networks: Applications, Recent Progress and Future Challenges
Viaarxiv icon

Q-HyViT: Post-Training Quantization for Hybrid Vision Transformer with Bridge Block Reconstruction

Add code
Mar 22, 2023
Figure 1 for Q-HyViT: Post-Training Quantization for Hybrid Vision Transformer with Bridge Block Reconstruction
Figure 2 for Q-HyViT: Post-Training Quantization for Hybrid Vision Transformer with Bridge Block Reconstruction
Figure 3 for Q-HyViT: Post-Training Quantization for Hybrid Vision Transformer with Bridge Block Reconstruction
Figure 4 for Q-HyViT: Post-Training Quantization for Hybrid Vision Transformer with Bridge Block Reconstruction
Viaarxiv icon

Average Age of Information Penalty of Short-Packet Communications with Packet Management

Add code
Oct 26, 2022
Figure 1 for Average Age of Information Penalty of Short-Packet Communications with Packet Management
Figure 2 for Average Age of Information Penalty of Short-Packet Communications with Packet Management
Figure 3 for Average Age of Information Penalty of Short-Packet Communications with Packet Management
Figure 4 for Average Age of Information Penalty of Short-Packet Communications with Packet Management
Viaarxiv icon