Picture for Shashank Nag

Shashank Nag

HELIOS: Adaptive Model And Early-Exit Selection for Efficient LLM Inference Serving

Add code
Apr 14, 2025
Viaarxiv icon

Shrinking the Giant : Quasi-Weightless Transformers for Low Energy Inference

Add code
Nov 04, 2024
Viaarxiv icon

ViTA: A Vision Transformer Inference Accelerator for Edge Applications

Add code
Feb 17, 2023
Viaarxiv icon