Picture for Soutrik Mukherjee

Soutrik Mukherjee

GPU-Accelerated Optimization of Transformer-Based Neural Networks for Real-Time Inference

Add code
Mar 30, 2026
Viaarxiv icon