Tushar Krishna

SLM-MUX: Orchestrating Small Language Models for Reasoning

Oct 06, 2025

ThinKV: Thought-Adaptive KV Cache Compression for Efficient Reasoning Models

Oct 01, 2025

Win Fast or Lose Slow: Balancing Speed and Accuracy in Latency-Sensitive Decisions of LLMs

May 26, 2025

Towards Easy and Realistic Network Infrastructure Testing for Large-scale Machine Learning

Apr 29, 2025

NSFlow: An End-to-End FPGA Framework with Scalable Dataflow Architecture for Neuro-Symbolic AI

Apr 29, 2025

Generative AI in Embodied Systems: System-Level Analysis of Performance, Efficiency and Scalability

Apr 26, 2025

Accelerating LLM Inference with Flexible N:M Sparsity via A Fully Digital Compute-in-Memory Accelerator

Apr 19, 2025

Understanding and Optimizing Multi-Stage AI Inference Pipelines

Apr 16, 2025

OuroMamba: A Data-Free Quantization Framework for Vision Mamba Models

Mar 13, 2025

AIRCHITECT v2: Learning the Hardware Accelerator Design Space through Unified Representations

Jan 17, 2025