Alert button
Picture for Souvik Kundu

Souvik Kundu

Alert button

AFLoRA: Adaptive Freezing of Low Rank Adaptation in Parameter Efficient Fine-Tuning of Large Models

Add code
Bookmark button
Alert button
Mar 20, 2024
Zeyu Liu, Souvik Kundu, Anni Li, Junrui Wan, Lianghao Jiang, Peter Anthony Beerel

Figure 1 for AFLoRA: Adaptive Freezing of Low Rank Adaptation in Parameter Efficient Fine-Tuning of Large Models
Figure 2 for AFLoRA: Adaptive Freezing of Low Rank Adaptation in Parameter Efficient Fine-Tuning of Large Models
Figure 3 for AFLoRA: Adaptive Freezing of Low Rank Adaptation in Parameter Efficient Fine-Tuning of Large Models
Figure 4 for AFLoRA: Adaptive Freezing of Low Rank Adaptation in Parameter Efficient Fine-Tuning of Large Models
Viaarxiv icon

GEAR: An Efficient KV Cache Compression Recipe for Near-Lossless Generative Inference of LLM

Add code
Bookmark button
Alert button
Mar 11, 2024
Hao Kang, Qingru Zhang, Souvik Kundu, Geonhwa Jeong, Zaoxing Liu, Tushar Krishna, Tuo Zhao

Figure 1 for GEAR: An Efficient KV Cache Compression Recipe for Near-Lossless Generative Inference of LLM
Figure 2 for GEAR: An Efficient KV Cache Compression Recipe for Near-Lossless Generative Inference of LLM
Figure 3 for GEAR: An Efficient KV Cache Compression Recipe for Near-Lossless Generative Inference of LLM
Figure 4 for GEAR: An Efficient KV Cache Compression Recipe for Near-Lossless Generative Inference of LLM
Viaarxiv icon

GEAR: An Efficient KV Cache Compression Recipefor Near-Lossless Generative Inference of LLM

Add code
Bookmark button
Alert button
Mar 08, 2024
Hao Kang, Qingru Zhang, Souvik Kundu, Geonhwa Jeong, Zaoxing Liu, Tushar Krishna, Tuo Zhao

Figure 1 for GEAR: An Efficient KV Cache Compression Recipefor Near-Lossless Generative Inference of LLM
Figure 2 for GEAR: An Efficient KV Cache Compression Recipefor Near-Lossless Generative Inference of LLM
Figure 3 for GEAR: An Efficient KV Cache Compression Recipefor Near-Lossless Generative Inference of LLM
Figure 4 for GEAR: An Efficient KV Cache Compression Recipefor Near-Lossless Generative Inference of LLM
Viaarxiv icon

Towards Joint Optimization for DNN Architecture and Configuration for Compute-In-Memory Hardware

Add code
Bookmark button
Alert button
Feb 19, 2024
Souvik Kundu, Anthony Sarah, Vinay Joshi, Om J Omer, Sreenivas Subramoney

Viaarxiv icon

Linearizing Models for Efficient yet Robust Private Inference

Add code
Bookmark button
Alert button
Feb 08, 2024
Sreetama Sarkar, Souvik Kundu, Peter A. Beerel

Viaarxiv icon

Sparse but Strong: Crafting Adversarially Robust Graph Lottery Tickets

Add code
Bookmark button
Alert button
Dec 11, 2023
Subhajit Dutta Chowdhury, Zhiyu Ni, Qingyuan Peng, Souvik Kundu, Pierluigi Nuzzo

Viaarxiv icon

Recent Advances in Scalable Energy-Efficient and Trustworthy Spiking Neural networks: from Algorithms to Technology

Add code
Bookmark button
Alert button
Dec 02, 2023
Souvik Kundu, Rui-Jie Zhu, Akhilesh Jaiswal, Peter A. Beerel

Viaarxiv icon

Fusing Models with Complementary Expertise

Add code
Bookmark button
Alert button
Oct 02, 2023
Hongyi Wang, Felipe Maia Polo, Yuekai Sun, Souvik Kundu, Eric Xing, Mikhail Yurochkin

Figure 1 for Fusing Models with Complementary Expertise
Figure 2 for Fusing Models with Complementary Expertise
Figure 3 for Fusing Models with Complementary Expertise
Figure 4 for Fusing Models with Complementary Expertise
Viaarxiv icon

Junk DNA Hypothesis: A Task-Centric Angle of LLM Pre-trained Weights through Sparsity

Add code
Bookmark button
Alert button
Sep 29, 2023
Lu Yin, Shiwei Liu, Ajay Jaiswal, Souvik Kundu, Zhangyang Wang

Viaarxiv icon

InstaTune: Instantaneous Neural Architecture Search During Fine-Tuning

Add code
Bookmark button
Alert button
Aug 29, 2023
Sharath Nittur Sridhar, Souvik Kundu, Sairam Sundaresan, Maciej Szankin, Anthony Sarah

Figure 1 for InstaTune: Instantaneous Neural Architecture Search During Fine-Tuning
Figure 2 for InstaTune: Instantaneous Neural Architecture Search During Fine-Tuning
Figure 3 for InstaTune: Instantaneous Neural Architecture Search During Fine-Tuning
Figure 4 for InstaTune: Instantaneous Neural Architecture Search During Fine-Tuning
Viaarxiv icon