Picture for Rahul Mazumder

Rahul Mazumder

TSENOR: Highly-Efficient Algorithm for Finding Transposable N:M Sparse Masks

Add code
May 29, 2025
Viaarxiv icon

An Optimization Framework for Differentially Private Sparse Fine-Tuning

Add code
Mar 17, 2025
Viaarxiv icon

Efficient AI in Practice: Training and Deployment of Efficient LLMs for Industry Applications

Add code
Feb 20, 2025
Viaarxiv icon

HASSLE-free: A unified Framework for Sparse plus Low-Rank Matrix Decomposition for LLMs

Add code
Feb 02, 2025
Viaarxiv icon

Efficient user history modeling with amortized inference for deep learning recommendation models

Add code
Dec 09, 2024
Figure 1 for Efficient user history modeling with amortized inference for deep learning recommendation models
Figure 2 for Efficient user history modeling with amortized inference for deep learning recommendation models
Figure 3 for Efficient user history modeling with amortized inference for deep learning recommendation models
Figure 4 for Efficient user history modeling with amortized inference for deep learning recommendation models
Viaarxiv icon

Preserving Deep Representations In One-Shot Pruning: A Hessian-Free Second-Order Optimization Framework

Add code
Nov 27, 2024
Viaarxiv icon

ALPS: Improved Optimization for Highly Sparse One-Shot Pruning for Large Language Models

Add code
Jun 12, 2024
Figure 1 for ALPS: Improved Optimization for Highly Sparse One-Shot Pruning for Large Language Models
Figure 2 for ALPS: Improved Optimization for Highly Sparse One-Shot Pruning for Large Language Models
Figure 3 for ALPS: Improved Optimization for Highly Sparse One-Shot Pruning for Large Language Models
Figure 4 for ALPS: Improved Optimization for Highly Sparse One-Shot Pruning for Large Language Models
Viaarxiv icon

FALCON: FLOP-Aware Combinatorial Optimization for Neural Network Pruning

Add code
Mar 11, 2024
Figure 1 for FALCON: FLOP-Aware Combinatorial Optimization for Neural Network Pruning
Figure 2 for FALCON: FLOP-Aware Combinatorial Optimization for Neural Network Pruning
Figure 3 for FALCON: FLOP-Aware Combinatorial Optimization for Neural Network Pruning
Figure 4 for FALCON: FLOP-Aware Combinatorial Optimization for Neural Network Pruning
Viaarxiv icon

FAST: An Optimization Framework for Fast Additive Segmentation in Transparent ML

Add code
Feb 20, 2024
Viaarxiv icon

Randomization Can Reduce Both Bias and Variance: A Case Study in Random Forests

Add code
Feb 20, 2024
Viaarxiv icon