Picture for Rahul Mazumder

Rahul Mazumder

Computation of Least Trimmed Squares: A Branch-and-Bound framework with Hyperplane Arrangement Enhancements

Add code
Apr 14, 2026
Viaarxiv icon

MOONSHOT : A Framework for Multi-Objective Pruning of Vision and Large Language Models

Add code
Apr 14, 2026
Viaarxiv icon

Robust Batch-Level Query Routing for Large Language Models under Cost and Capacity Constraints

Add code
Mar 25, 2026
Viaarxiv icon

3BASiL: An Algorithmic Framework for Sparse plus Low-Rank Compression of LLMs

Add code
Mar 02, 2026
Viaarxiv icon

Reasoning Models Can be Accurately Pruned Via Chain-of-Thought Reconstruction

Add code
Sep 15, 2025
Figure 1 for Reasoning Models Can be Accurately Pruned Via Chain-of-Thought Reconstruction
Figure 2 for Reasoning Models Can be Accurately Pruned Via Chain-of-Thought Reconstruction
Figure 3 for Reasoning Models Can be Accurately Pruned Via Chain-of-Thought Reconstruction
Figure 4 for Reasoning Models Can be Accurately Pruned Via Chain-of-Thought Reconstruction
Viaarxiv icon

TSENOR: Highly-Efficient Algorithm for Finding Transposable N:M Sparse Masks

Add code
May 29, 2025
Viaarxiv icon

An Optimization Framework for Differentially Private Sparse Fine-Tuning

Add code
Mar 17, 2025
Figure 1 for An Optimization Framework for Differentially Private Sparse Fine-Tuning
Figure 2 for An Optimization Framework for Differentially Private Sparse Fine-Tuning
Figure 3 for An Optimization Framework for Differentially Private Sparse Fine-Tuning
Figure 4 for An Optimization Framework for Differentially Private Sparse Fine-Tuning
Viaarxiv icon

Efficient AI in Practice: Training and Deployment of Efficient LLMs for Industry Applications

Add code
Feb 20, 2025
Viaarxiv icon

HASSLE-free: A unified Framework for Sparse plus Low-Rank Matrix Decomposition for LLMs

Add code
Feb 02, 2025
Viaarxiv icon

Efficient user history modeling with amortized inference for deep learning recommendation models

Add code
Dec 09, 2024
Figure 1 for Efficient user history modeling with amortized inference for deep learning recommendation models
Figure 2 for Efficient user history modeling with amortized inference for deep learning recommendation models
Figure 3 for Efficient user history modeling with amortized inference for deep learning recommendation models
Figure 4 for Efficient user history modeling with amortized inference for deep learning recommendation models
Viaarxiv icon