Deep Shah

Prompt-Level Distillation: A Non-Parametric Alternative to Model Fine-Tuning for Efficient Reasoning

Feb 24, 2026
Via arXiv

Long-Tail Knowledge in Large Language Models: Taxonomy, Mechanisms, Interventions and Implications

Feb 18, 2026
Via arXiv

Mining Generalizable Activation Functions

Feb 05, 2026
Via arXiv

Taxonomy of the Retrieval System Framework: Pitfalls and Paradigms

Jan 27, 2026
Via arXiv

The Llama 4 Herd: Architecture, Training, Evaluation, and Deployment Notes

Jan 15, 2026
Via arXiv