Picture for Ashwinee Panda

Ashwinee Panda

Dense Backpropagation Improves Training for Sparse Mixture-of-Experts

Add code
Apr 18, 2025
Viaarxiv icon

Analysis of Attention in Video Diffusion Transformers

Add code
Apr 14, 2025
Viaarxiv icon

LoRI: Reducing Cross-Task Interference in Multi-Task Low-Rank Adaptation

Add code
Apr 10, 2025
Viaarxiv icon

Using Attention Sinks to Identify and Evaluate Dormant Heads in Pretrained LLMs

Add code
Apr 04, 2025
Viaarxiv icon

Privacy Auditing of Large Language Models

Add code
Mar 09, 2025
Viaarxiv icon

Continual Pre-training of MoEs: How robust is your router?

Add code
Mar 06, 2025
Viaarxiv icon

Gemstones: A Model Suite for Multi-Faceted Scaling Laws

Add code
Feb 07, 2025
Figure 1 for Gemstones: A Model Suite for Multi-Faceted Scaling Laws
Figure 2 for Gemstones: A Model Suite for Multi-Faceted Scaling Laws
Figure 3 for Gemstones: A Model Suite for Multi-Faceted Scaling Laws
Figure 4 for Gemstones: A Model Suite for Multi-Faceted Scaling Laws
Viaarxiv icon

Refusal Tokens: A Simple Way to Calibrate Refusals in Large Language Models

Add code
Dec 09, 2024
Figure 1 for Refusal Tokens: A Simple Way to Calibrate Refusals in Large Language Models
Figure 2 for Refusal Tokens: A Simple Way to Calibrate Refusals in Large Language Models
Figure 3 for Refusal Tokens: A Simple Way to Calibrate Refusals in Large Language Models
Figure 4 for Refusal Tokens: A Simple Way to Calibrate Refusals in Large Language Models
Viaarxiv icon

Lottery Ticket Adaptation: Mitigating Destructive Interference in LLMs

Add code
Jun 25, 2024
Figure 1 for Lottery Ticket Adaptation: Mitigating Destructive Interference in LLMs
Figure 2 for Lottery Ticket Adaptation: Mitigating Destructive Interference in LLMs
Figure 3 for Lottery Ticket Adaptation: Mitigating Destructive Interference in LLMs
Figure 4 for Lottery Ticket Adaptation: Mitigating Destructive Interference in LLMs
Viaarxiv icon

Safety Alignment Should Be Made More Than Just a Few Tokens Deep

Add code
Jun 10, 2024
Figure 1 for Safety Alignment Should Be Made More Than Just a Few Tokens Deep
Figure 2 for Safety Alignment Should Be Made More Than Just a Few Tokens Deep
Figure 3 for Safety Alignment Should Be Made More Than Just a Few Tokens Deep
Figure 4 for Safety Alignment Should Be Made More Than Just a Few Tokens Deep
Viaarxiv icon