Nathalie Baracaldo

Reasoning Model Unlearning: Forgetting Traces, Not Just Answers, While Preserving Reasoning Skills

Jun 15, 2025
Via arXiv

EPiC: Towards Lossless Speedup for Reasoning Training through Edge-Preserving CoT Condensation

Jun 04, 2025
Via arXiv

MAP: Multi-Human-Value Alignment Palette

Oct 24, 2024
Via arXiv

WAGLE: Strategic Weight Attribution for Effective and Modular Unlearning in Large Language Models

Oct 23, 2024
Via arXiv

Mitigating Forgetting in LLM Supervised Fine-Tuning and Preference Learning

Oct 20, 2024
Via arXiv

Turning Generative Models Degenerate: The Power of Data Poisoning Attacks

Jul 18, 2024
Via arXiv

Split, Unlearn, Merge: Leveraging Data Attributes for More Effective Unlearning in LLMs

Jun 17, 2024
Via arXiv

Rethinking Machine Unlearning for Large Language Models

Feb 15, 2024
Via arXiv

Enhancing In-context Learning via Linear Probe Calibration

Jan 22, 2024
Via arXiv

FairSISA: Ensemble Post-Processing to Improve Fairness of Unlearning in LLMs

Dec 12, 2023
Via arXiv