Picture for Nathalie Baracaldo

Nathalie Baracaldo

Reasoning Model Unlearning: Forgetting Traces, Not Just Answers, While Preserving Reasoning Skills

Add code
Jun 15, 2025
Viaarxiv icon

EPiC: Towards Lossless Speedup for Reasoning Training through Edge-Preserving CoT Condensation

Add code
Jun 04, 2025
Viaarxiv icon

MAP: Multi-Human-Value Alignment Palette

Add code
Oct 24, 2024
Figure 1 for MAP: Multi-Human-Value Alignment Palette
Figure 2 for MAP: Multi-Human-Value Alignment Palette
Figure 3 for MAP: Multi-Human-Value Alignment Palette
Figure 4 for MAP: Multi-Human-Value Alignment Palette
Viaarxiv icon

WAGLE: Strategic Weight Attribution for Effective and Modular Unlearning in Large Language Models

Add code
Oct 23, 2024
Viaarxiv icon

Mitigating Forgetting in LLM Supervised Fine-Tuning and Preference Learning

Add code
Oct 20, 2024
Figure 1 for Mitigating Forgetting in LLM Supervised Fine-Tuning and Preference Learning
Figure 2 for Mitigating Forgetting in LLM Supervised Fine-Tuning and Preference Learning
Figure 3 for Mitigating Forgetting in LLM Supervised Fine-Tuning and Preference Learning
Figure 4 for Mitigating Forgetting in LLM Supervised Fine-Tuning and Preference Learning
Viaarxiv icon

Turning Generative Models Degenerate: The Power of Data Poisoning Attacks

Add code
Jul 18, 2024
Figure 1 for Turning Generative Models Degenerate: The Power of Data Poisoning Attacks
Figure 2 for Turning Generative Models Degenerate: The Power of Data Poisoning Attacks
Figure 3 for Turning Generative Models Degenerate: The Power of Data Poisoning Attacks
Figure 4 for Turning Generative Models Degenerate: The Power of Data Poisoning Attacks
Viaarxiv icon

Split, Unlearn, Merge: Leveraging Data Attributes for More Effective Unlearning in LLMs

Add code
Jun 17, 2024
Figure 1 for Split, Unlearn, Merge: Leveraging Data Attributes for More Effective Unlearning in LLMs
Figure 2 for Split, Unlearn, Merge: Leveraging Data Attributes for More Effective Unlearning in LLMs
Figure 3 for Split, Unlearn, Merge: Leveraging Data Attributes for More Effective Unlearning in LLMs
Figure 4 for Split, Unlearn, Merge: Leveraging Data Attributes for More Effective Unlearning in LLMs
Viaarxiv icon

Rethinking Machine Unlearning for Large Language Models

Add code
Feb 15, 2024
Viaarxiv icon

Enhancing In-context Learning via Linear Probe Calibration

Add code
Jan 22, 2024
Viaarxiv icon

FairSISA: Ensemble Post-Processing to Improve Fairness of Unlearning in LLMs

Add code
Dec 12, 2023
Figure 1 for FairSISA: Ensemble Post-Processing to Improve Fairness of Unlearning in LLMs
Figure 2 for FairSISA: Ensemble Post-Processing to Improve Fairness of Unlearning in LLMs
Figure 3 for FairSISA: Ensemble Post-Processing to Improve Fairness of Unlearning in LLMs
Figure 4 for FairSISA: Ensemble Post-Processing to Improve Fairness of Unlearning in LLMs
Viaarxiv icon