
Abhilasha Ravichander

Model State Arithmetic for Machine Unlearning

Jun 26, 2025

What Has Been Lost with Synthetic Evaluation?

May 28, 2025

Why and How LLMs Hallucinate: Connecting the Dots with Subsequence Associations

Apr 17, 2025

Information-Guided Identification of Training Data Imprint in (Proprietary) Large Language Models

Mar 15, 2025

HALoGEN: Fantastic LLM Hallucinations and Where to Find Them

Jan 14, 2025

RESTOR: Knowledge Recovery through Machine Unlearning

Oct 31, 2024

Reverse Question Answering: Can an LLM Write a Question so Hard (or Bad) that it Can't Answer?

Oct 20, 2024

WildHallucinations: Evaluating Long-form Factuality in LLMs with Real-World Entity Queries

Jul 24, 2024

WildBench: Benchmarking LLMs with Challenging Tasks from Real Users in the Wild

Jun 07, 2024

Artifacts or Abduction: How Do LLMs Answer Multiple-Choice Questions Without the Question?

Feb 19, 2024