Picture for Daniel Khashabi

Daniel Khashabi

Shammie

Jailbreak Distillation: Renewable Safety Benchmarking

Add code
May 28, 2025
Viaarxiv icon

Lost in the Haystack: Smaller Needles are More Difficult for LLMs to Find

Add code
May 23, 2025
Viaarxiv icon

SIMPLEMIX: Frustratingly Simple Mixing of Off- and On-policy Data in Language Model Preference Learning

Add code
May 05, 2025
Viaarxiv icon

ICL CIPHERS: Quantifying "Learning'' in In-Context Learning via Substitution Ciphers

Add code
Apr 28, 2025
Viaarxiv icon

Certified Mitigation of Worst-Case LLM Copyright Infringement

Add code
Apr 22, 2025
Viaarxiv icon

Science Hierarchography: Hierarchical Organization of Science Literature

Add code
Apr 18, 2025
Viaarxiv icon

Can LLMs Generate Tabular Summaries of Science Papers? Rethinking the Evaluation Protocol

Add code
Apr 14, 2025
Viaarxiv icon

CLAIMCHECK: How Grounded are LLM Critiques of Scientific Papers?

Add code
Mar 27, 2025
Viaarxiv icon

Can A Society of Generative Agents Simulate Human Behavior and Inform Public Health Policy? A Case Study on Vaccine Hesitancy

Add code
Mar 12, 2025
Viaarxiv icon

GenEx: Generating an Explorable World

Add code
Dec 12, 2024
Viaarxiv icon