Daniel Khashabi

Shammie

IA2: Alignment with ICL Activations Improves Supervised Fine-Tuning

Sep 26, 2025
Via arXiv

Linguistic Nepotism: Trading-off Quality for Language Preference in Multilingual RAG

Sep 17, 2025
Via arXiv

Evaluating the Evaluators: Are readability metrics good measures of readability?

Aug 26, 2025
Via arXiv

Feedback Friction: LLMs Struggle to Fully Incorporate External Feedback

Jun 13, 2025
Via arXiv

Jailbreak Distillation: Renewable Safety Benchmarking

May 28, 2025
Via arXiv

Lost in the Haystack: Smaller Needles are More Difficult for LLMs to Find

May 23, 2025
Via arXiv

SIMPLEMIX: Frustratingly Simple Mixing of Off- and On-policy Data in Language Model Preference Learning

May 05, 2025
Via arXiv

ICL CIPHERS: Quantifying "Learning" in In-Context Learning via Substitution Ciphers

Apr 28, 2025
Via arXiv

Certified Mitigation of Worst-Case LLM Copyright Infringement

Apr 22, 2025
Via arXiv

Science Hierarchography: Hierarchical Organization of Science Literature

Apr 18, 2025
Via arXiv