Picture for Daniel Khashabi

Daniel Khashabi

Shammie

Lost in the Haystack: Smaller Needles are More Difficult for LLMs to Find

Add code
May 23, 2025
Figure 1 for Lost in the Haystack: Smaller Needles are More Difficult for LLMs to Find
Figure 2 for Lost in the Haystack: Smaller Needles are More Difficult for LLMs to Find
Figure 3 for Lost in the Haystack: Smaller Needles are More Difficult for LLMs to Find
Figure 4 for Lost in the Haystack: Smaller Needles are More Difficult for LLMs to Find
Viaarxiv icon

SIMPLEMIX: Frustratingly Simple Mixing of Off- and On-policy Data in Language Model Preference Learning

Add code
May 05, 2025
Viaarxiv icon

ICL CIPHERS: Quantifying "Learning'' in In-Context Learning via Substitution Ciphers

Add code
Apr 28, 2025
Viaarxiv icon

Certified Mitigation of Worst-Case LLM Copyright Infringement

Add code
Apr 22, 2025
Figure 1 for Certified Mitigation of Worst-Case LLM Copyright Infringement
Figure 2 for Certified Mitigation of Worst-Case LLM Copyright Infringement
Figure 3 for Certified Mitigation of Worst-Case LLM Copyright Infringement
Figure 4 for Certified Mitigation of Worst-Case LLM Copyright Infringement
Viaarxiv icon

Science Hierarchography: Hierarchical Organization of Science Literature

Add code
Apr 18, 2025
Figure 1 for Science Hierarchography: Hierarchical Organization of Science Literature
Figure 2 for Science Hierarchography: Hierarchical Organization of Science Literature
Figure 3 for Science Hierarchography: Hierarchical Organization of Science Literature
Figure 4 for Science Hierarchography: Hierarchical Organization of Science Literature
Viaarxiv icon

Can LLMs Generate Tabular Summaries of Science Papers? Rethinking the Evaluation Protocol

Add code
Apr 14, 2025
Figure 1 for Can LLMs Generate Tabular Summaries of Science Papers? Rethinking the Evaluation Protocol
Figure 2 for Can LLMs Generate Tabular Summaries of Science Papers? Rethinking the Evaluation Protocol
Figure 3 for Can LLMs Generate Tabular Summaries of Science Papers? Rethinking the Evaluation Protocol
Figure 4 for Can LLMs Generate Tabular Summaries of Science Papers? Rethinking the Evaluation Protocol
Viaarxiv icon

CLAIMCHECK: How Grounded are LLM Critiques of Scientific Papers?

Add code
Mar 27, 2025
Figure 1 for CLAIMCHECK: How Grounded are LLM Critiques of Scientific Papers?
Figure 2 for CLAIMCHECK: How Grounded are LLM Critiques of Scientific Papers?
Figure 3 for CLAIMCHECK: How Grounded are LLM Critiques of Scientific Papers?
Figure 4 for CLAIMCHECK: How Grounded are LLM Critiques of Scientific Papers?
Viaarxiv icon

Can A Society of Generative Agents Simulate Human Behavior and Inform Public Health Policy? A Case Study on Vaccine Hesitancy

Add code
Mar 12, 2025
Viaarxiv icon

GenEx: Generating an Explorable World

Add code
Dec 12, 2024
Figure 1 for GenEx: Generating an Explorable World
Figure 2 for GenEx: Generating an Explorable World
Figure 3 for GenEx: Generating an Explorable World
Figure 4 for GenEx: Generating an Explorable World
Viaarxiv icon

Generative World Explorer

Add code
Nov 19, 2024
Figure 1 for Generative World Explorer
Figure 2 for Generative World Explorer
Figure 3 for Generative World Explorer
Figure 4 for Generative World Explorer
Viaarxiv icon