Picture for Yejin Choi

Yejin Choi

Broken Tokens? Your Language Model can Secretly Handle Non-Canonical Tokenizations

Add code
Jun 23, 2025
Viaarxiv icon

Verifying the Verifiers: Unveiling Pitfalls and Potentials in Fact Verifiers

Add code
Jun 16, 2025
Viaarxiv icon

Infini-gram mini: Exact n-gram Search at the Internet Scale with FM-Index

Add code
Jun 13, 2025
Viaarxiv icon

Socratic-MCTS: Test-Time Visual Reasoning by Asking the Right Questions

Add code
Jun 10, 2025
Viaarxiv icon

Chasing Moving Targets with Online Self-Play Reinforcement Learning for Safer Language Models

Add code
Jun 09, 2025
Viaarxiv icon

Synthetic Visual Genome

Add code
Jun 09, 2025
Viaarxiv icon

When to Trust Context: Self-Reflective Debates for Context Reliability

Add code
Jun 06, 2025
Viaarxiv icon

Unfolding Spatial Cognition: Evaluating Multimodal Models on Visual Simulations

Add code
Jun 05, 2025
Viaarxiv icon

OpenThoughts: Data Recipes for Reasoning Models

Add code
Jun 05, 2025
Viaarxiv icon

ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models

Add code
May 30, 2025
Viaarxiv icon