Picture for Yejin Choi

Yejin Choi

Synthetic Mixed Training: Scaling Parametric Knowledge Acquisition Beyond RAG

Add code
Mar 24, 2026
Viaarxiv icon

FDARxBench: Benchmarking Regulatory and Clinical Reasoning on FDA Generic Drug Assessment

Add code
Mar 20, 2026
Viaarxiv icon

Data-efficient pre-training by scaling synthetic megadocs

Add code
Mar 19, 2026
Viaarxiv icon

Learning from Trials and Errors: Reflective Test-Time Planning for Embodied LLMs

Add code
Feb 24, 2026
Viaarxiv icon

ODESteer: A Unified ODE-Based Steering Framework for LLM Alignment

Add code
Feb 19, 2026
Viaarxiv icon

MemoryArena: Benchmarking Agent Memory in Interdependent Multi-Session Agentic Tasks

Add code
Feb 18, 2026
Viaarxiv icon

iGRPO: Self-Feedback-Driven LLM Reasoning

Add code
Feb 09, 2026
Viaarxiv icon

Theory of Space: Can Foundation Models Construct Spatial Beliefs through Active Exploration?

Add code
Feb 04, 2026
Viaarxiv icon

Privasis: Synthesizing the Largest "Public" Private Dataset from Scratch

Add code
Feb 03, 2026
Viaarxiv icon

Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text

Add code
Jan 30, 2026
Viaarxiv icon