Picture for Blair Yang

Blair Yang

ThinkTwice: Jointly Optimizing Large Language Models for Reasoning and Self-Refinement

Add code
Apr 02, 2026
Viaarxiv icon

Grounded Chess Reasoning in Language Models via Master Distillation

Add code
Mar 20, 2026
Viaarxiv icon

OasisSimp: An Open-source Asian-English Sentence Simplification Dataset

Add code
Mar 14, 2026
Viaarxiv icon

SEAM: Semantically Equivalent Across Modalities Benchmark for Vision-Language Models

Add code
Aug 25, 2025
Viaarxiv icon

Report Cards: Qualitative Evaluation of Language Models Using Natural Language Summaries

Add code
Sep 01, 2024
Figure 1 for Report Cards: Qualitative Evaluation of Language Models Using Natural Language Summaries
Figure 2 for Report Cards: Qualitative Evaluation of Language Models Using Natural Language Summaries
Figure 3 for Report Cards: Qualitative Evaluation of Language Models Using Natural Language Summaries
Figure 4 for Report Cards: Qualitative Evaluation of Language Models Using Natural Language Summaries
Viaarxiv icon