Blair Yang

SEAM: Semantically Equivalent Across Modalities Benchmark for Vision-Language Models

Aug 25, 2025
Via arXiv

Report Cards: Qualitative Evaluation of Language Models Using Natural Language Summaries

Sep 01, 2024
Via arXiv