Ayoung Lee

LiveOIBench: Can Large Language Models Outperform Human Contestants in Informatics Olympiads?

Oct 10, 2025

CLASH: Evaluating Language Models on Judging High-Stakes Dilemmas from Multiple Perspectives

Apr 15, 2025