Picture for Shitanshu Bhushan

Shitanshu Bhushan

LiveOIBench: Can Large Language Models Outperform Human Contestants in Informatics Olympiads?

Add code
Oct 10, 2025
Figure 1 for LiveOIBench: Can Large Language Models Outperform Human Contestants in Informatics Olympiads?
Figure 2 for LiveOIBench: Can Large Language Models Outperform Human Contestants in Informatics Olympiads?
Figure 3 for LiveOIBench: Can Large Language Models Outperform Human Contestants in Informatics Olympiads?
Figure 4 for LiveOIBench: Can Large Language Models Outperform Human Contestants in Informatics Olympiads?
Viaarxiv icon

MLRC-Bench: Can Language Agents Solve Machine Learning Research Challenges?

Add code
Apr 13, 2025
Figure 1 for MLRC-Bench: Can Language Agents Solve Machine Learning Research Challenges?
Figure 2 for MLRC-Bench: Can Language Agents Solve Machine Learning Research Challenges?
Figure 3 for MLRC-Bench: Can Language Agents Solve Machine Learning Research Challenges?
Figure 4 for MLRC-Bench: Can Language Agents Solve Machine Learning Research Challenges?
Viaarxiv icon