Picture for Yi Zong

Yi Zong

Training an LLM-as-a-Judge Model: Pipeline, Insights, and Practical Lessons

Add code
Feb 05, 2025
Viaarxiv icon

GAOKAO-MM: A Chinese Human-Level Benchmark for Multimodal Models Evaluation

Add code
Feb 24, 2024
Figure 1 for GAOKAO-MM: A Chinese Human-Level Benchmark for Multimodal Models Evaluation
Figure 2 for GAOKAO-MM: A Chinese Human-Level Benchmark for Multimodal Models Evaluation
Figure 3 for GAOKAO-MM: A Chinese Human-Level Benchmark for Multimodal Models Evaluation
Figure 4 for GAOKAO-MM: A Chinese Human-Level Benchmark for Multimodal Models Evaluation
Viaarxiv icon

Evaluating the Performance of Large Language Models on GAOKAO Benchmark

Add code
May 23, 2023
Viaarxiv icon