Picture for Jingming Zhuo

Jingming Zhuo

OpenCompass: A Universal Evaluation Platform for Large Language Models

Add code
May 19, 2026
Viaarxiv icon

ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs

Add code
Oct 16, 2024
Figure 1 for ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs
Figure 2 for ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs
Figure 3 for ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs
Figure 4 for ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs
Viaarxiv icon

InternLM2 Technical Report

Add code
Mar 26, 2024
Figure 1 for InternLM2 Technical Report
Figure 2 for InternLM2 Technical Report
Figure 3 for InternLM2 Technical Report
Figure 4 for InternLM2 Technical Report
Viaarxiv icon

T-Eval: Evaluating the Tool Utilization Capability of Large Language Models Step by Step

Add code
Jan 15, 2024
Figure 1 for T-Eval: Evaluating the Tool Utilization Capability of Large Language Models Step by Step
Figure 2 for T-Eval: Evaluating the Tool Utilization Capability of Large Language Models Step by Step
Figure 3 for T-Eval: Evaluating the Tool Utilization Capability of Large Language Models Step by Step
Figure 4 for T-Eval: Evaluating the Tool Utilization Capability of Large Language Models Step by Step
Viaarxiv icon