Picture for Hongli Zhou

Hongli Zhou

RM-Distiller: Exploiting Generative LLM for Reward Model Distillation

Add code
Jan 20, 2026
Viaarxiv icon

Lost in Benchmarks? Rethinking Large Language Model Benchmarking with Item Response Theory

Add code
May 21, 2025
Figure 1 for Lost in Benchmarks? Rethinking Large Language Model Benchmarking with Item Response Theory
Figure 2 for Lost in Benchmarks? Rethinking Large Language Model Benchmarking with Item Response Theory
Figure 3 for Lost in Benchmarks? Rethinking Large Language Model Benchmarking with Item Response Theory
Figure 4 for Lost in Benchmarks? Rethinking Large Language Model Benchmarking with Item Response Theory
Viaarxiv icon

Think-J: Learning to Think for Generative LLM-as-a-Judge

Add code
May 20, 2025
Viaarxiv icon

Mitigating the Bias of Large Language Model Evaluation

Add code
Sep 25, 2024
Figure 1 for Mitigating the Bias of Large Language Model Evaluation
Figure 2 for Mitigating the Bias of Large Language Model Evaluation
Figure 3 for Mitigating the Bias of Large Language Model Evaluation
Figure 4 for Mitigating the Bias of Large Language Model Evaluation
Viaarxiv icon