Picture for Conghui Zhu

Conghui Zhu

Long-form RewardBench: Evaluating Reward Models for Long-form Generation

Add code
Mar 13, 2026
Viaarxiv icon

Toward Robust LLM-Based Judges: Taxonomic Bias Evaluation and Debiasing Optimization

Add code
Mar 09, 2026
Viaarxiv icon

From Perception to Reasoning: Deep Thinking Empowers Multimodal Large Language Models

Add code
Nov 18, 2025
Viaarxiv icon

Speculative Decoding Meets Quantization: Compatibility Evaluation and Hierarchical Framework Design

Add code
May 29, 2025
Viaarxiv icon

Lost in Benchmarks? Rethinking Large Language Model Benchmarking with Item Response Theory

Add code
May 21, 2025
Figure 1 for Lost in Benchmarks? Rethinking Large Language Model Benchmarking with Item Response Theory
Figure 2 for Lost in Benchmarks? Rethinking Large Language Model Benchmarking with Item Response Theory
Figure 3 for Lost in Benchmarks? Rethinking Large Language Model Benchmarking with Item Response Theory
Figure 4 for Lost in Benchmarks? Rethinking Large Language Model Benchmarking with Item Response Theory
Viaarxiv icon

MuSC: Improving Complex Instruction Following with Multi-granularity Self-Contrastive Training

Add code
Feb 17, 2025
Figure 1 for MuSC: Improving Complex Instruction Following with Multi-granularity Self-Contrastive Training
Figure 2 for MuSC: Improving Complex Instruction Following with Multi-granularity Self-Contrastive Training
Figure 3 for MuSC: Improving Complex Instruction Following with Multi-granularity Self-Contrastive Training
Figure 4 for MuSC: Improving Complex Instruction Following with Multi-granularity Self-Contrastive Training
Viaarxiv icon

SEO: Stochastic Experience Optimization for Large Language Models

Add code
Jan 08, 2025
Viaarxiv icon

Mitigating the Bias of Large Language Model Evaluation

Add code
Sep 25, 2024
Figure 1 for Mitigating the Bias of Large Language Model Evaluation
Figure 2 for Mitigating the Bias of Large Language Model Evaluation
Figure 3 for Mitigating the Bias of Large Language Model Evaluation
Figure 4 for Mitigating the Bias of Large Language Model Evaluation
Viaarxiv icon

LoRA-drop: Efficient LoRA Parameter Pruning based on Output Evaluation

Add code
Feb 12, 2024
Figure 1 for LoRA-drop: Efficient LoRA Parameter Pruning based on Output Evaluation
Figure 2 for LoRA-drop: Efficient LoRA Parameter Pruning based on Output Evaluation
Figure 3 for LoRA-drop: Efficient LoRA Parameter Pruning based on Output Evaluation
Figure 4 for LoRA-drop: Efficient LoRA Parameter Pruning based on Output Evaluation
Viaarxiv icon

CLMLF:A Contrastive Learning and Multi-Layer Fusion Method for Multimodal Sentiment Detection

Add code
Apr 14, 2022
Figure 1 for CLMLF:A Contrastive Learning and Multi-Layer Fusion Method for Multimodal Sentiment Detection
Figure 2 for CLMLF:A Contrastive Learning and Multi-Layer Fusion Method for Multimodal Sentiment Detection
Figure 3 for CLMLF:A Contrastive Learning and Multi-Layer Fusion Method for Multimodal Sentiment Detection
Figure 4 for CLMLF:A Contrastive Learning and Multi-Layer Fusion Method for Multimodal Sentiment Detection
Viaarxiv icon