Picture for Runzhe Zhan

Runzhe Zhan

Neuron-Aware Data Selection In Instruction Tuning For Large Language Models

Add code
Mar 13, 2026
Viaarxiv icon

Are Large Reasoning Models Good Translation Evaluators? Analysis and Performance Boost

Add code
Oct 23, 2025
Figure 1 for Are Large Reasoning Models Good Translation Evaluators? Analysis and Performance Boost
Figure 2 for Are Large Reasoning Models Good Translation Evaluators? Analysis and Performance Boost
Figure 3 for Are Large Reasoning Models Good Translation Evaluators? Analysis and Performance Boost
Figure 4 for Are Large Reasoning Models Good Translation Evaluators? Analysis and Performance Boost
Viaarxiv icon

ExGRPO: Learning to Reason from Experience

Add code
Oct 02, 2025
Figure 1 for ExGRPO: Learning to Reason from Experience
Figure 2 for ExGRPO: Learning to Reason from Experience
Figure 3 for ExGRPO: Learning to Reason from Experience
Figure 4 for ExGRPO: Learning to Reason from Experience
Viaarxiv icon

Exposing the Cracks: Vulnerabilities of Retrieval-Augmented LLM-based Machine Translation

Add code
Oct 01, 2025
Figure 1 for Exposing the Cracks: Vulnerabilities of Retrieval-Augmented LLM-based Machine Translation
Figure 2 for Exposing the Cracks: Vulnerabilities of Retrieval-Augmented LLM-based Machine Translation
Figure 3 for Exposing the Cracks: Vulnerabilities of Retrieval-Augmented LLM-based Machine Translation
Figure 4 for Exposing the Cracks: Vulnerabilities of Retrieval-Augmented LLM-based Machine Translation
Viaarxiv icon

Synthesizing Sheet Music Problems for Evaluation and Reinforcement Learning

Add code
Sep 04, 2025
Figure 1 for Synthesizing Sheet Music Problems for Evaluation and Reinforcement Learning
Figure 2 for Synthesizing Sheet Music Problems for Evaluation and Reinforcement Learning
Figure 3 for Synthesizing Sheet Music Problems for Evaluation and Reinforcement Learning
Figure 4 for Synthesizing Sheet Music Problems for Evaluation and Reinforcement Learning
Viaarxiv icon

RepreGuard: Detecting LLM-Generated Text by Revealing Hidden Representation Patterns

Add code
Aug 18, 2025
Figure 1 for RepreGuard: Detecting LLM-Generated Text by Revealing Hidden Representation Patterns
Figure 2 for RepreGuard: Detecting LLM-Generated Text by Revealing Hidden Representation Patterns
Figure 3 for RepreGuard: Detecting LLM-Generated Text by Revealing Hidden Representation Patterns
Figure 4 for RepreGuard: Detecting LLM-Generated Text by Revealing Hidden Representation Patterns
Viaarxiv icon

Rethinking Prompt-based Debiasing in Large Language Models

Add code
Mar 12, 2025
Figure 1 for Rethinking Prompt-based Debiasing in Large Language Models
Figure 2 for Rethinking Prompt-based Debiasing in Large Language Models
Figure 3 for Rethinking Prompt-based Debiasing in Large Language Models
Figure 4 for Rethinking Prompt-based Debiasing in Large Language Models
Viaarxiv icon

Intrinsic Model Weaknesses: How Priming Attacks Unveil Vulnerabilities in Large Language Models

Add code
Feb 23, 2025
Viaarxiv icon

DetectRL: Benchmarking LLM-Generated Text Detection in Real-World Scenarios

Add code
Oct 31, 2024
Figure 1 for DetectRL: Benchmarking LLM-Generated Text Detection in Real-World Scenarios
Figure 2 for DetectRL: Benchmarking LLM-Generated Text Detection in Real-World Scenarios
Figure 3 for DetectRL: Benchmarking LLM-Generated Text Detection in Real-World Scenarios
Figure 4 for DetectRL: Benchmarking LLM-Generated Text Detection in Real-World Scenarios
Viaarxiv icon

VisAidMath: Benchmarking Visual-Aided Mathematical Reasoning

Add code
Oct 30, 2024
Figure 1 for VisAidMath: Benchmarking Visual-Aided Mathematical Reasoning
Figure 2 for VisAidMath: Benchmarking Visual-Aided Mathematical Reasoning
Figure 3 for VisAidMath: Benchmarking Visual-Aided Mathematical Reasoning
Figure 4 for VisAidMath: Benchmarking Visual-Aided Mathematical Reasoning
Viaarxiv icon