Picture for Mingzhe Du

Mingzhe Du

Paper Espresso: From Paper Overload to Research Insight

Add code
Apr 06, 2026
Viaarxiv icon

Secure Code Generation via Online Reinforcement Learning with Vulnerability Reward Model

Add code
Feb 07, 2026
Viaarxiv icon

Nexus: Execution-Grounded Multi-Agent Test Oracle Synthesis

Add code
Oct 30, 2025
Viaarxiv icon

Beyond Prompt-Induced Lies: Investigating LLM Deception on Benign Prompts

Add code
Aug 08, 2025
Figure 1 for Beyond Prompt-Induced Lies: Investigating LLM Deception on Benign Prompts
Figure 2 for Beyond Prompt-Induced Lies: Investigating LLM Deception on Benign Prompts
Figure 3 for Beyond Prompt-Induced Lies: Investigating LLM Deception on Benign Prompts
Figure 4 for Beyond Prompt-Induced Lies: Investigating LLM Deception on Benign Prompts
Viaarxiv icon

Afterburner: Reinforcement Learning Facilitates Self-Improving Code Efficiency Optimization

Add code
May 29, 2025
Figure 1 for Afterburner: Reinforcement Learning Facilitates Self-Improving Code Efficiency Optimization
Figure 2 for Afterburner: Reinforcement Learning Facilitates Self-Improving Code Efficiency Optimization
Figure 3 for Afterburner: Reinforcement Learning Facilitates Self-Improving Code Efficiency Optimization
Figure 4 for Afterburner: Reinforcement Learning Facilitates Self-Improving Code Efficiency Optimization
Viaarxiv icon

Efficient Reasoning via Chain of Unconscious Thought

Add code
May 26, 2025
Viaarxiv icon

EffiBench-X: A Multi-Language Benchmark for Measuring Efficiency of LLM-Generated Code

Add code
May 19, 2025
Figure 1 for EffiBench-X: A Multi-Language Benchmark for Measuring Efficiency of LLM-Generated Code
Figure 2 for EffiBench-X: A Multi-Language Benchmark for Measuring Efficiency of LLM-Generated Code
Figure 3 for EffiBench-X: A Multi-Language Benchmark for Measuring Efficiency of LLM-Generated Code
Figure 4 for EffiBench-X: A Multi-Language Benchmark for Measuring Efficiency of LLM-Generated Code
Viaarxiv icon

GuardReasoner-VL: Safeguarding VLMs via Reinforced Reasoning

Add code
May 16, 2025
Viaarxiv icon

AntiLeak-Bench: Preventing Data Contamination by Automatically Constructing Benchmarks with Updated Real-World Knowledge

Add code
Dec 18, 2024
Figure 1 for AntiLeak-Bench: Preventing Data Contamination by Automatically Constructing Benchmarks with Updated Real-World Knowledge
Figure 2 for AntiLeak-Bench: Preventing Data Contamination by Automatically Constructing Benchmarks with Updated Real-World Knowledge
Figure 3 for AntiLeak-Bench: Preventing Data Contamination by Automatically Constructing Benchmarks with Updated Real-World Knowledge
Figure 4 for AntiLeak-Bench: Preventing Data Contamination by Automatically Constructing Benchmarks with Updated Real-World Knowledge
Viaarxiv icon

Rethinking the Influence of Source Code on Test Case Generation

Add code
Sep 14, 2024
Viaarxiv icon