Picture for David Lo

David Lo

Singapore Management University

An LLM-as-Judge Metric for Bridging the Gap with Human Evaluation in SE Tasks

Add code
May 27, 2025
Viaarxiv icon

CODE-DITING: A Reasoning-Based Metric for Functional Alignment in Code Evaluation

Add code
May 26, 2025
Viaarxiv icon

Let the Trial Begin: A Mock-Court Approach to Vulnerability Detection using LLM-Based Agents

Add code
May 16, 2025
Viaarxiv icon

Toward Generalizable Evaluation in the LLM Era: A Survey Beyond Benchmarks

Add code
Apr 26, 2025
Viaarxiv icon

Zero-Shot Cross-Domain Code Search without Fine-Tuning

Add code
Apr 10, 2025
Viaarxiv icon

R2Vul: Learning to Reason about Software Vulnerabilities with Reinforcement Learning and Structured Reasoning Distillation

Add code
Apr 07, 2025
Viaarxiv icon

Towards Secure Program Partitioning for Smart Contracts with LLM's In-Context Learning

Add code
Feb 20, 2025
Viaarxiv icon

Less is More: On the Importance of Data Quality for Unit Test Generation

Add code
Feb 20, 2025
Viaarxiv icon

LessLeak-Bench: A First Investigation of Data Leakage in LLMs Across 83 Software Engineering Benchmarks

Add code
Feb 10, 2025
Viaarxiv icon

Do Existing Testing Tools Really Uncover Gender Bias in Text-to-Image Models?

Add code
Jan 27, 2025
Figure 1 for Do Existing Testing Tools Really Uncover Gender Bias in Text-to-Image Models?
Figure 2 for Do Existing Testing Tools Really Uncover Gender Bias in Text-to-Image Models?
Figure 3 for Do Existing Testing Tools Really Uncover Gender Bias in Text-to-Image Models?
Figure 4 for Do Existing Testing Tools Really Uncover Gender Bias in Text-to-Image Models?
Viaarxiv icon