Zhenpeng Chen

Can Agents Fix Agent Issues?

May 27, 2025, via arXiv

AMQA: An Adversarial Dataset for Benchmarking Bias of LLMs in Medicine and Healthcare

May 26, 2025, via arXiv

Show Me Your Code! Kill Code Poisoning: A Lightweight Method Based on Code Naturalness

Feb 20, 2025, via arXiv

Diversity Drives Fairness: Ensemble of Higher Order Mutants for Intersectional Fairness of Machine Learning Software

Dec 11, 2024, via arXiv

Benchmarking Bias in Large Language Models during Role-Playing

Nov 01, 2024, via arXiv

Large Language Model-Based Agents for Software Engineering: A Survey

Sep 04, 2024, via arXiv

LLM-Powered Test Case Generation for Detecting Tricky Bugs

Apr 16, 2024, via arXiv

Exploring the Impact of In-Browser Deep Learning Inference on Quality of User Experience and Performance

Feb 08, 2024, via arXiv

Dark-Skin Individuals Are at More Risk on the Street: Unmasking Fairness Issues of Autonomous Driving Systems

Aug 05, 2023, via arXiv

An Empirical Study on Fairness Improvement with Multiple Protected Attributes

Jul 25, 2023, via arXiv