Picture for Maliheh Izadi

Maliheh Izadi

Do Agents Dream of Root Shells? Partial-Credit Evaluation of LLM Agents in Capture The Flag Challenges

Add code
Apr 21, 2026
Viaarxiv icon

Automated Attention Pattern Discovery at Scale in Large Language Models

Add code
Apr 04, 2026
Viaarxiv icon

Investigating Autonomous Agent Contributions in the Wild: Activity Patterns and Code Change over Time

Add code
Apr 01, 2026
Viaarxiv icon

TriCEGAR: A Trace-Driven Abstraction Mechanism for Agentic AI

Add code
Jan 30, 2026
Viaarxiv icon

Model See, Model Do? Exposure-Aware Evaluation of Bug-vs-Fix Preference in Code LLMs

Add code
Jan 15, 2026
Viaarxiv icon

Does In-IDE Calibration of Large Language Models work at Scale?

Add code
Oct 26, 2025
Figure 1 for Does In-IDE Calibration of Large Language Models work at Scale?
Figure 2 for Does In-IDE Calibration of Large Language Models work at Scale?
Figure 3 for Does In-IDE Calibration of Large Language Models work at Scale?
Figure 4 for Does In-IDE Calibration of Large Language Models work at Scale?
Viaarxiv icon

A Qualitative Investigation into LLM-Generated Multilingual Code Comments and Automatic Evaluation Metrics

Add code
May 21, 2025
Viaarxiv icon

Code Red! On the Harmfulness of Applying Off-the-shelf Large Language Models to Programming Tasks

Add code
Apr 02, 2025
Figure 1 for Code Red! On the Harmfulness of Applying Off-the-shelf Large Language Models to Programming Tasks
Figure 2 for Code Red! On the Harmfulness of Applying Off-the-shelf Large Language Models to Programming Tasks
Figure 3 for Code Red! On the Harmfulness of Applying Off-the-shelf Large Language Models to Programming Tasks
Figure 4 for Code Red! On the Harmfulness of Applying Off-the-shelf Large Language Models to Programming Tasks
Viaarxiv icon

Human-AI Experience in Integrated Development Environments: A Systematic Literature Review

Add code
Mar 08, 2025
Viaarxiv icon

Benchmarking AI Models in Software Engineering: A Review, Search Tool, and Enhancement Protocol

Add code
Mar 07, 2025
Figure 1 for Benchmarking AI Models in Software Engineering: A Review, Search Tool, and Enhancement Protocol
Figure 2 for Benchmarking AI Models in Software Engineering: A Review, Search Tool, and Enhancement Protocol
Figure 3 for Benchmarking AI Models in Software Engineering: A Review, Search Tool, and Enhancement Protocol
Figure 4 for Benchmarking AI Models in Software Engineering: A Review, Search Tool, and Enhancement Protocol
Viaarxiv icon