Picture for Toufique Ahmed

Toufique Ahmed

Otter: Generating Tests from Issues to Validate SWE Patches

Add code
Feb 07, 2025
Viaarxiv icon

TDD-Bench Verified: Can LLMs Generate Tests for Issues Before They Get Resolved?

Add code
Dec 03, 2024
Figure 1 for TDD-Bench Verified: Can LLMs Generate Tests for Issues Before They Get Resolved?
Figure 2 for TDD-Bench Verified: Can LLMs Generate Tests for Issues Before They Get Resolved?
Figure 3 for TDD-Bench Verified: Can LLMs Generate Tests for Issues Before They Get Resolved?
Figure 4 for TDD-Bench Verified: Can LLMs Generate Tests for Issues Before They Get Resolved?
Viaarxiv icon

Prompting and Fine-tuning Large Language Models for Automated Code Review Comment Generation

Add code
Nov 15, 2024
Figure 1 for Prompting and Fine-tuning Large Language Models for Automated Code Review Comment Generation
Figure 2 for Prompting and Fine-tuning Large Language Models for Automated Code Review Comment Generation
Figure 3 for Prompting and Fine-tuning Large Language Models for Automated Code Review Comment Generation
Figure 4 for Prompting and Fine-tuning Large Language Models for Automated Code Review Comment Generation
Viaarxiv icon

Can LLMs Replace Manual Annotation of Software Engineering Artifacts?

Add code
Aug 10, 2024
Figure 1 for Can LLMs Replace Manual Annotation of Software Engineering Artifacts?
Figure 2 for Can LLMs Replace Manual Annotation of Software Engineering Artifacts?
Figure 3 for Can LLMs Replace Manual Annotation of Software Engineering Artifacts?
Figure 4 for Can LLMs Replace Manual Annotation of Software Engineering Artifacts?
Viaarxiv icon

Trojans in Large Language Models of Code: A Critical Review through a Trigger-Based Taxonomy

Add code
May 05, 2024
Figure 1 for Trojans in Large Language Models of Code: A Critical Review through a Trigger-Based Taxonomy
Figure 2 for Trojans in Large Language Models of Code: A Critical Review through a Trigger-Based Taxonomy
Figure 3 for Trojans in Large Language Models of Code: A Critical Review through a Trigger-Based Taxonomy
Figure 4 for Trojans in Large Language Models of Code: A Critical Review through a Trigger-Based Taxonomy
Viaarxiv icon

Enhancing Trust in LLM-Generated Code Summaries with Calibrated Confidence Scores

Add code
Apr 30, 2024
Figure 1 for Enhancing Trust in LLM-Generated Code Summaries with Calibrated Confidence Scores
Figure 2 for Enhancing Trust in LLM-Generated Code Summaries with Calibrated Confidence Scores
Figure 3 for Enhancing Trust in LLM-Generated Code Summaries with Calibrated Confidence Scores
Figure 4 for Enhancing Trust in LLM-Generated Code Summaries with Calibrated Confidence Scores
Viaarxiv icon

Studying LLM Performance on Closed- and Open-source Data

Add code
Feb 23, 2024
Figure 1 for Studying LLM Performance on Closed- and Open-source Data
Figure 2 for Studying LLM Performance on Closed- and Open-source Data
Figure 3 for Studying LLM Performance on Closed- and Open-source Data
Figure 4 for Studying LLM Performance on Closed- and Open-source Data
Viaarxiv icon

Quality and Trust in LLM-generated Code

Add code
Feb 09, 2024
Viaarxiv icon

Towards Understanding What Code Language Models Learned

Add code
Jun 20, 2023
Viaarxiv icon

Majority Rule: better patching via Self-Consistency

Add code
May 31, 2023
Figure 1 for Majority Rule: better patching via Self-Consistency
Figure 2 for Majority Rule: better patching via Self-Consistency
Figure 3 for Majority Rule: better patching via Self-Consistency
Figure 4 for Majority Rule: better patching via Self-Consistency
Viaarxiv icon