Naman Jain

GSO: Challenging Software Optimization Tasks for Evaluating SWE-Agents

May 29, 2025
Via arXiv

Waymo Driverless Car Data Analysis and Driving Modeling using CNN and LSTM

Apr 29, 2025
Via arXiv

R2E-Gym: Procedural Environments and Hybrid Verifiers for Scaling Open-Weights SWE Agents

Apr 09, 2025
Via arXiv

Challenges and Paths Towards AI for Software Engineering

Mar 28, 2025
Via arXiv

Syzygy: Dual Code-Test C to (safe) Rust Translation using LLMs and Dynamic Analysis

Dec 18, 2024
Via arXiv

SelfCodeAlign: Self-Alignment for Code Generation

Oct 31, 2024
Via arXiv

BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions

Jun 26, 2024
Via arXiv

RAFT: Adapting Language Model to Domain Specific RAG

Mar 15, 2024
Via arXiv

LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code

Mar 12, 2024
Via arXiv

StarCoder 2 and The Stack v2: The Next Generation

Feb 29, 2024
Via arXiv