He Du

TREX: Automating LLM Fine-tuning via Agent-Driven Tree-based Exploration

Apr 15, 2026

Kernel-Smith: A Unified Recipe for Evolutionary Kernel Optimization

Mar 30, 2026

Large Language Models Meet Symbolic Provers for Logical Reasoning Evaluation

Feb 10, 2025

SWE-Fixer: Training Open-Source LLMs for Effective and Efficient GitHub Issue Resolution

Jan 09, 2025

DevBench: A Comprehensive Benchmark for Software Development

Mar 15, 2024