Anjiang Wei

Astra: A Multi-Agent System for GPU Kernel Performance Optimization

Sep 09, 2025
Via arXiv

SATBench: Benchmarking LLMs' Logical Reasoning via Automated Puzzle Generation from SAT Formulas

May 20, 2025
Via arXiv

Improving Assembly Code Performance with Large Language Models via Reinforcement Learning

May 16, 2025
Via arXiv

ClarifyCoder: Clarification-Aware Fine-Tuning for Programmatic Problem Solving

Apr 23, 2025
Via arXiv

VeriCoder: Enhancing LLM-Based RTL Code Generation through Functional Correctness Validation

Apr 22, 2025
Via arXiv

CodeARC: Benchmarking Reasoning Capabilities of LLM Agents for Inductive Program Synthesis

Mar 29, 2025
Via arXiv

EquiBench: Benchmarking Code Reasoning Capabilities of Large Language Models via Equivalence Checking

Feb 18, 2025
Via arXiv

Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation

Jan 06, 2025
Via arXiv

Improving Parallel Program Performance Through DSL-Driven Code Generation with LLM Optimizers

Oct 21, 2024
Via arXiv