Picture for Michael R. Lyu

Michael R. Lyu

Next Edit Prediction: Learning to Predict Code Edits from Context and Interaction History

Add code
Aug 13, 2025
Viaarxiv icon

ScreenCoder: Advancing Visual-to-Code Generation for Front-End Automation via Modular Multimodal Agents

Add code
Jul 30, 2025
Viaarxiv icon

Runtime Failure Hunting for Physics Engine Based Software Systems: How Far Can We Go?

Add code
Jul 29, 2025
Viaarxiv icon

3D Software Synthesis Guided by Constraint-Expressive Intermediate Representation

Add code
Jul 24, 2025
Viaarxiv icon

Entropy-Memorization Law: Evaluating Memorization Difficulty of Data in LLMs

Add code
Jul 08, 2025
Viaarxiv icon

SlideCoder: Layout-aware RAG-enhanced Hierarchical Slide Generation from Design

Add code
Jun 09, 2025
Viaarxiv icon

DesignBench: A Comprehensive Benchmark for MLLM-based Front-end Code Generation

Add code
Jun 06, 2025
Viaarxiv icon

CODECRASH: Stress Testing LLM Reasoning under Structural and Semantic Perturbations

Add code
Apr 19, 2025
Figure 1 for CODECRASH: Stress Testing LLM Reasoning under Structural and Semantic Perturbations
Figure 2 for CODECRASH: Stress Testing LLM Reasoning under Structural and Semantic Perturbations
Figure 3 for CODECRASH: Stress Testing LLM Reasoning under Structural and Semantic Perturbations
Figure 4 for CODECRASH: Stress Testing LLM Reasoning under Structural and Semantic Perturbations
Viaarxiv icon

Fact-or-Fair: A Checklist for Behavioral Testing of AI Models on Fairness-Related Queries

Add code
Feb 09, 2025
Figure 1 for Fact-or-Fair: A Checklist for Behavioral Testing of AI Models on Fairness-Related Queries
Figure 2 for Fact-or-Fair: A Checklist for Behavioral Testing of AI Models on Fairness-Related Queries
Figure 3 for Fact-or-Fair: A Checklist for Behavioral Testing of AI Models on Fairness-Related Queries
Figure 4 for Fact-or-Fair: A Checklist for Behavioral Testing of AI Models on Fairness-Related Queries
Viaarxiv icon

How Should I Build A Benchmark?

Add code
Jan 18, 2025
Viaarxiv icon