Michael R. Lyu

Inference-Time Scaling of Verification: Self-Evolving Deep Research Agents via Test-Time Rubric-Guided Verification

Jan 22, 2026, via arXiv

Why Does the LLM Stop Computing: An Empirical Study of User-Reported Failures in Open-Source LLMs

Jan 20, 2026, via arXiv

From Laboratory to Real-World Applications: Benchmarking Agentic Code Reasoning at the Repository Level

Jan 07, 2026, via arXiv

Next Edit Prediction: Learning to Predict Code Edits from Context and Interaction History

Aug 13, 2025, via arXiv

ScreenCoder: Advancing Visual-to-Code Generation for Front-End Automation via Modular Multimodal Agents

Jul 30, 2025, via arXiv

Runtime Failure Hunting for Physics Engine Based Software Systems: How Far Can We Go?

Jul 29, 2025, via arXiv

3D Software Synthesis Guided by Constraint-Expressive Intermediate Representation

Jul 24, 2025, via arXiv

Entropy-Memorization Law: Evaluating Memorization Difficulty of Data in LLMs

Jul 08, 2025, via arXiv

SlideCoder: Layout-aware RAG-enhanced Hierarchical Slide Generation from Design

Jun 09, 2025, via arXiv

DesignBench: A Comprehensive Benchmark for MLLM-based Front-end Code Generation

Jun 06, 2025, via arXiv