Picture for Mingjie Zhan

Mingjie Zhan

WebGen-Agent: Enhancing Interactive Website Generation with Multi-Level Feedback and Step-Level Reinforcement Learning

Add code
Sep 26, 2025
Viaarxiv icon

VoiceAssistant-Eval: Benchmarking AI Assistants across Listening, Speaking, and Viewing

Add code
Sep 26, 2025
Viaarxiv icon

Alignment with Fill-In-the-Middle for Enhancing Code Generation

Add code
Aug 27, 2025
Viaarxiv icon

Probability-Consistent Preference Optimization for Enhanced LLM Reasoning

Add code
May 29, 2025
Viaarxiv icon

MathCoder-VL: Bridging Vision and Code for Enhanced Multimodal Mathematical Reasoning

Add code
May 15, 2025
Viaarxiv icon

WebGen-Bench: Evaluating LLMs on Generating Interactive and Functional Websites from Scratch

Add code
May 06, 2025
Viaarxiv icon

Navi-plus: Managing Ambiguous GUI Navigation Tasks with Follow-up

Add code
Mar 31, 2025
Figure 1 for Navi-plus: Managing Ambiguous GUI Navigation Tasks with Follow-up
Figure 2 for Navi-plus: Managing Ambiguous GUI Navigation Tasks with Follow-up
Figure 3 for Navi-plus: Managing Ambiguous GUI Navigation Tasks with Follow-up
Figure 4 for Navi-plus: Managing Ambiguous GUI Navigation Tasks with Follow-up
Viaarxiv icon

SpiritSight Agent: Advanced GUI Agent with One Look

Add code
Mar 05, 2025
Viaarxiv icon

UFO: Enhancing Diffusion-Based Video Generation with a Uniform Frame Organizer

Add code
Dec 12, 2024
Figure 1 for UFO: Enhancing Diffusion-Based Video Generation with a Uniform Frame Organizer
Figure 2 for UFO: Enhancing Diffusion-Based Video Generation with a Uniform Frame Organizer
Figure 3 for UFO: Enhancing Diffusion-Based Video Generation with a Uniform Frame Organizer
Figure 4 for UFO: Enhancing Diffusion-Based Video Generation with a Uniform Frame Organizer
Viaarxiv icon

MathCoder2: Better Math Reasoning from Continued Pretraining on Model-translated Mathematical Code

Add code
Oct 10, 2024
Figure 1 for MathCoder2: Better Math Reasoning from Continued Pretraining on Model-translated Mathematical Code
Figure 2 for MathCoder2: Better Math Reasoning from Continued Pretraining on Model-translated Mathematical Code
Figure 3 for MathCoder2: Better Math Reasoning from Continued Pretraining on Model-translated Mathematical Code
Figure 4 for MathCoder2: Better Math Reasoning from Continued Pretraining on Model-translated Mathematical Code
Viaarxiv icon