Pengzhi Gao

Scaling, Benchmarking, and Reasoning of Vision-Language Agents for Mobile GUI Navigation

May 26, 2026
Via arXiv

SimuWoB: Simulating Real-World Mobile Apps for Fast and Faithful GUI Agent Benchmarking

May 24, 2026
Via arXiv

How Mobile World Model Guides GUI Agents?

May 11, 2026
Via arXiv

ExPosST: Explicit Positioning with Adaptive Masking for LLM-Based Simultaneous Machine Translation

Mar 16, 2026
Via arXiv

CoME: Empowering Channel-of-Mobile-Experts with Informative Hybrid-Capabilities Reasoning

Feb 27, 2026
Via arXiv

Scaling Model and Data for Multilingual Machine Translation with Open Large Language Models

Feb 12, 2026
Via arXiv

MobileBench-OL: A Comprehensive Chinese Benchmark for Evaluating Mobile GUI Agents in Real-World Environment

Jan 29, 2026
Via arXiv

STEP: Success-Rate-Aware Trajectory-Efficient Policy Optimization

Nov 17, 2025
Via arXiv

Revisiting Entropy in Reinforcement Learning for Large Reasoning Models

Nov 08, 2025
Via arXiv

BacktrackAgent: Enhancing GUI Agent with Error Detection and Backtracking Mechanism

May 27, 2025
Via arXiv