Runyu Shi

GUI-CEval: A Hierarchical and Comprehensive Chinese Benchmark for Mobile GUI Agents

Mar 16, 2026
Via arXiv

ProactiveMobile: A Comprehensive Benchmark for Boosting Proactive Intelligence on Mobile Devices

Feb 26, 2026
Via arXiv

Unified Multimodal and Multilingual Retrieval via Multi-Task Learning with NLU Integration

Jan 21, 2026
Via arXiv

HyperVL: An Efficient and Dynamic Multimodal Large Language Model for Edge Devices

Dec 16, 2025
Via arXiv

Lego-Edit: A General Image Editing Framework with Model-Level Bricks and MLLM Builder

Sep 16, 2025
Via arXiv