Runyu Shi

Unified Multimodal and Multilingual Retrieval via Multi-Task Learning with NLU Integration

Jan 21, 2026

HyperVL: An Efficient and Dynamic Multimodal Large Language Model for Edge Devices

Dec 16, 2025

Lego-Edit: A General Image Editing Framework with Model-Level Bricks and MLLM Builder

Sep 16, 2025