Shengyu Zhang

OS Agents: A Survey on MLLM-based Agents for General Computing Devices Use

Aug 06, 2025
Via arXiv

HarmonyGuard: Toward Safety and Utility in Web Agents via Adaptive Policy Enhancement and Dual-Objective Optimization

Aug 06, 2025
Via arXiv

EC-Diff: Fast and High-Quality Edge-Cloud Collaborative Inference for Diffusion Models

Jul 16, 2025
Via arXiv

Constellation as a Service: Tailored Connectivity Management in Direct-Satellite-to-Device Networks

Jul 01, 2025
Via arXiv

Device-Cloud Collaborative Correction for On-Device Recommendation

Jun 15, 2025
Via arXiv

MNN-LLM: A Generic Inference Engine for Fast Large Language Model Deployment on Mobile Devices

Jun 12, 2025
Via arXiv

Infi-MMR: Curriculum-based Unlocking Multimodal Reasoning via Phased Reinforcement Learning in Multimodal Small Language Models

May 29, 2025
Via arXiv

Multimodal LLM-Guided Semantic Correction in Text-to-Image Diffusion

May 26, 2025
Via arXiv

Cuff-KT: Tackling Learners' Real-time Learning Pattern Adjustment via Tuning-Free Knowledge State Guided Model Updating

May 26, 2025
Via arXiv

ThinkRec: Thinking-based Recommendation via LLM

May 21, 2025
Via arXiv