Picture for Shengyu Zhang

Shengyu Zhang

Device-Cloud Collaborative Correction for On-Device Recommendation

Add code
Jun 15, 2025
Viaarxiv icon

MNN-LLM: A Generic Inference Engine for Fast Large Language Model Deployment on Mobile Devices

Add code
Jun 12, 2025
Viaarxiv icon

Infi-MMR: Curriculum-based Unlocking Multimodal Reasoning via Phased Reinforcement Learning in Multimodal Small Language Models

Add code
May 29, 2025
Viaarxiv icon

Multimodal LLM-Guided Semantic Correction in Text-to-Image Diffusion

Add code
May 26, 2025
Viaarxiv icon

Cuff-KT: Tackling Learners' Real-time Learning Pattern Adjustment via Tuning-Free Knowledge State Guided Model Updating

Add code
May 26, 2025
Viaarxiv icon

ThinkRec: Thinking-based recommendation via LLM

Add code
May 21, 2025
Viaarxiv icon

EcoAgent: An Efficient Edge-Cloud Collaborative Multi-Agent Framework for Mobile Automation

Add code
May 08, 2025
Viaarxiv icon

InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners

Add code
Apr 19, 2025
Viaarxiv icon

Disentangled Knowledge Tracing for Alleviating Cognitive Bias

Add code
Mar 04, 2025
Viaarxiv icon

AEIA-MN: Evaluating the Robustness of Multimodal LLM-Powered Mobile Agents Against Active Environmental Injection Attacks

Add code
Feb 18, 2025
Viaarxiv icon