Picture for Shengyu Zhang

Shengyu Zhang

Yusuf Hamied Department of Chemistry, University of Cambridge, UK

MNN-LLM: A Generic Inference Engine for Fast Large Language Model Deployment on Mobile Devices

Add code
Jun 12, 2025
Viaarxiv icon

Infi-MMR: Curriculum-based Unlocking Multimodal Reasoning via Phased Reinforcement Learning in Multimodal Small Language Models

Add code
May 29, 2025
Figure 1 for Infi-MMR: Curriculum-based Unlocking Multimodal Reasoning via Phased Reinforcement Learning in Multimodal Small Language Models
Figure 2 for Infi-MMR: Curriculum-based Unlocking Multimodal Reasoning via Phased Reinforcement Learning in Multimodal Small Language Models
Figure 3 for Infi-MMR: Curriculum-based Unlocking Multimodal Reasoning via Phased Reinforcement Learning in Multimodal Small Language Models
Figure 4 for Infi-MMR: Curriculum-based Unlocking Multimodal Reasoning via Phased Reinforcement Learning in Multimodal Small Language Models
Viaarxiv icon

Cuff-KT: Tackling Learners' Real-time Learning Pattern Adjustment via Tuning-Free Knowledge State Guided Model Updating

Add code
May 26, 2025
Figure 1 for Cuff-KT: Tackling Learners' Real-time Learning Pattern Adjustment via Tuning-Free Knowledge State Guided Model Updating
Figure 2 for Cuff-KT: Tackling Learners' Real-time Learning Pattern Adjustment via Tuning-Free Knowledge State Guided Model Updating
Figure 3 for Cuff-KT: Tackling Learners' Real-time Learning Pattern Adjustment via Tuning-Free Knowledge State Guided Model Updating
Figure 4 for Cuff-KT: Tackling Learners' Real-time Learning Pattern Adjustment via Tuning-Free Knowledge State Guided Model Updating
Viaarxiv icon

Multimodal LLM-Guided Semantic Correction in Text-to-Image Diffusion

Add code
May 26, 2025
Figure 1 for Multimodal LLM-Guided Semantic Correction in Text-to-Image Diffusion
Figure 2 for Multimodal LLM-Guided Semantic Correction in Text-to-Image Diffusion
Figure 3 for Multimodal LLM-Guided Semantic Correction in Text-to-Image Diffusion
Figure 4 for Multimodal LLM-Guided Semantic Correction in Text-to-Image Diffusion
Viaarxiv icon

ThinkRec: Thinking-based recommendation via LLM

Add code
May 21, 2025
Figure 1 for ThinkRec: Thinking-based recommendation via LLM
Figure 2 for ThinkRec: Thinking-based recommendation via LLM
Figure 3 for ThinkRec: Thinking-based recommendation via LLM
Figure 4 for ThinkRec: Thinking-based recommendation via LLM
Viaarxiv icon

EcoAgent: An Efficient Edge-Cloud Collaborative Multi-Agent Framework for Mobile Automation

Add code
May 08, 2025
Viaarxiv icon

InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners

Add code
Apr 19, 2025
Viaarxiv icon

Disentangled Knowledge Tracing for Alleviating Cognitive Bias

Add code
Mar 04, 2025
Viaarxiv icon

AEIA-MN: Evaluating the Robustness of Multimodal LLM-Powered Mobile Agents Against Active Environmental Injection Attacks

Add code
Feb 18, 2025
Figure 1 for AEIA-MN: Evaluating the Robustness of Multimodal LLM-Powered Mobile Agents Against Active Environmental Injection Attacks
Figure 2 for AEIA-MN: Evaluating the Robustness of Multimodal LLM-Powered Mobile Agents Against Active Environmental Injection Attacks
Figure 3 for AEIA-MN: Evaluating the Robustness of Multimodal LLM-Powered Mobile Agents Against Active Environmental Injection Attacks
Figure 4 for AEIA-MN: Evaluating the Robustness of Multimodal LLM-Powered Mobile Agents Against Active Environmental Injection Attacks
Viaarxiv icon

InfiR : Crafting Effective Small Language Models and Multimodal Small Language Models in Reasoning

Add code
Feb 17, 2025
Figure 1 for InfiR : Crafting Effective Small Language Models and Multimodal Small Language Models in Reasoning
Figure 2 for InfiR : Crafting Effective Small Language Models and Multimodal Small Language Models in Reasoning
Figure 3 for InfiR : Crafting Effective Small Language Models and Multimodal Small Language Models in Reasoning
Figure 4 for InfiR : Crafting Effective Small Language Models and Multimodal Small Language Models in Reasoning
Viaarxiv icon