Picture for Shengyu Zhang

Shengyu Zhang

Infi-MMR: Curriculum-based Unlocking Multimodal Reasoning via Phased Reinforcement Learning in Multimodal Small Language Models

Add code
May 29, 2025
Viaarxiv icon

Multimodal LLM-Guided Semantic Correction in Text-to-Image Diffusion

Add code
May 26, 2025
Viaarxiv icon

Cuff-KT: Tackling Learners' Real-time Learning Pattern Adjustment via Tuning-Free Knowledge State Guided Model Updating

Add code
May 26, 2025
Viaarxiv icon

ThinkRec: Thinking-based recommendation via LLM

Add code
May 21, 2025
Viaarxiv icon

EcoAgent: An Efficient Edge-Cloud Collaborative Multi-Agent Framework for Mobile Automation

Add code
May 08, 2025
Viaarxiv icon

InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners

Add code
Apr 19, 2025
Viaarxiv icon

Disentangled Knowledge Tracing for Alleviating Cognitive Bias

Add code
Mar 04, 2025
Viaarxiv icon

AEIA-MN: Evaluating the Robustness of Multimodal LLM-Powered Mobile Agents Against Active Environmental Injection Attacks

Add code
Feb 18, 2025
Viaarxiv icon

InfiR : Crafting Effective Small Language Models and Multimodal Small Language Models in Reasoning

Add code
Feb 17, 2025
Viaarxiv icon

Optimize Incompatible Parameters through Compatibility-aware Knowledge Integration

Add code
Jan 10, 2025
Figure 1 for Optimize Incompatible Parameters through Compatibility-aware Knowledge Integration
Figure 2 for Optimize Incompatible Parameters through Compatibility-aware Knowledge Integration
Figure 3 for Optimize Incompatible Parameters through Compatibility-aware Knowledge Integration
Figure 4 for Optimize Incompatible Parameters through Compatibility-aware Knowledge Integration
Viaarxiv icon