Picture for Fei Wu

Fei Wu

OS Agents: A Survey on MLLM-based Agents for General Computing Devices Use

Add code
Aug 06, 2025
Viaarxiv icon

EC-Diff: Fast and High-Quality Edge-Cloud Collaborative Inference for Diffusion Models

Add code
Jul 16, 2025
Viaarxiv icon

SAMST: A Transformer framework based on SAM pseudo label filtering for remote sensing semi-supervised semantic segmentation

Add code
Jul 16, 2025
Viaarxiv icon

STARS: A Unified Framework for Singing Transcription, Alignment, and Refined Style Annotation

Add code
Jul 09, 2025
Viaarxiv icon

Device-Cloud Collaborative Correction for On-Device Recommendation

Add code
Jun 15, 2025
Viaarxiv icon

Infi-MMR: Curriculum-based Unlocking Multimodal Reasoning via Phased Reinforcement Learning in Multimodal Small Language Models

Add code
May 29, 2025
Viaarxiv icon

Multimodal LLM-Guided Semantic Correction in Text-to-Image Diffusion

Add code
May 26, 2025
Viaarxiv icon

Embracing Imperfection: Simulating Students with Diverse Cognitive Levels Using LLM-based Agents

Add code
May 26, 2025
Viaarxiv icon

AppealCase: A Dataset and Benchmark for Civil Case Appeal Scenarios

Add code
May 22, 2025
Viaarxiv icon

ThinkRec: Thinking-based recommendation via LLM

Add code
May 21, 2025
Viaarxiv icon