Picture for Shengyu Zhang

Shengyu Zhang

SafePred: A Predictive Guardrail for Computer-Using Agents via World Models

Add code
Feb 02, 2026
Viaarxiv icon

MALLOC: Benchmarking the Memory-aware Long Sequence Compression for Large Sequential Recommendation

Add code
Jan 29, 2026
Viaarxiv icon

CORE: Code-based Inverse Self-Training Framework with Graph Expansion for Virtual Agents

Add code
Jan 05, 2026
Viaarxiv icon

GAIR: GUI Automation via Information-Joint Reasoning and Group Reflection

Add code
Dec 10, 2025
Figure 1 for GAIR: GUI Automation via Information-Joint Reasoning and Group Reflection
Figure 2 for GAIR: GUI Automation via Information-Joint Reasoning and Group Reflection
Figure 3 for GAIR: GUI Automation via Information-Joint Reasoning and Group Reflection
Figure 4 for GAIR: GUI Automation via Information-Joint Reasoning and Group Reflection
Viaarxiv icon

AccKV: Towards Efficient Audio-Video LLMs Inference via Adaptive-Focusing and Cross-Calibration KV Cache Optimization

Add code
Nov 14, 2025
Figure 1 for AccKV: Towards Efficient Audio-Video LLMs Inference via Adaptive-Focusing and Cross-Calibration KV Cache Optimization
Figure 2 for AccKV: Towards Efficient Audio-Video LLMs Inference via Adaptive-Focusing and Cross-Calibration KV Cache Optimization
Figure 3 for AccKV: Towards Efficient Audio-Video LLMs Inference via Adaptive-Focusing and Cross-Calibration KV Cache Optimization
Figure 4 for AccKV: Towards Efficient Audio-Video LLMs Inference via Adaptive-Focusing and Cross-Calibration KV Cache Optimization
Viaarxiv icon

Graph2Eval: Automatic Multimodal Task Generation for Agents via Knowledge Graphs

Add code
Oct 01, 2025
Figure 1 for Graph2Eval: Automatic Multimodal Task Generation for Agents via Knowledge Graphs
Figure 2 for Graph2Eval: Automatic Multimodal Task Generation for Agents via Knowledge Graphs
Figure 3 for Graph2Eval: Automatic Multimodal Task Generation for Agents via Knowledge Graphs
Figure 4 for Graph2Eval: Automatic Multimodal Task Generation for Agents via Knowledge Graphs
Viaarxiv icon

OS Agents: A Survey on MLLM-based Agents for General Computing Devices Use

Add code
Aug 06, 2025
Figure 1 for OS Agents: A Survey on MLLM-based Agents for General Computing Devices Use
Figure 2 for OS Agents: A Survey on MLLM-based Agents for General Computing Devices Use
Figure 3 for OS Agents: A Survey on MLLM-based Agents for General Computing Devices Use
Figure 4 for OS Agents: A Survey on MLLM-based Agents for General Computing Devices Use
Viaarxiv icon

HarmonyGuard: Toward Safety and Utility in Web Agents via Adaptive Policy Enhancement and Dual-Objective Optimization

Add code
Aug 06, 2025
Viaarxiv icon

EC-Diff: Fast and High-Quality Edge-Cloud Collaborative Inference for Diffusion Models

Add code
Jul 16, 2025
Viaarxiv icon

Constellation as a Service: Tailored Connectivity Management in Direct-Satellite-to-Device Networks

Add code
Jul 01, 2025
Viaarxiv icon