Picture for Shengyu Zhang

Shengyu Zhang

Yusuf Hamied Department of Chemistry, University of Cambridge, UK

Semantic Trimming and Auxiliary Multi-step Prediction for Generative Recommendation

Add code
Apr 07, 2026
Viaarxiv icon

CIAR: Interval-based Collaborative Decoding for Image Generation Acceleration

Add code
Mar 26, 2026
Viaarxiv icon

ZeroFold: Protein-RNA Binding Affinity Predictions from Pre-Structural Embeddings

Add code
Mar 24, 2026
Viaarxiv icon

World-Model-Augmented Web Agents with Action Correction

Add code
Feb 17, 2026
Viaarxiv icon

SafePred: A Predictive Guardrail for Computer-Using Agents via World Models

Add code
Feb 02, 2026
Viaarxiv icon

MALLOC: Benchmarking the Memory-aware Long Sequence Compression for Large Sequential Recommendation

Add code
Jan 29, 2026
Viaarxiv icon

CORE: Code-based Inverse Self-Training Framework with Graph Expansion for Virtual Agents

Add code
Jan 05, 2026
Viaarxiv icon

GAIR: GUI Automation via Information-Joint Reasoning and Group Reflection

Add code
Dec 10, 2025
Figure 1 for GAIR: GUI Automation via Information-Joint Reasoning and Group Reflection
Figure 2 for GAIR: GUI Automation via Information-Joint Reasoning and Group Reflection
Figure 3 for GAIR: GUI Automation via Information-Joint Reasoning and Group Reflection
Figure 4 for GAIR: GUI Automation via Information-Joint Reasoning and Group Reflection
Viaarxiv icon

AccKV: Towards Efficient Audio-Video LLMs Inference via Adaptive-Focusing and Cross-Calibration KV Cache Optimization

Add code
Nov 14, 2025
Figure 1 for AccKV: Towards Efficient Audio-Video LLMs Inference via Adaptive-Focusing and Cross-Calibration KV Cache Optimization
Figure 2 for AccKV: Towards Efficient Audio-Video LLMs Inference via Adaptive-Focusing and Cross-Calibration KV Cache Optimization
Figure 3 for AccKV: Towards Efficient Audio-Video LLMs Inference via Adaptive-Focusing and Cross-Calibration KV Cache Optimization
Figure 4 for AccKV: Towards Efficient Audio-Video LLMs Inference via Adaptive-Focusing and Cross-Calibration KV Cache Optimization
Viaarxiv icon

Graph2Eval: Automatic Multimodal Task Generation for Agents via Knowledge Graphs

Add code
Oct 01, 2025
Figure 1 for Graph2Eval: Automatic Multimodal Task Generation for Agents via Knowledge Graphs
Figure 2 for Graph2Eval: Automatic Multimodal Task Generation for Agents via Knowledge Graphs
Figure 3 for Graph2Eval: Automatic Multimodal Task Generation for Agents via Knowledge Graphs
Figure 4 for Graph2Eval: Automatic Multimodal Task Generation for Agents via Knowledge Graphs
Viaarxiv icon