Picture for Zhenyu Yang

Zhenyu Yang

ScoutAttention: Efficient KV Cache Offloading via Layer-Ahead CPU Pre-computation for LLM Inference

Add code
Mar 28, 2026
Viaarxiv icon

When Models Judge Themselves: Unsupervised Self-Evolution for Multimodal Reasoning

Add code
Mar 22, 2026
Viaarxiv icon

Efficiency Follows Global-Local Decoupling

Add code
Mar 20, 2026
Viaarxiv icon

Beyond Quadratic: Linear-Time Change Detection with RWKV

Add code
Mar 20, 2026
Viaarxiv icon

UniRef-Image-Edit: Towards Scalable and Consistent Multi-Reference Image Editing

Add code
Feb 15, 2026
Viaarxiv icon

Towards Remote Sensing Change Detection with Neural Memory

Add code
Feb 11, 2026
Viaarxiv icon

MobileWorld: Benchmarking Autonomous Mobile Agents in Agent-User Interactive, and MCP-Augmented Environments

Add code
Dec 22, 2025
Figure 1 for MobileWorld: Benchmarking Autonomous Mobile Agents in Agent-User Interactive, and MCP-Augmented Environments
Figure 2 for MobileWorld: Benchmarking Autonomous Mobile Agents in Agent-User Interactive, and MCP-Augmented Environments
Figure 3 for MobileWorld: Benchmarking Autonomous Mobile Agents in Agent-User Interactive, and MCP-Augmented Environments
Figure 4 for MobileWorld: Benchmarking Autonomous Mobile Agents in Agent-User Interactive, and MCP-Augmented Environments
Viaarxiv icon

LiveStar: Live Streaming Assistant for Real-World Online Video Understanding

Add code
Nov 07, 2025
Viaarxiv icon

C-MAG: Cascade Multimodal Attributed Graphs for Supply Chain Link Prediction

Add code
Aug 13, 2025
Viaarxiv icon

Efficient Agent: Optimizing Planning Capability for Multimodal Retrieval Augmented Generation

Add code
Aug 12, 2025
Viaarxiv icon