Picture for Yang Shi

Yang Shi

Majorization-Guided Test-Time Adaptation for Vision-Language Models under Modality-Specific Shift

Add code
Apr 27, 2026
Viaarxiv icon

TransSplat: Unbalanced Semantic Transport for Language-Driven 3DGS Editing

Add code
Apr 21, 2026
Viaarxiv icon

OpenWorldLib: A Unified Codebase and Definition of Advanced World Models

Add code
Apr 06, 2026
Viaarxiv icon

Agentic-MME: What Agentic Capability Really Brings to Multimodal Intelligence?

Add code
Apr 03, 2026
Viaarxiv icon

Architecture-Agnostic Feature Synergy for Universal Defense Against Heterogeneous Generative Threats

Add code
Mar 16, 2026
Viaarxiv icon

VTC-Bench: Evaluating Agentic Multimodal Models via Compositional Visual Tool Chaining

Add code
Mar 16, 2026
Viaarxiv icon

Compressed Proximal Federated Learning for Non-Convex Composite Optimization on Heterogeneous Data

Add code
Mar 08, 2026
Viaarxiv icon

ChordEdit: One-Step Low-Energy Transport for Image Editing

Add code
Feb 22, 2026
Viaarxiv icon

BrowseComp-$V^3$: A Visual, Vertical, and Verifiable Benchmark for Multimodal Browsing Agents

Add code
Feb 13, 2026
Viaarxiv icon

Adaptive Scaffolding for Cognitive Engagement in an Intelligent Tutoring System

Add code
Feb 07, 2026
Viaarxiv icon