Xue Yang

SafeSteer: A Decoding-level Defense Mechanism for Multimodal Large Language Models

May 12, 2026
Via arXiv

ComplexMCP: Evaluation of LLM Agents in Dynamic, Interdependent, and Large-Scale Tool Sandbox

May 11, 2026
Via arXiv

SWIFT: Prompt-Adaptive Memory for Efficient Interactive Long Video Generation

May 10, 2026
Via arXiv

Quantum Kernel Advantage over Classical Collapse in Medical Foundation Model Embeddings

Apr 27, 2026
Via arXiv

MM-WebAgent: A Hierarchical Multimodal Web Agent for Webpage Generation

Apr 16, 2026
Via arXiv

HiProto: Hierarchical Prototype Learning for Interpretable Object Detection Under Low-quality Conditions

Apr 15, 2026
Via arXiv

Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding

Apr 06, 2026
Via arXiv

BizGenEval: A Systematic Benchmark for Commercial Visual Content Generation

Mar 26, 2026
Via arXiv

End-to-End QGAN-Based Image Synthesis via Neural Noise Encoding and Intensity Calibration

Mar 19, 2026
Via arXiv

Enhancing Pretrained Model-based Continual Representation Learning via Guided Random Projection

Mar 19, 2026
Via arXiv