Xue Yang

SpaCE-10: A Comprehensive Benchmark for Multimodal Large Language Models in Compositional Spatial Intelligence

Jun 09, 2025
Via arXiv

ComfyUI-Copilot: An Intelligent Assistant for Automated Workflow Development

Jun 05, 2025
Via arXiv

Raw2Drive: Reinforcement Learning with Aligned World Models for End-to-End Autonomous Driving (in CARLA v2)

May 22, 2025
Via arXiv

Learning Adaptive and Temporally Causal Video Tokenization in a 1D Latent Space

May 22, 2025
Via arXiv

InstructSAM: A Training-Free Framework for Instruction-Oriented Remote Sensing Object Recognition

May 21, 2025
Via arXiv

Interleave-VLA: Enhancing Robot Manipulation with Interleaved Image-Text Instructions

May 04, 2025
Via arXiv

A Unified Agentic Framework for Evaluating Conditional Image Generation

Apr 09, 2025
Via arXiv

Earth-Adapter: Bridge the Geospatial Domain Gaps with Mixture of Frequency Adaptation

Apr 09, 2025
Via arXiv

Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing

Apr 03, 2025
Via arXiv

SA-Occ: Satellite-Assisted 3D Occupancy Prediction in Real World

Mar 20, 2025
Via arXiv