Picture for Xue Yang

Xue Yang

Breaking Bad Molecules: Are MLLMs Ready for Structure-Level Molecular Detoxification?

Add code
Jun 12, 2025
Viaarxiv icon

ComfyUI-R1: Exploring Reasoning Models for Workflow Generation

Add code
Jun 11, 2025
Viaarxiv icon

SpaCE-10: A Comprehensive Benchmark for Multimodal Large Language Models in Compositional Spatial Intelligence

Add code
Jun 09, 2025
Viaarxiv icon

ComfyUI-Copilot: An Intelligent Assistant for Automated Workflow Development

Add code
Jun 05, 2025
Viaarxiv icon

Learning Adaptive and Temporally Causal Video Tokenization in a 1D Latent Space

Add code
May 22, 2025
Viaarxiv icon

Raw2Drive: Reinforcement Learning with Aligned World Models for End-to-End Autonomous Driving (in CARLA v2)

Add code
May 22, 2025
Viaarxiv icon

InstructSAM: A Training-Free Framework for Instruction-Oriented Remote Sensing Object Recognition

Add code
May 21, 2025
Viaarxiv icon

Interleave-VLA: Enhancing Robot Manipulation with Interleaved Image-Text Instructions

Add code
May 04, 2025
Viaarxiv icon

Earth-Adapter: Bridge the Geospatial Domain Gaps with Mixture of Frequency Adaptation

Add code
Apr 09, 2025
Viaarxiv icon

A Unified Agentic Framework for Evaluating Conditional Image Generation

Add code
Apr 09, 2025
Viaarxiv icon