Picture for Tianyu Yang

Tianyu Yang

Alibaba DAMO Academy

Mind the Gap Between Spatial Reasoning and Acting! Step-by-Step Evaluation of Agents With Spatial-Gym

Add code
Apr 10, 2026
Viaarxiv icon

Unifying Group-Relative and Self-Distillation Policy Optimization via Sample Routing

Add code
Apr 02, 2026
Viaarxiv icon

PLUME: Latent Reasoning Based Universal Multimodal Embedding

Add code
Apr 02, 2026
Viaarxiv icon

M2P: Improving Visual Foundation Models with Mask-to-Point Weakly-Supervised Learning for Dense Point Tracking

Add code
Mar 18, 2026
Viaarxiv icon

Deconstructing Multimodal Mathematical Reasoning: Towards a Unified Perception-Alignment-Reasoning Paradigm

Add code
Mar 09, 2026
Viaarxiv icon

TRACE: Task-Adaptive Reasoning and Representation Learning for Universal Multimodal Retrieval

Add code
Mar 04, 2026
Viaarxiv icon

WildActor: Unconstrained Identity-Preserving Video Generation

Add code
Feb 28, 2026
Viaarxiv icon

WISER: Wider Search, Deeper Thinking, and Adaptive Fusion for Training-Free Zero-Shot Composed Image Retrieval

Add code
Feb 26, 2026
Viaarxiv icon

Infinite-World: Scaling Interactive World Models to 1000-Frame Horizons via Pose-Free Hierarchical Memory

Add code
Feb 03, 2026
Viaarxiv icon

ReCALL: Recalibrating Capability Degradation for MLLM-based Composed Image Retrieval

Add code
Feb 02, 2026
Viaarxiv icon