Yutian Chen

AnyRecon: Arbitrary-View 3D Reconstruction with Video Diffusion Model

Apr 21, 2026

HiVLA: A Visual-Grounded-Centric Hierarchical Embodied Manipulation System

Apr 15, 2026

ShotStream: Streaming Multi-Shot Video Generation for Interactive Storytelling

Mar 26, 2026

Attention Residuals

Mar 16, 2026

Kimi K2.5: Visual Agentic Intelligence

Feb 02, 2026

Co-Me: Confidence-Guided Token Merging for Visual Geometric Transformers

Nov 18, 2025

Kimi Linear: An Expressive, Efficient Attention Architecture

Oct 30, 2025

Virtual Community: An Open World for Humans, Robots, and Society

Aug 20, 2025

Kimi K2: Open Agentic Intelligence

Jul 28, 2025

UFM: A Simple Path towards Unified Dense Correspondence with Flow

Jun 10, 2025