Picture for Yifan Xu

Yifan Xu

Toward Cognitive Supersensing in Multimodal Large Language Model

Add code
Feb 02, 2026
Viaarxiv icon

PerfGuard: A Performance-Aware Agent for Visual Content Generation

Add code
Jan 30, 2026
Viaarxiv icon

Reframing Conversational Design in HRI: Deliberate Design with AI Scaffolds

Add code
Jan 17, 2026
Viaarxiv icon

Rotate Your Character: Revisiting Video Diffusion Models for High-Quality 3D Character Generation

Add code
Jan 09, 2026
Viaarxiv icon

Benchmarking neural surrogates on realistic spatiotemporal multiphysics flows

Add code
Dec 21, 2025
Viaarxiv icon

Unifying Deep Predicate Invention with Pre-trained Foundation Models

Add code
Dec 19, 2025
Viaarxiv icon

Cambrian-S: Towards Spatial Supersensing in Video

Add code
Nov 06, 2025
Viaarxiv icon

Clone Deterministic 3D Worlds with Geometrically-Regularized World Models

Add code
Oct 30, 2025
Viaarxiv icon

Taming VR Teleoperation and Learning from Demonstration for Multi-Task Bimanual Table Service Manipulation

Add code
Aug 21, 2025
Figure 1 for Taming VR Teleoperation and Learning from Demonstration for Multi-Task Bimanual Table Service Manipulation
Figure 2 for Taming VR Teleoperation and Learning from Demonstration for Multi-Task Bimanual Table Service Manipulation
Figure 3 for Taming VR Teleoperation and Learning from Demonstration for Multi-Task Bimanual Table Service Manipulation
Figure 4 for Taming VR Teleoperation and Learning from Demonstration for Multi-Task Bimanual Table Service Manipulation
Viaarxiv icon

Spatial 3D-LLM: Exploring Spatial Awareness in 3D Vision-Language Models

Add code
Jul 22, 2025
Figure 1 for Spatial 3D-LLM: Exploring Spatial Awareness in 3D Vision-Language Models
Figure 2 for Spatial 3D-LLM: Exploring Spatial Awareness in 3D Vision-Language Models
Figure 3 for Spatial 3D-LLM: Exploring Spatial Awareness in 3D Vision-Language Models
Figure 4 for Spatial 3D-LLM: Exploring Spatial Awareness in 3D Vision-Language Models
Viaarxiv icon