Picture for Li Fei-Fei

Li Fei-Fei

Stanford University

VideoWeave: A Data-Centric Approach for Efficient Video Understanding

Add code
Jan 09, 2026
Viaarxiv icon

PointWorld: Scaling 3D World Models for In-The-Wild Robotic Manipulation

Add code
Jan 07, 2026
Viaarxiv icon

Dream2Flow: Bridging Video Generation and Open-World Manipulation with 3D Object Flow

Add code
Dec 31, 2025
Viaarxiv icon

VLIC: Vision-Language Models As Perceptual Judges for Human-Aligned Image Compression

Add code
Dec 17, 2025
Figure 1 for VLIC: Vision-Language Models As Perceptual Judges for Human-Aligned Image Compression
Figure 2 for VLIC: Vision-Language Models As Perceptual Judges for Human-Aligned Image Compression
Figure 3 for VLIC: Vision-Language Models As Perceptual Judges for Human-Aligned Image Compression
Figure 4 for VLIC: Vision-Language Models As Perceptual Judges for Human-Aligned Image Compression
Viaarxiv icon

Cambrian-S: Towards Spatial Supersensing in Video

Add code
Nov 06, 2025
Viaarxiv icon

GET-USE: Learning Generalized Tool Usage for Bimanual Mobile Manipulation via Simulated Embodiment Extensions

Add code
Oct 29, 2025
Viaarxiv icon

Spatial Mental Modeling from Limited Views

Add code
Jun 26, 2025
Figure 1 for Spatial Mental Modeling from Limited Views
Figure 2 for Spatial Mental Modeling from Limited Views
Figure 3 for Spatial Mental Modeling from Limited Views
Figure 4 for Spatial Mental Modeling from Limited Views
Viaarxiv icon

UAD: Unsupervised Affordance Distillation for Generalization in Robotic Manipulation

Add code
Jun 10, 2025
Viaarxiv icon

Exploring Diffusion Transformer Designs via Grafting

Add code
Jun 06, 2025
Viaarxiv icon

RAGEN: Understanding Self-Evolution in LLM Agents via Multi-Turn Reinforcement Learning

Add code
Apr 24, 2025
Viaarxiv icon