Picture for Trevor Darrell

Trevor Darrell

MultiGen: Using Multimodal Generation in Simulation to Learn Multimodal Policies in Real

Add code
Jul 03, 2025
Viaarxiv icon

Activation Reward Models for Few-Shot Model Alignment

Add code
Jul 02, 2025
Viaarxiv icon

Whole-Body Conditioned Egocentric Video Prediction

Add code
Jun 26, 2025
Figure 1 for Whole-Body Conditioned Egocentric Video Prediction
Figure 2 for Whole-Body Conditioned Egocentric Video Prediction
Figure 3 for Whole-Body Conditioned Egocentric Video Prediction
Figure 4 for Whole-Body Conditioned Egocentric Video Prediction
Viaarxiv icon

LeVERB: Humanoid Whole-Body Control with Latent Vision-Language Instruction

Add code
Jun 16, 2025
Figure 1 for LeVERB: Humanoid Whole-Body Control with Latent Vision-Language Instruction
Figure 2 for LeVERB: Humanoid Whole-Body Control with Latent Vision-Language Instruction
Figure 3 for LeVERB: Humanoid Whole-Body Control with Latent Vision-Language Instruction
Figure 4 for LeVERB: Humanoid Whole-Body Control with Latent Vision-Language Instruction
Viaarxiv icon

Hidden in plain sight: VLMs overlook their visual representations

Add code
Jun 09, 2025
Viaarxiv icon

Search Arena: Analyzing Search-Augmented LLMs

Add code
Jun 05, 2025
Viaarxiv icon

REOrdering Patches Improves Vision Models

Add code
May 29, 2025
Viaarxiv icon

Puzzled by Puzzles: When Vision-Language Models Can't Take a Hint

Add code
May 29, 2025
Viaarxiv icon

Visual Imitation Enables Contextual Humanoid Control

Add code
May 07, 2025
Figure 1 for Visual Imitation Enables Contextual Humanoid Control
Figure 2 for Visual Imitation Enables Contextual Humanoid Control
Figure 3 for Visual Imitation Enables Contextual Humanoid Control
Figure 4 for Visual Imitation Enables Contextual Humanoid Control
Viaarxiv icon

LISAT: Language-Instructed Segmentation Assistant for Satellite Imagery

Add code
May 05, 2025
Viaarxiv icon