Trevor Darrell

Simple Token-Level Confidence Improves Caption Correctness

May 11, 2023
Via arXiv

Incorporating Structured Representations into Pretrained Vision & Language Models Using Scene Graphs

May 10, 2023
Via arXiv

PAIR-Diffusion: Object-Level Image Editing with Structure-and-Appearance Paired Diffusion Models

Mar 30, 2023
Via arXiv

Top-Down Visual Attention from Analysis by Synthesis

Mar 24, 2023
Via arXiv

Learning and Verification of Task Structure in Instructional Videos

Mar 23, 2023
Via arXiv

Scaling Vision-Language Models with Sparse Mixture of Experts

Mar 13, 2023
Via arXiv

Learning Humanoid Locomotion with Transformers

Mar 06, 2023
Via arXiv

Dropout Reduces Underfitting

Mar 02, 2023
Via arXiv

Guiding Pretraining in Reinforcement Learning with Large Language Models

Feb 13, 2023
Via arXiv

Scale-MAE: A Scale-Aware Masked Autoencoder for Multiscale Geospatial Representation Learning

Jan 02, 2023
Via arXiv