Picture for Trevor Darrell

Trevor Darrell

Describing Differences in Image Sets with Natural Language

Add code
Dec 05, 2023
Figure 1 for Describing Differences in Image Sets with Natural Language
Figure 2 for Describing Differences in Image Sets with Natural Language
Figure 3 for Describing Differences in Image Sets with Natural Language
Figure 4 for Describing Differences in Image Sets with Natural Language
Viaarxiv icon

Readout Guidance: Learning Control from Diffusion Features

Add code
Dec 04, 2023
Figure 1 for Readout Guidance: Learning Control from Diffusion Features
Figure 2 for Readout Guidance: Learning Control from Diffusion Features
Figure 3 for Readout Guidance: Learning Control from Diffusion Features
Figure 4 for Readout Guidance: Learning Control from Diffusion Features
Viaarxiv icon

IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks

Add code
Dec 04, 2023
Figure 1 for IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks
Figure 2 for IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks
Figure 3 for IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks
Figure 4 for IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks
Viaarxiv icon

Recursive Visual Programming

Add code
Dec 04, 2023
Figure 1 for Recursive Visual Programming
Figure 2 for Recursive Visual Programming
Figure 3 for Recursive Visual Programming
Figure 4 for Recursive Visual Programming
Viaarxiv icon

Sequential Modeling Enables Scalable Learning for Large Vision Models

Add code
Dec 01, 2023
Figure 1 for Sequential Modeling Enables Scalable Learning for Large Vision Models
Figure 2 for Sequential Modeling Enables Scalable Learning for Large Vision Models
Figure 3 for Sequential Modeling Enables Scalable Learning for Large Vision Models
Figure 4 for Sequential Modeling Enables Scalable Learning for Large Vision Models
Viaarxiv icon

Initializing Models with Larger Ones

Add code
Nov 30, 2023
Figure 1 for Initializing Models with Larger Ones
Figure 2 for Initializing Models with Larger Ones
Figure 3 for Initializing Models with Larger Ones
Figure 4 for Initializing Models with Larger Ones
Viaarxiv icon

Object-based (yet Class-agnostic) Video Domain Adaptation

Add code
Nov 29, 2023
Figure 1 for Object-based (yet Class-agnostic) Video Domain Adaptation
Figure 2 for Object-based (yet Class-agnostic) Video Domain Adaptation
Figure 3 for Object-based (yet Class-agnostic) Video Domain Adaptation
Figure 4 for Object-based (yet Class-agnostic) Video Domain Adaptation
Viaarxiv icon

Compositional Chain-of-Thought Prompting for Large Multimodal Models

Add code
Nov 27, 2023
Figure 1 for Compositional Chain-of-Thought Prompting for Large Multimodal Models
Figure 2 for Compositional Chain-of-Thought Prompting for Large Multimodal Models
Figure 3 for Compositional Chain-of-Thought Prompting for Large Multimodal Models
Figure 4 for Compositional Chain-of-Thought Prompting for Large Multimodal Models
Viaarxiv icon

Self-correcting LLM-controlled Diffusion Models

Add code
Nov 27, 2023
Viaarxiv icon

From Wrong To Right: A Recursive Approach Towards Vision-Language Explanation

Add code
Nov 21, 2023
Figure 1 for From Wrong To Right: A Recursive Approach Towards Vision-Language Explanation
Figure 2 for From Wrong To Right: A Recursive Approach Towards Vision-Language Explanation
Figure 3 for From Wrong To Right: A Recursive Approach Towards Vision-Language Explanation
Figure 4 for From Wrong To Right: A Recursive Approach Towards Vision-Language Explanation
Viaarxiv icon