Picture for Mohit Bansal

Mohit Bansal

Shammie

Adaptive Contextual Perception: How to Generalize to New Backgrounds and Ambiguous Objects

Add code
Jun 09, 2023
Figure 1 for Adaptive Contextual Perception: How to Generalize to New Backgrounds and Ambiguous Objects
Figure 2 for Adaptive Contextual Perception: How to Generalize to New Backgrounds and Ambiguous Objects
Figure 3 for Adaptive Contextual Perception: How to Generalize to New Backgrounds and Ambiguous Objects
Figure 4 for Adaptive Contextual Perception: How to Generalize to New Backgrounds and Ambiguous Objects
Viaarxiv icon

Resolving Interference When Merging Models

Add code
Jun 02, 2023
Figure 1 for Resolving Interference When Merging Models
Figure 2 for Resolving Interference When Merging Models
Figure 3 for Resolving Interference When Merging Models
Figure 4 for Resolving Interference When Merging Models
Viaarxiv icon

PanoGen: Text-Conditioned Panoramic Environment Generation for Vision-and-Language Navigation

Add code
May 30, 2023
Viaarxiv icon

Non-Sequential Graph Script Induction via Multimedia Grounding

Add code
May 27, 2023
Viaarxiv icon

Paxion: Patching Action Knowledge in Video-Language Foundation Models

Add code
May 26, 2023
Figure 1 for Paxion: Patching Action Knowledge in Video-Language Foundation Models
Figure 2 for Paxion: Patching Action Knowledge in Video-Language Foundation Models
Figure 3 for Paxion: Patching Action Knowledge in Video-Language Foundation Models
Figure 4 for Paxion: Patching Action Knowledge in Video-Language Foundation Models
Viaarxiv icon

MixCE: Training Autoregressive Language Models by Mixing Forward and Reverse Cross-Entropies

Add code
May 26, 2023
Figure 1 for MixCE: Training Autoregressive Language Models by Mixing Forward and Reverse Cross-Entropies
Figure 2 for MixCE: Training Autoregressive Language Models by Mixing Forward and Reverse Cross-Entropies
Figure 3 for MixCE: Training Autoregressive Language Models by Mixing Forward and Reverse Cross-Entropies
Figure 4 for MixCE: Training Autoregressive Language Models by Mixing Forward and Reverse Cross-Entropies
Viaarxiv icon

Visual Programming for Text-to-Image Generation and Evaluation

Add code
May 24, 2023
Figure 1 for Visual Programming for Text-to-Image Generation and Evaluation
Figure 2 for Visual Programming for Text-to-Image Generation and Evaluation
Figure 3 for Visual Programming for Text-to-Image Generation and Evaluation
Figure 4 for Visual Programming for Text-to-Image Generation and Evaluation
Viaarxiv icon

Any-to-Any Generation via Composable Diffusion

Add code
May 19, 2023
Figure 1 for Any-to-Any Generation via Composable Diffusion
Figure 2 for Any-to-Any Generation via Composable Diffusion
Figure 3 for Any-to-Any Generation via Composable Diffusion
Figure 4 for Any-to-Any Generation via Composable Diffusion
Viaarxiv icon

Self-Chained Image-Language Model for Video Localization and Question Answering

Add code
May 11, 2023
Figure 1 for Self-Chained Image-Language Model for Video Localization and Question Answering
Figure 2 for Self-Chained Image-Language Model for Video Localization and Question Answering
Figure 3 for Self-Chained Image-Language Model for Video Localization and Question Answering
Figure 4 for Self-Chained Image-Language Model for Video Localization and Question Answering
Viaarxiv icon

HistAlign: Improving Context Dependency in Language Generation by Aligning with History

Add code
May 08, 2023
Figure 1 for HistAlign: Improving Context Dependency in Language Generation by Aligning with History
Figure 2 for HistAlign: Improving Context Dependency in Language Generation by Aligning with History
Figure 3 for HistAlign: Improving Context Dependency in Language Generation by Aligning with History
Figure 4 for HistAlign: Improving Context Dependency in Language Generation by Aligning with History
Viaarxiv icon