Ali Farhadi

MERLOT Reserve: Neural Script Knowledge through Vision and Language and Sound

Jan 07, 2022
Via arXiv

The Introspective Agent: Interdependence of Strategy, Physiology, and Sensing for Embodied Agents

Jan 02, 2022
Via arXiv

Forward Compatible Training for Representation Learning

Dec 06, 2021
Via arXiv

Iconary: A Pictionary-Based Game for Testing Multimodal Communication with Drawings and Text

Dec 01, 2021
Via arXiv

LCS: Learning Compressible Subspaces for Adaptive Network Compression at Inference Time

Oct 08, 2021
Via arXiv

Robust fine-tuning of zero-shot models

Sep 04, 2021
Via arXiv

LanguageRefer: Spatial-Language Model for 3D Visual Grounding

Jul 22, 2021
Via arXiv

MERLOT: Multimodal Neural Script Knowledge Models

Jun 10, 2021
Via arXiv

LLC: Accurate, Multi-purpose Learnt Low-dimensional Binary Codes

Jun 02, 2021
Via arXiv

PIGLeT: Language Grounding Through Neuro-Symbolic Interaction in a 3D World

Jun 01, 2021
Via arXiv