Picture for Alireza Fathi

Alireza Fathi

A Generative Approach for Wikipedia-Scale Visual Entity Recognition

Add code
Mar 04, 2024
Figure 1 for A Generative Approach for Wikipedia-Scale Visual Entity Recognition
Figure 2 for A Generative Approach for Wikipedia-Scale Visual Entity Recognition
Figure 3 for A Generative Approach for Wikipedia-Scale Visual Entity Recognition
Figure 4 for A Generative Approach for Wikipedia-Scale Visual Entity Recognition
Viaarxiv icon

SceneCraft: An LLM Agent for Synthesizing 3D Scene as Blender Code

Add code
Mar 02, 2024
Figure 1 for SceneCraft: An LLM Agent for Synthesizing 3D Scene as Blender Code
Figure 2 for SceneCraft: An LLM Agent for Synthesizing 3D Scene as Blender Code
Figure 3 for SceneCraft: An LLM Agent for Synthesizing 3D Scene as Blender Code
Figure 4 for SceneCraft: An LLM Agent for Synthesizing 3D Scene as Blender Code
Viaarxiv icon

AVIS: Autonomous Visual Information Seeking with Large Language Models

Add code
Jun 13, 2023
Figure 1 for AVIS: Autonomous Visual Information Seeking with Large Language Models
Figure 2 for AVIS: Autonomous Visual Information Seeking with Large Language Models
Figure 3 for AVIS: Autonomous Visual Information Seeking with Large Language Models
Figure 4 for AVIS: Autonomous Visual Information Seeking with Large Language Models
Viaarxiv icon

Retrieval-Enhanced Contrastive Vision-Text Models

Add code
Jun 12, 2023
Figure 1 for Retrieval-Enhanced Contrastive Vision-Text Models
Figure 2 for Retrieval-Enhanced Contrastive Vision-Text Models
Figure 3 for Retrieval-Enhanced Contrastive Vision-Text Models
Figure 4 for Retrieval-Enhanced Contrastive Vision-Text Models
Viaarxiv icon

Improving Image Recognition by Retrieving from Web-Scale Image-Text Data

Add code
Apr 11, 2023
Figure 1 for Improving Image Recognition by Retrieving from Web-Scale Image-Text Data
Figure 2 for Improving Image Recognition by Retrieving from Web-Scale Image-Text Data
Figure 3 for Improving Image Recognition by Retrieving from Web-Scale Image-Text Data
Figure 4 for Improving Image Recognition by Retrieving from Web-Scale Image-Text Data
Viaarxiv icon

Learning Object-Centric Neural Scattering Functions for Free-viewpoint Relighting and Scene Composition

Add code
Mar 10, 2023
Figure 1 for Learning Object-Centric Neural Scattering Functions for Free-viewpoint Relighting and Scene Composition
Figure 2 for Learning Object-Centric Neural Scattering Functions for Free-viewpoint Relighting and Scene Composition
Figure 3 for Learning Object-Centric Neural Scattering Functions for Free-viewpoint Relighting and Scene Composition
Figure 4 for Learning Object-Centric Neural Scattering Functions for Free-viewpoint Relighting and Scene Composition
Viaarxiv icon

REVEAL: Retrieval-Augmented Visual-Language Pre-Training with Multi-Source Multimodal Knowledge Memory

Add code
Dec 10, 2022
Figure 1 for REVEAL: Retrieval-Augmented Visual-Language Pre-Training with Multi-Source Multimodal Knowledge Memory
Figure 2 for REVEAL: Retrieval-Augmented Visual-Language Pre-Training with Multi-Source Multimodal Knowledge Memory
Figure 3 for REVEAL: Retrieval-Augmented Visual-Language Pre-Training with Multi-Source Multimodal Knowledge Memory
Figure 4 for REVEAL: Retrieval-Augmented Visual-Language Pre-Training with Multi-Source Multimodal Knowledge Memory
Viaarxiv icon

A Memory Transformer Network for Incremental Learning

Add code
Oct 10, 2022
Figure 1 for A Memory Transformer Network for Incremental Learning
Figure 2 for A Memory Transformer Network for Incremental Learning
Figure 3 for A Memory Transformer Network for Incremental Learning
Figure 4 for A Memory Transformer Network for Incremental Learning
Viaarxiv icon

im2nerf: Image to Neural Radiance Field in the Wild

Add code
Sep 08, 2022
Figure 1 for im2nerf: Image to Neural Radiance Field in the Wild
Figure 2 for im2nerf: Image to Neural Radiance Field in the Wild
Figure 3 for im2nerf: Image to Neural Radiance Field in the Wild
Figure 4 for im2nerf: Image to Neural Radiance Field in the Wild
Viaarxiv icon

Panoptic Neural Fields: A Semantic Object-Aware Neural Scene Representation

Add code
May 09, 2022
Figure 1 for Panoptic Neural Fields: A Semantic Object-Aware Neural Scene Representation
Figure 2 for Panoptic Neural Fields: A Semantic Object-Aware Neural Scene Representation
Figure 3 for Panoptic Neural Fields: A Semantic Object-Aware Neural Scene Representation
Figure 4 for Panoptic Neural Fields: A Semantic Object-Aware Neural Scene Representation
Viaarxiv icon