Alert button
Picture for Alireza Fathi

Alireza Fathi

Alert button

A Generative Approach for Wikipedia-Scale Visual Entity Recognition

Mar 04, 2024
Mathilde Caron, Ahmet Iscen, Alireza Fathi, Cordelia Schmid

Figure 1 for A Generative Approach for Wikipedia-Scale Visual Entity Recognition
Figure 2 for A Generative Approach for Wikipedia-Scale Visual Entity Recognition
Figure 3 for A Generative Approach for Wikipedia-Scale Visual Entity Recognition
Figure 4 for A Generative Approach for Wikipedia-Scale Visual Entity Recognition
Viaarxiv icon

SceneCraft: An LLM Agent for Synthesizing 3D Scene as Blender Code

Mar 02, 2024
Ziniu Hu, Ahmet Iscen, Aashi Jain, Thomas Kipf, Yisong Yue, David A. Ross, Cordelia Schmid, Alireza Fathi

Figure 1 for SceneCraft: An LLM Agent for Synthesizing 3D Scene as Blender Code
Figure 2 for SceneCraft: An LLM Agent for Synthesizing 3D Scene as Blender Code
Figure 3 for SceneCraft: An LLM Agent for Synthesizing 3D Scene as Blender Code
Figure 4 for SceneCraft: An LLM Agent for Synthesizing 3D Scene as Blender Code
Viaarxiv icon

AVIS: Autonomous Visual Information Seeking with Large Language Models

Jun 13, 2023
Ziniu Hu, Ahmet Iscen, Chen Sun, Kai-Wei Chang, Yizhou Sun, David A Ross, Cordelia Schmid, Alireza Fathi

Figure 1 for AVIS: Autonomous Visual Information Seeking with Large Language Models
Figure 2 for AVIS: Autonomous Visual Information Seeking with Large Language Models
Figure 3 for AVIS: Autonomous Visual Information Seeking with Large Language Models
Figure 4 for AVIS: Autonomous Visual Information Seeking with Large Language Models
Viaarxiv icon

Retrieval-Enhanced Contrastive Vision-Text Models

Jun 12, 2023
Ahmet Iscen, Mathilde Caron, Alireza Fathi, Cordelia Schmid

Figure 1 for Retrieval-Enhanced Contrastive Vision-Text Models
Figure 2 for Retrieval-Enhanced Contrastive Vision-Text Models
Figure 3 for Retrieval-Enhanced Contrastive Vision-Text Models
Figure 4 for Retrieval-Enhanced Contrastive Vision-Text Models
Viaarxiv icon

Improving Image Recognition by Retrieving from Web-Scale Image-Text Data

Apr 11, 2023
Ahmet Iscen, Alireza Fathi, Cordelia Schmid

Figure 1 for Improving Image Recognition by Retrieving from Web-Scale Image-Text Data
Figure 2 for Improving Image Recognition by Retrieving from Web-Scale Image-Text Data
Figure 3 for Improving Image Recognition by Retrieving from Web-Scale Image-Text Data
Figure 4 for Improving Image Recognition by Retrieving from Web-Scale Image-Text Data
Viaarxiv icon

Learning Object-Centric Neural Scattering Functions for Free-viewpoint Relighting and Scene Composition

Mar 10, 2023
Hong-Xing Yu, Michelle Guo, Alireza Fathi, Yen-Yu Chang, Eric Ryan Chan, Ruohan Gao, Thomas Funkhouser, Jiajun Wu

Figure 1 for Learning Object-Centric Neural Scattering Functions for Free-viewpoint Relighting and Scene Composition
Figure 2 for Learning Object-Centric Neural Scattering Functions for Free-viewpoint Relighting and Scene Composition
Figure 3 for Learning Object-Centric Neural Scattering Functions for Free-viewpoint Relighting and Scene Composition
Figure 4 for Learning Object-Centric Neural Scattering Functions for Free-viewpoint Relighting and Scene Composition
Viaarxiv icon

REVEAL: Retrieval-Augmented Visual-Language Pre-Training with Multi-Source Multimodal Knowledge Memory

Dec 10, 2022
Ziniu Hu, Ahmet Iscen, Chen Sun, Zirui Wang, Kai-Wei Chang, Yizhou Sun, Cordelia Schmid, David A. Ross, Alireza Fathi

Figure 1 for REVEAL: Retrieval-Augmented Visual-Language Pre-Training with Multi-Source Multimodal Knowledge Memory
Figure 2 for REVEAL: Retrieval-Augmented Visual-Language Pre-Training with Multi-Source Multimodal Knowledge Memory
Figure 3 for REVEAL: Retrieval-Augmented Visual-Language Pre-Training with Multi-Source Multimodal Knowledge Memory
Figure 4 for REVEAL: Retrieval-Augmented Visual-Language Pre-Training with Multi-Source Multimodal Knowledge Memory
Viaarxiv icon

A Memory Transformer Network for Incremental Learning

Oct 10, 2022
Ahmet Iscen, Thomas Bird, Mathilde Caron, Alireza Fathi, Cordelia Schmid

Figure 1 for A Memory Transformer Network for Incremental Learning
Figure 2 for A Memory Transformer Network for Incremental Learning
Figure 3 for A Memory Transformer Network for Incremental Learning
Figure 4 for A Memory Transformer Network for Incremental Learning
Viaarxiv icon

im2nerf: Image to Neural Radiance Field in the Wild

Sep 08, 2022
Lu Mi, Abhijit Kundu, David Ross, Frank Dellaert, Noah Snavely, Alireza Fathi

Figure 1 for im2nerf: Image to Neural Radiance Field in the Wild
Figure 2 for im2nerf: Image to Neural Radiance Field in the Wild
Figure 3 for im2nerf: Image to Neural Radiance Field in the Wild
Figure 4 for im2nerf: Image to Neural Radiance Field in the Wild
Viaarxiv icon

Panoptic Neural Fields: A Semantic Object-Aware Neural Scene Representation

May 09, 2022
Abhijit Kundu, Kyle Genova, Xiaoqi Yin, Alireza Fathi, Caroline Pantofaru, Leonidas Guibas, Andrea Tagliasacchi, Frank Dellaert, Thomas Funkhouser

Figure 1 for Panoptic Neural Fields: A Semantic Object-Aware Neural Scene Representation
Figure 2 for Panoptic Neural Fields: A Semantic Object-Aware Neural Scene Representation
Figure 3 for Panoptic Neural Fields: A Semantic Object-Aware Neural Scene Representation
Figure 4 for Panoptic Neural Fields: A Semantic Object-Aware Neural Scene Representation
Viaarxiv icon