
Alireza Fathi

A Generative Approach for Wikipedia-Scale Visual Entity Recognition

Mar 04, 2024
Mathilde Caron, Ahmet Iscen, Alireza Fathi, Cordelia Schmid

Via arXiv

SceneCraft: An LLM Agent for Synthesizing 3D Scene as Blender Code

Mar 02, 2024
Ziniu Hu, Ahmet Iscen, Aashi Jain, Thomas Kipf, Yisong Yue, David A. Ross, Cordelia Schmid, Alireza Fathi

Via arXiv

AVIS: Autonomous Visual Information Seeking with Large Language Models

Jun 13, 2023
Ziniu Hu, Ahmet Iscen, Chen Sun, Kai-Wei Chang, Yizhou Sun, David A. Ross, Cordelia Schmid, Alireza Fathi

Via arXiv

Retrieval-Enhanced Contrastive Vision-Text Models

Jun 12, 2023
Ahmet Iscen, Mathilde Caron, Alireza Fathi, Cordelia Schmid

Via arXiv

Improving Image Recognition by Retrieving from Web-Scale Image-Text Data

Apr 11, 2023
Ahmet Iscen, Alireza Fathi, Cordelia Schmid

Via arXiv

Learning Object-Centric Neural Scattering Functions for Free-viewpoint Relighting and Scene Composition

Mar 10, 2023
Hong-Xing Yu, Michelle Guo, Alireza Fathi, Yen-Yu Chang, Eric Ryan Chan, Ruohan Gao, Thomas Funkhouser, Jiajun Wu

Via arXiv

REVEAL: Retrieval-Augmented Visual-Language Pre-Training with Multi-Source Multimodal Knowledge Memory

Dec 10, 2022
Ziniu Hu, Ahmet Iscen, Chen Sun, Zirui Wang, Kai-Wei Chang, Yizhou Sun, Cordelia Schmid, David A. Ross, Alireza Fathi

Via arXiv

A Memory Transformer Network for Incremental Learning

Oct 10, 2022
Ahmet Iscen, Thomas Bird, Mathilde Caron, Alireza Fathi, Cordelia Schmid

Via arXiv

im2nerf: Image to Neural Radiance Field in the Wild

Sep 08, 2022
Lu Mi, Abhijit Kundu, David Ross, Frank Dellaert, Noah Snavely, Alireza Fathi

Via arXiv

Panoptic Neural Fields: A Semantic Object-Aware Neural Scene Representation

May 09, 2022
Abhijit Kundu, Kyle Genova, Xiaoqi Yin, Alireza Fathi, Caroline Pantofaru, Leonidas Guibas, Andrea Tagliasacchi, Frank Dellaert, Thomas Funkhouser

Via arXiv