Vittorio Ferrari

HAMMR: HierArchical MultiModal React agents for generic VQA

Apr 08, 2024
Lluis Castrejon, Thomas Mensink, Howard Zhou, Vittorio Ferrari, André Araujo, Jasper Uijlings

Grounding Everything: Emerging Localization Properties in Vision-Language Transformers

Dec 05, 2023
Walid Bousselham, Felix Petersen, Vittorio Ferrari, Hilde Kuehne

StoryBench: A Multifaceted Benchmark for Continuous Story Visualization

Aug 22, 2023
Emanuele Bugliarello, Hernan Moraldo, Ruben Villegas, Mohammad Babaeizadeh, Mohammad Taghi Saffar, Han Zhang, Dumitru Erhan, Vittorio Ferrari, Pieter-Jan Kindermans, Paul Voigtlaender

Encyclopedic VQA: Visual questions about detailed properties of fine-grained categories

Jun 15, 2023
Thomas Mensink, Jasper Uijlings, Lluis Castrejon, Arushi Goel, Felipe Cadar, Howard Zhou, Fei Sha, André Araujo, Vittorio Ferrari

NAVI: Category-Agnostic Image Collections with High-Quality 3D Shape and Pose Annotations

Jun 15, 2023
Varun Jampani, Kevis-Kokitsi Maninis, Andreas Engelhardt, Arjun Karpur, Karen Truong, Kyle Sargent, Stefan Popov, André Araujo, Ricardo Martin-Brualla, Kaushal Patel, Daniel Vlasic, Vittorio Ferrari, Ameesh Makadia, Ce Liu, Yuanzhen Li, Howard Zhou

Estimating Generic 3D Room Structures from 2D Annotations

Jun 15, 2023
Denys Rozumnyi, Stefan Popov, Kevis-Kokitsi Maninis, Matthias Nießner, Vittorio Ferrari

CAD-Estate: Large-scale CAD Model Annotation in RGB Videos

Jun 15, 2023
Kevis-Kokitsi Maninis, Stefan Popov, Matthias Nießner, Vittorio Ferrari

Tracking by 3D Model Estimation of Unknown Objects in Videos

Apr 13, 2023
Denys Rozumnyi, Jiri Matas, Marc Pollefeys, Vittorio Ferrari, Martin R. Oswald

Connecting Vision and Language with Video Localized Narratives

Mar 15, 2023
Paul Voigtlaender, Soravit Changpinyo, Jordi Pont-Tuset, Radu Soricut, Vittorio Ferrari

Agile Modeling: Image Classification with Domain Experts in the Loop

Feb 25, 2023
Otilia Stretcu, Edward Vendrow, Kenji Hata, Krishnamurthy Viswanathan, Vittorio Ferrari, Sasan Tavakkol, Wenlei Zhou, Aditya Avinash, Enming Luo, Neil Gordon Alldrin, MohammadHossein Bateni, Gabriel Berger, Andrew Bunner, Chun-Ta Lu, Javier A Rey, Ariel Fuxman
