Alert button
Picture for Wojciech Galuba

Wojciech Galuba

Alert button

DINOv2: Learning Robust Visual Features without Supervision

Apr 14, 2023
Maxime Oquab, Timothée Darcet, Théo Moutakanni, Huy Vo, Marc Szafraniec, Vasil Khalidov, Pierre Fernandez, Daniel Haziza, Francisco Massa, Alaaeldin El-Nouby, Mahmoud Assran, Nicolas Ballas, Wojciech Galuba, Russell Howes, Po-Yao Huang, Shang-Wen Li, Ishan Misra, Michael Rabbat, Vasu Sharma, Gabriel Synnaeve, Hu Xu, Hervé Jegou, Julien Mairal, Patrick Labatut, Armand Joulin, Piotr Bojanowski

Figure 1 for DINOv2: Learning Robust Visual Features without Supervision
Figure 2 for DINOv2: Learning Robust Visual Features without Supervision
Figure 3 for DINOv2: Learning Robust Visual Features without Supervision
Figure 4 for DINOv2: Learning Robust Visual Features without Supervision
Viaarxiv icon

Masked Autoencoders that Listen

Jul 13, 2022
Po-Yao, Huang, Hu Xu, Juncheng Li, Alexei Baevski, Michael Auli, Wojciech Galuba, Florian Metze, Christoph Feichtenhofer

Figure 1 for Masked Autoencoders that Listen
Figure 2 for Masked Autoencoders that Listen
Figure 3 for Masked Autoencoders that Listen
Figure 4 for Masked Autoencoders that Listen
Viaarxiv icon

FLAVA: A Foundational Language And Vision Alignment Model

Dec 08, 2021
Amanpreet Singh, Ronghang Hu, Vedanuj Goswami, Guillaume Couairon, Wojciech Galuba, Marcus Rohrbach, Douwe Kiela

Figure 1 for FLAVA: A Foundational Language And Vision Alignment Model
Figure 2 for FLAVA: A Foundational Language And Vision Alignment Model
Figure 3 for FLAVA: A Foundational Language And Vision Alignment Model
Figure 4 for FLAVA: A Foundational Language And Vision Alignment Model
Viaarxiv icon

Habitat-Matterport 3D Dataset (HM3D): 1000 Large-scale 3D Environments for Embodied AI

Sep 16, 2021
Santhosh K. Ramakrishnan, Aaron Gokaslan, Erik Wijmans, Oleksandr Maksymets, Alex Clegg, John Turner, Eric Undersander, Wojciech Galuba, Andrew Westbury, Angel X. Chang, Manolis Savva, Yili Zhao, Dhruv Batra

Figure 1 for Habitat-Matterport 3D Dataset (HM3D): 1000 Large-scale 3D Environments for Embodied AI
Figure 2 for Habitat-Matterport 3D Dataset (HM3D): 1000 Large-scale 3D Environments for Embodied AI
Figure 3 for Habitat-Matterport 3D Dataset (HM3D): 1000 Large-scale 3D Environments for Embodied AI
Figure 4 for Habitat-Matterport 3D Dataset (HM3D): 1000 Large-scale 3D Environments for Embodied AI
Viaarxiv icon

Habitat 2.0: Training Home Assistants to Rearrange their Habitat

Jun 28, 2021
Andrew Szot, Alex Clegg, Eric Undersander, Erik Wijmans, Yili Zhao, John Turner, Noah Maestre, Mustafa Mukadam, Devendra Chaplot, Oleksandr Maksymets, Aaron Gokaslan, Vladimir Vondrus, Sameer Dharur, Franziska Meier, Wojciech Galuba, Angel Chang, Zsolt Kira, Vladlen Koltun, Jitendra Malik, Manolis Savva, Dhruv Batra

Figure 1 for Habitat 2.0: Training Home Assistants to Rearrange their Habitat
Figure 2 for Habitat 2.0: Training Home Assistants to Rearrange their Habitat
Figure 3 for Habitat 2.0: Training Home Assistants to Rearrange their Habitat
Figure 4 for Habitat 2.0: Training Home Assistants to Rearrange their Habitat
Viaarxiv icon

Human-Adversarial Visual Question Answering

Jun 04, 2021
Sasha Sheng, Amanpreet Singh, Vedanuj Goswami, Jose Alberto Lopez Magana, Wojciech Galuba, Devi Parikh, Douwe Kiela

Figure 1 for Human-Adversarial Visual Question Answering
Figure 2 for Human-Adversarial Visual Question Answering
Figure 3 for Human-Adversarial Visual Question Answering
Figure 4 for Human-Adversarial Visual Question Answering
Viaarxiv icon

TextOCR: Towards large-scale end-to-end reasoning for arbitrary-shaped scene text

May 12, 2021
Amanpreet Singh, Guan Pang, Mandy Toh, Jing Huang, Wojciech Galuba, Tal Hassner

Figure 1 for TextOCR: Towards large-scale end-to-end reasoning for arbitrary-shaped scene text
Figure 2 for TextOCR: Towards large-scale end-to-end reasoning for arbitrary-shaped scene text
Figure 3 for TextOCR: Towards large-scale end-to-end reasoning for arbitrary-shaped scene text
Figure 4 for TextOCR: Towards large-scale end-to-end reasoning for arbitrary-shaped scene text
Viaarxiv icon