Alert button
Picture for Howard Zhou

Howard Zhou

Alert button

HAMMR: HierArchical MultiModal React agents for generic VQA

Add code
Bookmark button
Alert button
Apr 08, 2024
Lluis Castrejon, Thomas Mensink, Howard Zhou, Vittorio Ferrari, Andre Araujo, Jasper Uijlings

Viaarxiv icon

Modeling Collaborator: Enabling Subjective Vision Classification With Minimal Human Effort via LLM Tool-Use

Add code
Bookmark button
Alert button
Mar 05, 2024
Imad Eddine Toubal, Aditya Avinash, Neil Gordon Alldrin, Jan Dlabal, Wenlei Zhou, Enming Luo, Otilia Stretcu, Hao Xiong, Chun-Ta Lu, Howard Zhou, Ranjay Krishna, Ariel Fuxman, Tom Duerig

Figure 1 for Modeling Collaborator: Enabling Subjective Vision Classification With Minimal Human Effort via LLM Tool-Use
Figure 2 for Modeling Collaborator: Enabling Subjective Vision Classification With Minimal Human Effort via LLM Tool-Use
Figure 3 for Modeling Collaborator: Enabling Subjective Vision Classification With Minimal Human Effort via LLM Tool-Use
Figure 4 for Modeling Collaborator: Enabling Subjective Vision Classification With Minimal Human Effort via LLM Tool-Use
Viaarxiv icon

Encyclopedic VQA: Visual questions about detailed properties of fine-grained categories

Add code
Bookmark button
Alert button
Jun 15, 2023
Thomas Mensink, Jasper Uijlings, Lluis Castrejon, Arushi Goel, Felipe Cadar, Howard Zhou, Fei Sha, André Araujo, Vittorio Ferrari

Figure 1 for Encyclopedic VQA: Visual questions about detailed properties of fine-grained categories
Figure 2 for Encyclopedic VQA: Visual questions about detailed properties of fine-grained categories
Figure 3 for Encyclopedic VQA: Visual questions about detailed properties of fine-grained categories
Figure 4 for Encyclopedic VQA: Visual questions about detailed properties of fine-grained categories
Viaarxiv icon

NAVI: Category-Agnostic Image Collections with High-Quality 3D Shape and Pose Annotations

Add code
Bookmark button
Alert button
Jun 15, 2023
Varun Jampani, Kevis-Kokitsi Maninis, Andreas Engelhardt, Arjun Karpur, Karen Truong, Kyle Sargent, Stefan Popov, André Araujo, Ricardo Martin-Brualla, Kaushal Patel, Daniel Vlasic, Vittorio Ferrari, Ameesh Makadia, Ce Liu, Yuanzhen Li, Howard Zhou

Figure 1 for NAVI: Category-Agnostic Image Collections with High-Quality 3D Shape and Pose Annotations
Figure 2 for NAVI: Category-Agnostic Image Collections with High-Quality 3D Shape and Pose Annotations
Figure 3 for NAVI: Category-Agnostic Image Collections with High-Quality 3D Shape and Pose Annotations
Figure 4 for NAVI: Category-Agnostic Image Collections with High-Quality 3D Shape and Pose Annotations
Viaarxiv icon

LFM-3D: Learnable Feature Matching Across Wide Baselines Using 3D Signals

Add code
Bookmark button
Alert button
Mar 22, 2023
Arjun Karpur, Guilherme Perrotta, Ricardo Martin-Brualla, Howard Zhou, Andre Araujo

Figure 1 for LFM-3D: Learnable Feature Matching Across Wide Baselines Using 3D Signals
Figure 2 for LFM-3D: Learnable Feature Matching Across Wide Baselines Using 3D Signals
Figure 3 for LFM-3D: Learnable Feature Matching Across Wide Baselines Using 3D Signals
Figure 4 for LFM-3D: Learnable Feature Matching Across Wide Baselines Using 3D Signals
Viaarxiv icon

IBRNet: Learning Multi-View Image-Based Rendering

Add code
Bookmark button
Alert button
Feb 25, 2021
Qianqian Wang, Zhicheng Wang, Kyle Genova, Pratul Srinivasan, Howard Zhou, Jonathan T. Barron, Ricardo Martin-Brualla, Noah Snavely, Thomas Funkhouser

Figure 1 for IBRNet: Learning Multi-View Image-Based Rendering
Figure 2 for IBRNet: Learning Multi-View Image-Based Rendering
Figure 3 for IBRNet: Learning Multi-View Image-Based Rendering
Figure 4 for IBRNet: Learning Multi-View Image-Based Rendering
Viaarxiv icon

Unifying Specialist Image Embedding into Universal Image Embedding

Add code
Bookmark button
Alert button
Mar 08, 2020
Yang Feng, Futang Peng, Xu Zhang, Wei Zhu, Shanfeng Zhang, Howard Zhou, Zhen Li, Tom Duerig, Shih-Fu Chang, Jiebo Luo

Figure 1 for Unifying Specialist Image Embedding into Universal Image Embedding
Figure 2 for Unifying Specialist Image Embedding into Universal Image Embedding
Figure 3 for Unifying Specialist Image Embedding into Universal Image Embedding
Figure 4 for Unifying Specialist Image Embedding into Universal Image Embedding
Viaarxiv icon

The Unreasonable Effectiveness of Noisy Data for Fine-Grained Recognition

Add code
Bookmark button
Alert button
Oct 18, 2016
Jonathan Krause, Benjamin Sapp, Andrew Howard, Howard Zhou, Alexander Toshev, Tom Duerig, James Philbin, Li Fei-Fei

Figure 1 for The Unreasonable Effectiveness of Noisy Data for Fine-Grained Recognition
Figure 2 for The Unreasonable Effectiveness of Noisy Data for Fine-Grained Recognition
Figure 3 for The Unreasonable Effectiveness of Noisy Data for Fine-Grained Recognition
Figure 4 for The Unreasonable Effectiveness of Noisy Data for Fine-Grained Recognition
Viaarxiv icon

Blockout: Dynamic Model Selection for Hierarchical Deep Networks

Add code
Bookmark button
Alert button
Dec 16, 2015
Calvin Murdock, Zhen Li, Howard Zhou, Tom Duerig

Figure 1 for Blockout: Dynamic Model Selection for Hierarchical Deep Networks
Figure 2 for Blockout: Dynamic Model Selection for Hierarchical Deep Networks
Figure 3 for Blockout: Dynamic Model Selection for Hierarchical Deep Networks
Figure 4 for Blockout: Dynamic Model Selection for Hierarchical Deep Networks
Viaarxiv icon