"Image": models, code, and papers

Object2Scene: Putting Objects in Context for Open-Vocabulary 3D Detection

Sep 18, 2023
Chenming Zhu, Wenwei Zhang, Tai Wang, Xihui Liu, Kai Chen

Large Multilingual Models Pivot Zero-Shot Multimodal Learning across Languages

Aug 23, 2023
Jinyi Hu, Yuan Yao, Chongyi Wang, Shan Wang, Yinxu Pan, Qianyu Chen, Tianyu Yu, Hanghao Wu, Yue Zhao, Haoye Zhang, Xu Han, Yankai Lin, Jiao Xue, Dahai Li, Zhiyuan Liu, Maosong Sun

Socratis: Are large multimodal models emotionally aware?

Aug 31, 2023
Katherine Deng, Arijit Ray, Reuben Tan, Saadia Gabriel, Bryan A. Plummer, Kate Saenko

M3D-NCA: Robust 3D Segmentation with Built-in Quality Control

Sep 06, 2023
John Kalkhof, Anirban Mukhopadhyay

Towards Addressing the Misalignment of Object Proposal Evaluation for Vision-Language Tasks via Semantic Grounding

Sep 01, 2023
Joshua Feinglass, Yezhou Yang

AnomalyGPT: Detecting Industrial Anomalies using Large Vision-Language Models

Sep 04, 2023
Zhaopeng Gu, Bingke Zhu, Guibo Zhu, Yingying Chen, Ming Tang, Jinqiao Wang

DeViL: Decoding Vision features into Language

Sep 04, 2023
Meghal Dani, Isabel Rio-Torto, Stephan Alaniz, Zeynep Akata

Detection and Localization of Firearm Carriers in Complex Scenes for Improved Safety Measures

Sep 17, 2023
Arif Mahmood, Abdul Basit, M. Akhtar Munir, Mohsen Ali

Active Learning for Semantic Segmentation with Multi-class Label Query

Sep 17, 2023
Sehyun Hwang, Sohyun Lee, Hoyoung Kim, Minhyeon Oh, Jungseul Ok, Suha Kwak

Shape of my heart: Cardiac models through learned signed distance functions

Sep 05, 2023
Jan Verhülsdonk, Thomas Grandits, Francisco Sahli Costabal, Rolf Krause, Angelo Auricchio, Gundolf Haase, Simone Pezzuto, Alexander Effland
