"Image": models, code, and papers

Retrieval-Enhanced Contrastive Vision-Text Models

Jun 12, 2023
Ahmet Iscen, Mathilde Caron, Alireza Fathi, Cordelia Schmid


Revisiting Token Pruning for Object Detection and Instance Segmentation

Jun 12, 2023
Yifei Liu, Mathias Gehrig, Nico Messikommer, Marco Cannici, Davide Scaramuzza


Temporal-controlled Frame Swap for Generating High-Fidelity Stereo Driving Data for Autonomy Analysis

Jun 12, 2023
Yedi Luo, Xiangyu Bai, Le Jiang, Aniket Gupta, Eric Mortin, Hanumant Singh, Sarah Ostadabbas


LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day

Jun 01, 2023
Chunyuan Li, Cliff Wong, Sheng Zhang, Naoto Usuyama, Haotian Liu, Jianwei Yang, Tristan Naumann, Hoifung Poon, Jianfeng Gao


G-CAME: Gaussian-Class Activation Mapping Explainer for Object Detectors

Jun 06, 2023
Quoc Khanh Nguyen, Truong Thanh Hung Nguyen, Vo Thanh Khang Nguyen, Van Binh Truong, Quoc Hung Cao


Real-Time Onboard Object Detection for Augmented Reality: Enhancing Head-Mounted Display with YOLOv8

Jun 06, 2023
Mikołaj Łysakowski, Kamil Żywanowski, Adam Banaszczyk, Michał R. Nowicki, Piotr Skrzypczyński, Sławomir K. Tadeja


Interpretable Alzheimer's Disease Classification Via a Contrastive Diffusion Autoencoder

Jun 05, 2023
Ayodeji Ijishakin, Ahmed Abdulaal, Adamos Hadjivasiliou, Sophie Martin, James Cole


XrayGPT: Chest Radiographs Summarization using Medical Vision-Language Models

Jun 13, 2023
Omkar Thawkar, Abdelrahman Shaker, Sahal Shaji Mullappilly, Hisham Cholakkal, Rao Muhammad Anwer, Salman Khan, Jorma Laaksonen, Fahad Shahbaz Khan


Learning Unnormalized Statistical Models via Compositional Optimization

Jun 13, 2023
Wei Jiang, Jiayu Qin, Lingyu Wu, Changyou Chen, Tianbao Yang, Lijun Zhang


An Empirical Study on the Robustness of the Segment Anything Model (SAM)

May 10, 2023
Yuqing Wang, Yun Zhao, Linda Petzold
