Alert button

"Image": models, code, and papers
Alert button

Is GPT-3 all you need for Visual Question Answering in Cultural Heritage?

Jul 25, 2022
Pietro Bongini, Federico Becattini, Alberto Del Bimbo

Figure 1 for Is GPT-3 all you need for Visual Question Answering in Cultural Heritage?
Figure 2 for Is GPT-3 all you need for Visual Question Answering in Cultural Heritage?
Figure 3 for Is GPT-3 all you need for Visual Question Answering in Cultural Heritage?
Figure 4 for Is GPT-3 all you need for Visual Question Answering in Cultural Heritage?
Viaarxiv icon

Multi-Contrast MRI Segmentation Trained on Synthetic Images

Jul 06, 2022
Ismail Irmakci, Zeki Emre Unel, Nazli Ikizler-Cinbis, Ulas Bagci

Figure 1 for Multi-Contrast MRI Segmentation Trained on Synthetic Images
Figure 2 for Multi-Contrast MRI Segmentation Trained on Synthetic Images
Figure 3 for Multi-Contrast MRI Segmentation Trained on Synthetic Images
Figure 4 for Multi-Contrast MRI Segmentation Trained on Synthetic Images
Viaarxiv icon

ZeroMesh: Zero-shot Single-view 3D Mesh Reconstruction

Add code
Bookmark button
Alert button
Aug 04, 2022
Xianghui Yang, Guosheng Lin, Luping Zhou

Figure 1 for ZeroMesh: Zero-shot Single-view 3D Mesh Reconstruction
Figure 2 for ZeroMesh: Zero-shot Single-view 3D Mesh Reconstruction
Figure 3 for ZeroMesh: Zero-shot Single-view 3D Mesh Reconstruction
Figure 4 for ZeroMesh: Zero-shot Single-view 3D Mesh Reconstruction
Viaarxiv icon

Adversarial Style Augmentation for Domain Generalized Urban-Scene Segmentation

Jul 11, 2022
Zhun Zhong, Yuyang Zhao, Gim Hee Lee, Nicu Sebe

Figure 1 for Adversarial Style Augmentation for Domain Generalized Urban-Scene Segmentation
Figure 2 for Adversarial Style Augmentation for Domain Generalized Urban-Scene Segmentation
Figure 3 for Adversarial Style Augmentation for Domain Generalized Urban-Scene Segmentation
Figure 4 for Adversarial Style Augmentation for Domain Generalized Urban-Scene Segmentation
Viaarxiv icon

Synthesizing Photorealistic Virtual Humans Through Cross-modal Disentanglement

Sep 03, 2022
Siddarth Ravichandran, Ondřej Texler, Dimitar Dinev, Hyun Jae Kang

Figure 1 for Synthesizing Photorealistic Virtual Humans Through Cross-modal Disentanglement
Figure 2 for Synthesizing Photorealistic Virtual Humans Through Cross-modal Disentanglement
Figure 3 for Synthesizing Photorealistic Virtual Humans Through Cross-modal Disentanglement
Figure 4 for Synthesizing Photorealistic Virtual Humans Through Cross-modal Disentanglement
Viaarxiv icon

Deep Neural Network Approximation of Invariant Functions through Dynamical Systems

Aug 18, 2022
Qianxiao Li, Ting Lin, Zuowei Shen

Figure 1 for Deep Neural Network Approximation of Invariant Functions through Dynamical Systems
Viaarxiv icon

Semi-Supervised Medical Image Segmentation via Cross Teaching between CNN and Transformer

Add code
Bookmark button
Alert button
Dec 09, 2021
Xiangde Luo, Minhao Hu, Tao Song, Guotai Wang, Shaoting Zhang

Figure 1 for Semi-Supervised Medical Image Segmentation via Cross Teaching between CNN and Transformer
Figure 2 for Semi-Supervised Medical Image Segmentation via Cross Teaching between CNN and Transformer
Figure 3 for Semi-Supervised Medical Image Segmentation via Cross Teaching between CNN and Transformer
Figure 4 for Semi-Supervised Medical Image Segmentation via Cross Teaching between CNN and Transformer
Viaarxiv icon

Towards Intelligent Millimeter and Terahertz Communication for 6G: Computer Vision-aided Beamforming

Sep 06, 2022
Yongjun Ahn, Jinhong Kim, Seungnyun Kim, Kyuhong Shim, Jiyoung Kim, Sangtae Kim, Byonghyo Shim

Figure 1 for Towards Intelligent Millimeter and Terahertz Communication for 6G: Computer Vision-aided Beamforming
Figure 2 for Towards Intelligent Millimeter and Terahertz Communication for 6G: Computer Vision-aided Beamforming
Figure 3 for Towards Intelligent Millimeter and Terahertz Communication for 6G: Computer Vision-aided Beamforming
Figure 4 for Towards Intelligent Millimeter and Terahertz Communication for 6G: Computer Vision-aided Beamforming
Viaarxiv icon

Effectiveness of Function Matching in Driving Scene Recognition

Aug 20, 2022
Shingo Yashima

Figure 1 for Effectiveness of Function Matching in Driving Scene Recognition
Figure 2 for Effectiveness of Function Matching in Driving Scene Recognition
Figure 3 for Effectiveness of Function Matching in Driving Scene Recognition
Figure 4 for Effectiveness of Function Matching in Driving Scene Recognition
Viaarxiv icon

An End-to-End OCR Framework for Robust Arabic-Handwriting Recognition using a Novel Transformers-based Model and an Innovative 270 Million-Words Multi-Font Corpus of Classical Arabic with Diacritics

Aug 26, 2022
Aly Mostafa, Omar Mohamed, Ali Ashraf, Ahmed Elbehery, Salma Jamal, Anas Salah, Amr S. Ghoneim

Figure 1 for An End-to-End OCR Framework for Robust Arabic-Handwriting Recognition using a Novel Transformers-based Model and an Innovative 270 Million-Words Multi-Font Corpus of Classical Arabic with Diacritics
Figure 2 for An End-to-End OCR Framework for Robust Arabic-Handwriting Recognition using a Novel Transformers-based Model and an Innovative 270 Million-Words Multi-Font Corpus of Classical Arabic with Diacritics
Figure 3 for An End-to-End OCR Framework for Robust Arabic-Handwriting Recognition using a Novel Transformers-based Model and an Innovative 270 Million-Words Multi-Font Corpus of Classical Arabic with Diacritics
Figure 4 for An End-to-End OCR Framework for Robust Arabic-Handwriting Recognition using a Novel Transformers-based Model and an Innovative 270 Million-Words Multi-Font Corpus of Classical Arabic with Diacritics
Viaarxiv icon