Alert button

"Image": models, code, and papers
Alert button

Donut: Document Understanding Transformer without OCR

Add code
Bookmark button
Alert button
Nov 30, 2021
Geewook Kim, Teakgyu Hong, Moonbin Yim, Jinyoung Park, Jinyeong Yim, Wonseok Hwang, Sangdoo Yun, Dongyoon Han, Seunghyun Park

Figure 1 for Donut: Document Understanding Transformer without OCR
Figure 2 for Donut: Document Understanding Transformer without OCR
Figure 3 for Donut: Document Understanding Transformer without OCR
Figure 4 for Donut: Document Understanding Transformer without OCR
Viaarxiv icon

Single Image Optical Flow Estimation with an Event Camera

Apr 01, 2020
Liyuan Pan, Miaomiao Liu, Richard Hartley

Figure 1 for Single Image Optical Flow Estimation with an Event Camera
Figure 2 for Single Image Optical Flow Estimation with an Event Camera
Figure 3 for Single Image Optical Flow Estimation with an Event Camera
Figure 4 for Single Image Optical Flow Estimation with an Event Camera
Viaarxiv icon

DeepFLASH: An Efficient Network for Learning-based Medical Image Registration

Add code
Bookmark button
Alert button
Apr 05, 2020
Jian Wang, Miaomiao Zhang

Figure 1 for DeepFLASH: An Efficient Network for Learning-based Medical Image Registration
Figure 2 for DeepFLASH: An Efficient Network for Learning-based Medical Image Registration
Figure 3 for DeepFLASH: An Efficient Network for Learning-based Medical Image Registration
Figure 4 for DeepFLASH: An Efficient Network for Learning-based Medical Image Registration
Viaarxiv icon

Rain Removal and Illumination Enhancement Done in One Go

Aug 09, 2021
Yecong Wan, Yuanshuo Cheng, Mingwen Shao

Figure 1 for Rain Removal and Illumination Enhancement Done in One Go
Figure 2 for Rain Removal and Illumination Enhancement Done in One Go
Figure 3 for Rain Removal and Illumination Enhancement Done in One Go
Figure 4 for Rain Removal and Illumination Enhancement Done in One Go
Viaarxiv icon

FOCUS: Familiar Objects in Common and Uncommon Settings

Add code
Bookmark button
Alert button
Oct 07, 2021
Priyatham Kattakinda, Soheil Feizi

Figure 1 for FOCUS: Familiar Objects in Common and Uncommon Settings
Figure 2 for FOCUS: Familiar Objects in Common and Uncommon Settings
Figure 3 for FOCUS: Familiar Objects in Common and Uncommon Settings
Figure 4 for FOCUS: Familiar Objects in Common and Uncommon Settings
Viaarxiv icon

Aligning Visual Regions and Textual Concepts: Learning Fine-Grained Image Representations for Image Captioning

Add code
Bookmark button
Alert button
May 15, 2019
Fenglin Liu, Yuanxin Liu, Xuancheng Ren, Kai Lei, Xu Sun

Figure 1 for Aligning Visual Regions and Textual Concepts: Learning Fine-Grained Image Representations for Image Captioning
Figure 2 for Aligning Visual Regions and Textual Concepts: Learning Fine-Grained Image Representations for Image Captioning
Figure 3 for Aligning Visual Regions and Textual Concepts: Learning Fine-Grained Image Representations for Image Captioning
Figure 4 for Aligning Visual Regions and Textual Concepts: Learning Fine-Grained Image Representations for Image Captioning
Viaarxiv icon

Goal-driven text descriptions for images

Add code
Bookmark button
Alert button
Aug 28, 2021
Ruotian Luo

Figure 1 for Goal-driven text descriptions for images
Figure 2 for Goal-driven text descriptions for images
Figure 3 for Goal-driven text descriptions for images
Figure 4 for Goal-driven text descriptions for images
Viaarxiv icon

A Cross-Modal Image Fusion Theory Guided by Human Visual Characteristics

Add code
Bookmark button
Alert button
Dec 18, 2019
Aiqing Fang, Xinbo Zhao, Yanning Zhang

Figure 1 for A Cross-Modal Image Fusion Theory Guided by Human Visual Characteristics
Figure 2 for A Cross-Modal Image Fusion Theory Guided by Human Visual Characteristics
Figure 3 for A Cross-Modal Image Fusion Theory Guided by Human Visual Characteristics
Figure 4 for A Cross-Modal Image Fusion Theory Guided by Human Visual Characteristics
Viaarxiv icon

Audio Deepfake Perceptions in College Going Populations

Dec 06, 2021
Gabrielle Watson, Zahra Khanjani, Vandana P. Janeja

Figure 1 for Audio Deepfake Perceptions in College Going Populations
Figure 2 for Audio Deepfake Perceptions in College Going Populations
Figure 3 for Audio Deepfake Perceptions in College Going Populations
Figure 4 for Audio Deepfake Perceptions in College Going Populations
Viaarxiv icon

LiteDepthwiseNet: An Extreme Lightweight Network for Hyperspectral Image Classification

Oct 15, 2020
Benlei Cui, XueMei Dong, Qiaoqiao Zhan, Jiangtao Peng, Weiwei Sun

Figure 1 for LiteDepthwiseNet: An Extreme Lightweight Network for Hyperspectral Image Classification
Figure 2 for LiteDepthwiseNet: An Extreme Lightweight Network for Hyperspectral Image Classification
Figure 3 for LiteDepthwiseNet: An Extreme Lightweight Network for Hyperspectral Image Classification
Figure 4 for LiteDepthwiseNet: An Extreme Lightweight Network for Hyperspectral Image Classification
Viaarxiv icon