Alert button

"Image": models, code, and papers
Alert button

Scene-based Factored Attention for Image Captioning

Add code
Bookmark button
Alert button
Aug 18, 2019
Chen Shen, Rongrong Ji, Fuhai Chen, Xiaoshuai Sun, Xiangming Li

Figure 1 for Scene-based Factored Attention for Image Captioning
Figure 2 for Scene-based Factored Attention for Image Captioning
Figure 3 for Scene-based Factored Attention for Image Captioning
Figure 4 for Scene-based Factored Attention for Image Captioning
Viaarxiv icon

Generative Image Inpainting with Submanifold Alignment

Aug 01, 2019
Ang Li, Jianzhong Qi, Rui Zhang, Xingjun Ma, Kotagiri Ramamohanarao

Figure 1 for Generative Image Inpainting with Submanifold Alignment
Figure 2 for Generative Image Inpainting with Submanifold Alignment
Figure 3 for Generative Image Inpainting with Submanifold Alignment
Figure 4 for Generative Image Inpainting with Submanifold Alignment
Viaarxiv icon

A nonlocal feature-driven exemplar-based approach for image inpainting

Add code
Bookmark button
Alert button
Sep 20, 2019
Viktor Reshniak, Jeremy Trageser, Clayton G. Webster

Figure 1 for A nonlocal feature-driven exemplar-based approach for image inpainting
Figure 2 for A nonlocal feature-driven exemplar-based approach for image inpainting
Figure 3 for A nonlocal feature-driven exemplar-based approach for image inpainting
Figure 4 for A nonlocal feature-driven exemplar-based approach for image inpainting
Viaarxiv icon

Image-Dependent Local Entropy Models for Learned Image Compression

May 31, 2018
David Minnen, George Toderici, Saurabh Singh, Sung Jin Hwang, Michele Covell

Figure 1 for Image-Dependent Local Entropy Models for Learned Image Compression
Figure 2 for Image-Dependent Local Entropy Models for Learned Image Compression
Figure 3 for Image-Dependent Local Entropy Models for Learned Image Compression
Figure 4 for Image-Dependent Local Entropy Models for Learned Image Compression
Viaarxiv icon

E2E-VLP: End-to-End Vision-Language Pre-training Enhanced by Visual Learning

Jun 04, 2021
Haiyang Xu, Ming Yan, Chenliang Li, Bin Bi, Songfang Huang, Wenming Xiao, Fei Huang

Figure 1 for E2E-VLP: End-to-End Vision-Language Pre-training Enhanced by Visual Learning
Figure 2 for E2E-VLP: End-to-End Vision-Language Pre-training Enhanced by Visual Learning
Figure 3 for E2E-VLP: End-to-End Vision-Language Pre-training Enhanced by Visual Learning
Figure 4 for E2E-VLP: End-to-End Vision-Language Pre-training Enhanced by Visual Learning
Viaarxiv icon

Arbitrary Virtual Try-On Network: Characteristics Preservation and Trade-off between Body and Clothing

Nov 24, 2021
Yu Liu, Mingbo Zhao, Zhao Zhang, Haijun Zhang, Shuicheng Yan

Figure 1 for Arbitrary Virtual Try-On Network: Characteristics Preservation and Trade-off between Body and Clothing
Figure 2 for Arbitrary Virtual Try-On Network: Characteristics Preservation and Trade-off between Body and Clothing
Figure 3 for Arbitrary Virtual Try-On Network: Characteristics Preservation and Trade-off between Body and Clothing
Figure 4 for Arbitrary Virtual Try-On Network: Characteristics Preservation and Trade-off between Body and Clothing
Viaarxiv icon

Fiducial marker recovery and detection from severely truncated data in navigation assisted spine surgery

Sep 01, 2021
Fuxin Fan, Björn Kreher, Holger Keil, Andreas Maier, Yixing Huang

Figure 1 for Fiducial marker recovery and detection from severely truncated data in navigation assisted spine surgery
Figure 2 for Fiducial marker recovery and detection from severely truncated data in navigation assisted spine surgery
Figure 3 for Fiducial marker recovery and detection from severely truncated data in navigation assisted spine surgery
Figure 4 for Fiducial marker recovery and detection from severely truncated data in navigation assisted spine surgery
Viaarxiv icon

Light Field Synthesis by Training Deep Network in the Refocused Image Domain

Nov 07, 2019
Chang-Le Liu, Kuang-Tsu Shih, Homer H. Chen

Figure 1 for Light Field Synthesis by Training Deep Network in the Refocused Image Domain
Figure 2 for Light Field Synthesis by Training Deep Network in the Refocused Image Domain
Figure 3 for Light Field Synthesis by Training Deep Network in the Refocused Image Domain
Figure 4 for Light Field Synthesis by Training Deep Network in the Refocused Image Domain
Viaarxiv icon

TEAM-Net: Multi-modal Learning for Video Action Recognition with Partial Decoding

Add code
Bookmark button
Alert button
Oct 17, 2021
Zhengwei Wang, Qi She, Aljosa Smolic

Figure 1 for TEAM-Net: Multi-modal Learning for Video Action Recognition with Partial Decoding
Figure 2 for TEAM-Net: Multi-modal Learning for Video Action Recognition with Partial Decoding
Figure 3 for TEAM-Net: Multi-modal Learning for Video Action Recognition with Partial Decoding
Figure 4 for TEAM-Net: Multi-modal Learning for Video Action Recognition with Partial Decoding
Viaarxiv icon

An Image Based Visual Servo Approach with Deep Learning for Robotic Manipulation

Sep 17, 2019
Jingshu Liu, Yuan Li

Figure 1 for An Image Based Visual Servo Approach with Deep Learning for Robotic Manipulation
Figure 2 for An Image Based Visual Servo Approach with Deep Learning for Robotic Manipulation
Figure 3 for An Image Based Visual Servo Approach with Deep Learning for Robotic Manipulation
Figure 4 for An Image Based Visual Servo Approach with Deep Learning for Robotic Manipulation
Viaarxiv icon