Alert button

"Image": models, code, and papers
Alert button

A Deep Neural Framework for Image Caption Generation Using GRU-Based Attention Mechanism

Mar 03, 2022
Rashid Khan, M Shujah Islam, Khadija Kanwal, Mansoor Iqbal, Md. Imran Hossain, Zhongfu Ye

Figure 1 for A Deep Neural Framework for Image Caption Generation Using GRU-Based Attention Mechanism
Figure 2 for A Deep Neural Framework for Image Caption Generation Using GRU-Based Attention Mechanism
Figure 3 for A Deep Neural Framework for Image Caption Generation Using GRU-Based Attention Mechanism
Figure 4 for A Deep Neural Framework for Image Caption Generation Using GRU-Based Attention Mechanism
Viaarxiv icon

One Model, Multiple Modalities: A Sparsely Activated Approach for Text, Sound, Image, Video and Code

May 12, 2022
Yong Dai, Duyu Tang, Liangxin Liu, Minghuan Tan, Cong Zhou, Jingquan Wang, Zhangyin Feng, Fan Zhang, Xueyu Hu, Shuming Shi

Figure 1 for One Model, Multiple Modalities: A Sparsely Activated Approach for Text, Sound, Image, Video and Code
Figure 2 for One Model, Multiple Modalities: A Sparsely Activated Approach for Text, Sound, Image, Video and Code
Figure 3 for One Model, Multiple Modalities: A Sparsely Activated Approach for Text, Sound, Image, Video and Code
Figure 4 for One Model, Multiple Modalities: A Sparsely Activated Approach for Text, Sound, Image, Video and Code
Viaarxiv icon

Space-time design for deep joint source channel coding of images Over MIMO channels

Add code
Bookmark button
Alert button
Oct 30, 2022
Chenghong Bian, Yulin Shao, Haotian Wu, Deniz Gunduz

Figure 1 for Space-time design for deep joint source channel coding of images Over MIMO channels
Figure 2 for Space-time design for deep joint source channel coding of images Over MIMO channels
Figure 3 for Space-time design for deep joint source channel coding of images Over MIMO channels
Figure 4 for Space-time design for deep joint source channel coding of images Over MIMO channels
Viaarxiv icon

A Novel Approach for Neuromorphic Vision Data Compression based on Deep Belief Network

Oct 27, 2022
Sally Khaidem, Mansi Sharma, Abhipraay Nevatia

Figure 1 for A Novel Approach for Neuromorphic Vision Data Compression based on Deep Belief Network
Figure 2 for A Novel Approach for Neuromorphic Vision Data Compression based on Deep Belief Network
Figure 3 for A Novel Approach for Neuromorphic Vision Data Compression based on Deep Belief Network
Figure 4 for A Novel Approach for Neuromorphic Vision Data Compression based on Deep Belief Network
Viaarxiv icon

MSSNet: Multi-Scale-Stage Network for Single Image Deblurring

Add code
Bookmark button
Alert button
Feb 19, 2022
Kiyeon Kim, Seungyong Lee, Sunghyun Cho

Figure 1 for MSSNet: Multi-Scale-Stage Network for Single Image Deblurring
Figure 2 for MSSNet: Multi-Scale-Stage Network for Single Image Deblurring
Figure 3 for MSSNet: Multi-Scale-Stage Network for Single Image Deblurring
Figure 4 for MSSNet: Multi-Scale-Stage Network for Single Image Deblurring
Viaarxiv icon

MICDIR: Multi-scale Inverse-consistent Deformable Image Registration using UNetMSS with Self-Constructing Graph Latent

Add code
Bookmark button
Alert button
Mar 08, 2022
Soumick Chatterjee, Himanshi Bajaj, Istiyak H. Siddiquee, Nandish Bandi Subbarayappa, Steve Simon, Suraj Bangalore Shashidhar, Oliver Speck, Andreas Nürnberge

Figure 1 for MICDIR: Multi-scale Inverse-consistent Deformable Image Registration using UNetMSS with Self-Constructing Graph Latent
Figure 2 for MICDIR: Multi-scale Inverse-consistent Deformable Image Registration using UNetMSS with Self-Constructing Graph Latent
Figure 3 for MICDIR: Multi-scale Inverse-consistent Deformable Image Registration using UNetMSS with Self-Constructing Graph Latent
Figure 4 for MICDIR: Multi-scale Inverse-consistent Deformable Image Registration using UNetMSS with Self-Constructing Graph Latent
Viaarxiv icon

This is what a pandemic looks like: Visual framing of COVID-19 on search engines

Sep 22, 2022
Mykola Makhortykh, Aleksandra Urman, Roberto Ulloa

Figure 1 for This is what a pandemic looks like: Visual framing of COVID-19 on search engines
Figure 2 for This is what a pandemic looks like: Visual framing of COVID-19 on search engines
Viaarxiv icon

CurveFormer: 3D Lane Detection by Curve Propagation with Curve Queries and Attention

Sep 16, 2022
Yifeng Bai, Zhirong Chen, Zhangjie Fu, Lang Peng, Pengpeng Liang, Erkang Cheng

Figure 1 for CurveFormer: 3D Lane Detection by Curve Propagation with Curve Queries and Attention
Figure 2 for CurveFormer: 3D Lane Detection by Curve Propagation with Curve Queries and Attention
Figure 3 for CurveFormer: 3D Lane Detection by Curve Propagation with Curve Queries and Attention
Figure 4 for CurveFormer: 3D Lane Detection by Curve Propagation with Curve Queries and Attention
Viaarxiv icon

3DConvCaps: 3DUnet with Convolutional Capsule Encoder for Medical Image Segmentation

Add code
Bookmark button
Alert button
May 19, 2022
Minh Tran, Viet-Khoa Vo-Ho, Ngan T. H. Le

Figure 1 for 3DConvCaps: 3DUnet with Convolutional Capsule Encoder for Medical Image Segmentation
Figure 2 for 3DConvCaps: 3DUnet with Convolutional Capsule Encoder for Medical Image Segmentation
Figure 3 for 3DConvCaps: 3DUnet with Convolutional Capsule Encoder for Medical Image Segmentation
Figure 4 for 3DConvCaps: 3DUnet with Convolutional Capsule Encoder for Medical Image Segmentation
Viaarxiv icon

A Visual Tour Of Current Challenges In Multimodal Language Models

Add code
Bookmark button
Alert button
Oct 22, 2022
Shashank Sonkar, Naiming Liu, Richard G. Baraniuk

Figure 1 for A Visual Tour Of Current Challenges In Multimodal Language Models
Figure 2 for A Visual Tour Of Current Challenges In Multimodal Language Models
Figure 3 for A Visual Tour Of Current Challenges In Multimodal Language Models
Figure 4 for A Visual Tour Of Current Challenges In Multimodal Language Models
Viaarxiv icon