"Image": models, code, and papers

Jointly Training Large Autoregressive Multimodal Models

Sep 28, 2023
Emanuele Aiello, Lili Yu, Yixin Nie, Armen Aghajanyan, Barlas Oguz

Via arXiv

Evaluating Explanation Methods for Vision-and-Language Navigation

Oct 10, 2023
Guanqi Chen, Lei Yang, Guanhua Chen, Jia Pan

Via arXiv

Blind Dates: Examining the Expression of Temporality in Historical Photographs

Oct 10, 2023
Alexandra Barancová, Melvin Wevers, Nanne van Noord

Via arXiv

Hierarchical Mask2Former: Panoptic Segmentation of Crops, Weeds and Leaves

Oct 10, 2023
Madeleine Darbyshire, Elizabeth Sklar, Simon Parsons

Via arXiv

Compression Ratio Learning and Semantic Communications for Video Imaging

Oct 10, 2023
Bowen Zhang, Zhijin Qin, Geoffrey Ye Li

Via arXiv

Domain Expansion via Network Adaptation for Solving Inverse Problems

Oct 10, 2023
Nebiyou Yismaw, Ulugbek S. Kamilov, M. Salman Asif

Via arXiv

Persis: A Persian Font Recognition Pipeline Using Convolutional Neural Networks

Oct 10, 2023
Mehrdad Mohammadian, Neda Maleki, Tobias Olsson, Fredrik Ahlgren

Via arXiv

Geometry-Guided Ray Augmentation for Neural Surface Reconstruction with Sparse Views

Oct 09, 2023
Jiawei Yao, Chen Wang, Tong Wu, Chuming Li

Via arXiv

Learning Layer-wise Equivariances Automatically using Gradients

Oct 09, 2023
Tycho F. A. van der Ouderaa, Alexander Immer, Mark van der Wilk

Via arXiv

BATINet: Background-Aware Text to Image Synthesis and Manipulation Network

Aug 11, 2023
Ryugo Morita, Zhiqiang Zhang, Jinjia Zhou

Via arXiv