Alert button

"Image": models, code, and papers
Alert button

ACORT: A Compact Object Relation Transformer for Parameter Efficient Image Captioning

Feb 11, 2022
Jia Huei Tan, Ying Hua Tan, Chee Seng Chan, Joon Huang Chuah

Figure 1 for ACORT: A Compact Object Relation Transformer for Parameter Efficient Image Captioning
Figure 2 for ACORT: A Compact Object Relation Transformer for Parameter Efficient Image Captioning
Figure 3 for ACORT: A Compact Object Relation Transformer for Parameter Efficient Image Captioning
Figure 4 for ACORT: A Compact Object Relation Transformer for Parameter Efficient Image Captioning
Viaarxiv icon

Exploring Gradient-based Multi-directional Controls in GANs

Sep 01, 2022
Zikun Chen, Ruowei Jiang, Brendan Duke, Han Zhao, Parham Aarabi

Figure 1 for Exploring Gradient-based Multi-directional Controls in GANs
Figure 2 for Exploring Gradient-based Multi-directional Controls in GANs
Figure 3 for Exploring Gradient-based Multi-directional Controls in GANs
Figure 4 for Exploring Gradient-based Multi-directional Controls in GANs
Viaarxiv icon

Few-Shot Learning Meets Transformer: Unified Query-Support Transformers for Few-Shot Classification

Aug 26, 2022
Xixi Wang, Xiao Wang, Bo Jiang, Bin Luo

Figure 1 for Few-Shot Learning Meets Transformer: Unified Query-Support Transformers for Few-Shot Classification
Figure 2 for Few-Shot Learning Meets Transformer: Unified Query-Support Transformers for Few-Shot Classification
Figure 3 for Few-Shot Learning Meets Transformer: Unified Query-Support Transformers for Few-Shot Classification
Figure 4 for Few-Shot Learning Meets Transformer: Unified Query-Support Transformers for Few-Shot Classification
Viaarxiv icon

GeoFill: Reference-Based Image Inpainting of Scenes with Complex Geometry

Jan 20, 2022
Yunhan Zhao, Connelly Barnes, Yuqian Zhou, Eli Shechtman, Sohrab Amirghodsi, Charless Fowlkes

Figure 1 for GeoFill: Reference-Based Image Inpainting of Scenes with Complex Geometry
Figure 2 for GeoFill: Reference-Based Image Inpainting of Scenes with Complex Geometry
Figure 3 for GeoFill: Reference-Based Image Inpainting of Scenes with Complex Geometry
Figure 4 for GeoFill: Reference-Based Image Inpainting of Scenes with Complex Geometry
Viaarxiv icon

LEAVES: Learning Views for Time-Series Data in Contrastive Learning

Oct 13, 2022
Han Yu, Huiyuan Yang, Akane Sano

Figure 1 for LEAVES: Learning Views for Time-Series Data in Contrastive Learning
Figure 2 for LEAVES: Learning Views for Time-Series Data in Contrastive Learning
Figure 3 for LEAVES: Learning Views for Time-Series Data in Contrastive Learning
Figure 4 for LEAVES: Learning Views for Time-Series Data in Contrastive Learning
Viaarxiv icon

Learning with Style: Continual Semantic Segmentation Across Tasks and Domains

Oct 13, 2022
Marco Toldo, Umberto Michieli, Pietro Zanuttigh

Figure 1 for Learning with Style: Continual Semantic Segmentation Across Tasks and Domains
Figure 2 for Learning with Style: Continual Semantic Segmentation Across Tasks and Domains
Figure 3 for Learning with Style: Continual Semantic Segmentation Across Tasks and Domains
Figure 4 for Learning with Style: Continual Semantic Segmentation Across Tasks and Domains
Viaarxiv icon

Geometric Active Learning for Segmentation of Large 3D Volumes

Oct 13, 2022
Thomas Lang, Tomas Sauer

Figure 1 for Geometric Active Learning for Segmentation of Large 3D Volumes
Figure 2 for Geometric Active Learning for Segmentation of Large 3D Volumes
Figure 3 for Geometric Active Learning for Segmentation of Large 3D Volumes
Figure 4 for Geometric Active Learning for Segmentation of Large 3D Volumes
Viaarxiv icon

Dynamic MDETR: A Dynamic Multimodal Transformer Decoder for Visual Grounding

Sep 28, 2022
Fengyuan Shi, Ruopeng Gao, Weilin Huang, Limin Wang

Figure 1 for Dynamic MDETR: A Dynamic Multimodal Transformer Decoder for Visual Grounding
Figure 2 for Dynamic MDETR: A Dynamic Multimodal Transformer Decoder for Visual Grounding
Figure 3 for Dynamic MDETR: A Dynamic Multimodal Transformer Decoder for Visual Grounding
Figure 4 for Dynamic MDETR: A Dynamic Multimodal Transformer Decoder for Visual Grounding
Viaarxiv icon

LoopITR: Combining Dual and Cross Encoder Architectures for Image-Text Retrieval

Mar 10, 2022
Jie Lei, Xinlei Chen, Ning Zhang, Mengjiao Wang, Mohit Bansal, Tamara L. Berg, Licheng Yu

Figure 1 for LoopITR: Combining Dual and Cross Encoder Architectures for Image-Text Retrieval
Figure 2 for LoopITR: Combining Dual and Cross Encoder Architectures for Image-Text Retrieval
Figure 3 for LoopITR: Combining Dual and Cross Encoder Architectures for Image-Text Retrieval
Figure 4 for LoopITR: Combining Dual and Cross Encoder Architectures for Image-Text Retrieval
Viaarxiv icon

CES-KD: Curriculum-based Expert Selection for Guided Knowledge Distillation

Sep 15, 2022
Ibtihel Amara, Maryam Ziaeefard, Brett H. Meyer, Warren Gross, James J. Clark

Figure 1 for CES-KD: Curriculum-based Expert Selection for Guided Knowledge Distillation
Figure 2 for CES-KD: Curriculum-based Expert Selection for Guided Knowledge Distillation
Figure 3 for CES-KD: Curriculum-based Expert Selection for Guided Knowledge Distillation
Figure 4 for CES-KD: Curriculum-based Expert Selection for Guided Knowledge Distillation
Viaarxiv icon