"Image": models, code, and papers

BlazeBVD: Make Scale-Time Equalization Great Again for Blind Video Deflickering

Mar 10, 2024
Xinmin Qiu, Congying Han, Zicheng Zhang, Bonan Li, Tiande Guo, Pingyu Wang, Xuecheng Nie

Via arXiv

Low-Frequency Black-Box Backdoor Attack via Evolutionary Algorithm

Mar 06, 2024
Yanqi Qiao, Dazhuang Liu, Rui Wang, Kaitai Liang

Via arXiv

ACC-ViT : Atrous Convolution's Comeback in Vision Transformers

Mar 07, 2024
Nabil Ibtehaz, Ning Yan, Masood Mortazavi, Daisuke Kihara

Via arXiv

Object-level Geometric Structure Preserving for Natural Image Stitching

Feb 20, 2024
Wenxiao Cai, Wankou Yang

Via arXiv

D4C glove-train: solving the RPM and Bongard-logo problem by distributing and Circumscribing concepts

Mar 09, 2024
Ruizhuo Song, Beiming Yuan

Via arXiv

Multi-Grained Cross-modal Alignment for Learning Open-vocabulary Semantic Segmentation from Text Supervision

Mar 06, 2024
Yajie Liu, Pu Ge, Qingjie Liu, Di Huang

Via arXiv

Decoupled Contrastive Learning for Long-Tailed Recognition

Mar 10, 2024
Shiyu Xuan, Shiliang Zhang

Via arXiv

On depth prediction for autonomous driving using self-supervised learning

Mar 10, 2024
Houssem Boulahbal

Via arXiv

VidProM: A Million-scale Real Prompt-Gallery Dataset for Text-to-Video Diffusion Models

Mar 10, 2024
Wenhao Wang, Yi Yang

Via arXiv

Fourier-basis Functions to Bridge Augmentation Gap: Rethinking Frequency Augmentation in Image Classification

Mar 05, 2024
Puru Vaish, Shunxin Wang, Nicola Strisciuglio

Via arXiv