Alert button

"Image": models, code, and papers
Alert button

CAT-Seg: Cost Aggregation for Open-Vocabulary Semantic Segmentation

Mar 21, 2023
Seokju Cho, Heeseong Shin, Sunghwan Hong, Seungjun An, Seungjun Lee, Anurag Arnab, Paul Hongsuck Seo, Seungryong Kim

Figure 1 for CAT-Seg: Cost Aggregation for Open-Vocabulary Semantic Segmentation
Figure 2 for CAT-Seg: Cost Aggregation for Open-Vocabulary Semantic Segmentation
Figure 3 for CAT-Seg: Cost Aggregation for Open-Vocabulary Semantic Segmentation
Figure 4 for CAT-Seg: Cost Aggregation for Open-Vocabulary Semantic Segmentation
Viaarxiv icon

CHIMLE: Conditional Hierarchical IMLE for Multimodal Conditional Image Synthesis

Nov 25, 2022
Shichong Peng, Alireza Moazeni, Ke Li

Figure 1 for CHIMLE: Conditional Hierarchical IMLE for Multimodal Conditional Image Synthesis
Figure 2 for CHIMLE: Conditional Hierarchical IMLE for Multimodal Conditional Image Synthesis
Figure 3 for CHIMLE: Conditional Hierarchical IMLE for Multimodal Conditional Image Synthesis
Figure 4 for CHIMLE: Conditional Hierarchical IMLE for Multimodal Conditional Image Synthesis
Viaarxiv icon

Audio-Driven Talking Face Generation with Diverse yet Realistic Facial Animations

Apr 18, 2023
Rongliang Wu, Yingchen Yu, Fangneng Zhan, Jiahui Zhang, Xiaoqin Zhang, Shijian Lu

Figure 1 for Audio-Driven Talking Face Generation with Diverse yet Realistic Facial Animations
Figure 2 for Audio-Driven Talking Face Generation with Diverse yet Realistic Facial Animations
Figure 3 for Audio-Driven Talking Face Generation with Diverse yet Realistic Facial Animations
Figure 4 for Audio-Driven Talking Face Generation with Diverse yet Realistic Facial Animations
Viaarxiv icon

Patch-aware Batch Normalization for Improving Cross-domain Robustness

Apr 06, 2023
Lei Qi, Dongjia Zhao, Yinghuan Shi, Xin Geng

Figure 1 for Patch-aware Batch Normalization for Improving Cross-domain Robustness
Figure 2 for Patch-aware Batch Normalization for Improving Cross-domain Robustness
Figure 3 for Patch-aware Batch Normalization for Improving Cross-domain Robustness
Figure 4 for Patch-aware Batch Normalization for Improving Cross-domain Robustness
Viaarxiv icon

Making Vision Transformers Efficient from A Token Sparsification View

Mar 30, 2023
Shuning Chang, Pichao Wang, Ming Lin, Fan Wang, David Junhao Zhang, Rong Jin, Mike Zheng Shou

Figure 1 for Making Vision Transformers Efficient from A Token Sparsification View
Figure 2 for Making Vision Transformers Efficient from A Token Sparsification View
Figure 3 for Making Vision Transformers Efficient from A Token Sparsification View
Figure 4 for Making Vision Transformers Efficient from A Token Sparsification View
Viaarxiv icon

WSSL: Weighted Self-supervised Learning Framework For Image-inpainting

Nov 25, 2022
Shubham Gupta, Rahul Kunigal Ravishankar, Madhoolika Gangaraju, Poojasree Dwarkanath, Natarajan Subramanyam

Figure 1 for WSSL: Weighted Self-supervised Learning Framework For Image-inpainting
Figure 2 for WSSL: Weighted Self-supervised Learning Framework For Image-inpainting
Figure 3 for WSSL: Weighted Self-supervised Learning Framework For Image-inpainting
Figure 4 for WSSL: Weighted Self-supervised Learning Framework For Image-inpainting
Viaarxiv icon

Equivariant Similarity for Vision-Language Foundation Models

Mar 25, 2023
Tan Wang, Kevin Lin, Linjie Li, Chung-Ching Lin, Zhengyuan Yang, Hanwang Zhang, Zicheng Liu, Lijuan Wang

Figure 1 for Equivariant Similarity for Vision-Language Foundation Models
Figure 2 for Equivariant Similarity for Vision-Language Foundation Models
Figure 3 for Equivariant Similarity for Vision-Language Foundation Models
Figure 4 for Equivariant Similarity for Vision-Language Foundation Models
Viaarxiv icon

Data-efficient Large Scale Place Recognition with Graded Similarity Supervision

Mar 25, 2023
Maria Leyva-Vallina, Nicola Strisciuglio, Nicolai Petkov

Figure 1 for Data-efficient Large Scale Place Recognition with Graded Similarity Supervision
Figure 2 for Data-efficient Large Scale Place Recognition with Graded Similarity Supervision
Figure 3 for Data-efficient Large Scale Place Recognition with Graded Similarity Supervision
Figure 4 for Data-efficient Large Scale Place Recognition with Graded Similarity Supervision
Viaarxiv icon

One Transformer Fits All Distributions in Multi-Modal Diffusion at Scale

Mar 12, 2023
Fan Bao, Shen Nie, Kaiwen Xue, Chongxuan Li, Shi Pu, Yaole Wang, Gang Yue, Yue Cao, Hang Su, Jun Zhu

Figure 1 for One Transformer Fits All Distributions in Multi-Modal Diffusion at Scale
Figure 2 for One Transformer Fits All Distributions in Multi-Modal Diffusion at Scale
Figure 3 for One Transformer Fits All Distributions in Multi-Modal Diffusion at Scale
Figure 4 for One Transformer Fits All Distributions in Multi-Modal Diffusion at Scale
Viaarxiv icon

Weakly-Supervised 3D Medical Image Segmentation using Geometric Prior and Contrastive Similarity

Feb 04, 2023
Hao Du, Qihua Dong, Yan Xu, Jing Liao

Figure 1 for Weakly-Supervised 3D Medical Image Segmentation using Geometric Prior and Contrastive Similarity
Figure 2 for Weakly-Supervised 3D Medical Image Segmentation using Geometric Prior and Contrastive Similarity
Figure 3 for Weakly-Supervised 3D Medical Image Segmentation using Geometric Prior and Contrastive Similarity
Figure 4 for Weakly-Supervised 3D Medical Image Segmentation using Geometric Prior and Contrastive Similarity
Viaarxiv icon