Alert button

"Image": models, code, and papers
Alert button

CycleCL: Self-supervised Learning for Periodic Videos

Nov 05, 2023
Matteo Destro, Michael Gygli

Viaarxiv icon

Rotation Invariant Transformer for Recognizing Object in UAVs

Add code
Bookmark button
Alert button
Nov 05, 2023
Shuoyi Chen, Mang Ye, Bo Du

Viaarxiv icon

Sequential Semantic Generative Communication for Progressive Text-to-Image Generation

Sep 08, 2023
Hyelin Nam, Jihong Park, Jinho Choi, Seong-Lyun Kim

Figure 1 for Sequential Semantic Generative Communication for Progressive Text-to-Image Generation
Figure 2 for Sequential Semantic Generative Communication for Progressive Text-to-Image Generation
Viaarxiv icon

Towards Practical and Efficient Image-to-Speech Captioning with Vision-Language Pre-training and Multi-modal Tokens

Add code
Bookmark button
Alert button
Sep 15, 2023
Minsu Kim, Jeongsoo Choi, Soumi Maiti, Jeong Hun Yeo, Shinji Watanabe, Yong Man Ro

Figure 1 for Towards Practical and Efficient Image-to-Speech Captioning with Vision-Language Pre-training and Multi-modal Tokens
Figure 2 for Towards Practical and Efficient Image-to-Speech Captioning with Vision-Language Pre-training and Multi-modal Tokens
Figure 3 for Towards Practical and Efficient Image-to-Speech Captioning with Vision-Language Pre-training and Multi-modal Tokens
Figure 4 for Towards Practical and Efficient Image-to-Speech Captioning with Vision-Language Pre-training and Multi-modal Tokens
Viaarxiv icon

CHITNet: A Complementary to Harmonious Information Transfer Network for Infrared and Visible Image Fusion

Sep 26, 2023
Yafei Zhang, Keying Du, Huafeng Li, Zhengtao Yu, Yu Liu

Figure 1 for CHITNet: A Complementary to Harmonious Information Transfer Network for Infrared and Visible Image Fusion
Figure 2 for CHITNet: A Complementary to Harmonious Information Transfer Network for Infrared and Visible Image Fusion
Figure 3 for CHITNet: A Complementary to Harmonious Information Transfer Network for Infrared and Visible Image Fusion
Figure 4 for CHITNet: A Complementary to Harmonious Information Transfer Network for Infrared and Visible Image Fusion
Viaarxiv icon

Prototypical Contrastive Learning-based CLIP Fine-tuning for Object Re-identification

Oct 26, 2023
Jiachen Li, Xiaojin Gong

Viaarxiv icon

A Survey on Transferability of Adversarial Examples across Deep Neural Networks

Add code
Bookmark button
Alert button
Oct 26, 2023
Jindong Gu, Xiaojun Jia, Pau de Jorge, Wenqain Yu, Xinwei Liu, Avery Ma, Yuan Xun, Anjun Hu, Ashkan Khakzar, Zhijiang Li, Xiaochun Cao, Philip Torr

Viaarxiv icon

Lost Your Style? Navigating with Semantic-Level Approach for Text-to-Outfit Retrieval

Nov 03, 2023
Junkyu Jang, Eugene Hwang, Sung-Hyuk Park

Figure 1 for Lost Your Style? Navigating with Semantic-Level Approach for Text-to-Outfit Retrieval
Figure 2 for Lost Your Style? Navigating with Semantic-Level Approach for Text-to-Outfit Retrieval
Figure 3 for Lost Your Style? Navigating with Semantic-Level Approach for Text-to-Outfit Retrieval
Figure 4 for Lost Your Style? Navigating with Semantic-Level Approach for Text-to-Outfit Retrieval
Viaarxiv icon

Batch-less stochastic gradient descent for compressive learning of deep regularization for image denoising

Oct 02, 2023
Hui Shi, Yann Traonmilin, J-F Aujol

Viaarxiv icon

COMPASS: High-Efficiency Deep Image Compression with Arbitrary-scale Spatial Scalability

Sep 11, 2023
Jongmin Park, Jooyoung Lee, Munchurl Kim

Figure 1 for COMPASS: High-Efficiency Deep Image Compression with Arbitrary-scale Spatial Scalability
Figure 2 for COMPASS: High-Efficiency Deep Image Compression with Arbitrary-scale Spatial Scalability
Figure 3 for COMPASS: High-Efficiency Deep Image Compression with Arbitrary-scale Spatial Scalability
Figure 4 for COMPASS: High-Efficiency Deep Image Compression with Arbitrary-scale Spatial Scalability
Viaarxiv icon