Alert button

"Image": models, code, and papers
Alert button

LGFCTR: Local and Global Feature Convolutional Transformer for Image Matching

Nov 29, 2023
Wenhao Zhong, Jie Jiang

Viaarxiv icon

Compound Text-Guided Prompt Tuning via Image-Adaptive Cues

Dec 11, 2023
Hao Tan, Jun Li, Yizhuang Zhou, Jun Wan, Zhen Lei, Xiangyu Zhang

Viaarxiv icon

Topic-VQ-VAE: Leveraging Latent Codebooks for Flexible Topic-Guided Document Generation

Dec 15, 2023
YoungJoon Yoo, Jongwon Choi

Figure 1 for Topic-VQ-VAE: Leveraging Latent Codebooks for Flexible Topic-Guided Document Generation
Figure 2 for Topic-VQ-VAE: Leveraging Latent Codebooks for Flexible Topic-Guided Document Generation
Figure 3 for Topic-VQ-VAE: Leveraging Latent Codebooks for Flexible Topic-Guided Document Generation
Figure 4 for Topic-VQ-VAE: Leveraging Latent Codebooks for Flexible Topic-Guided Document Generation
Viaarxiv icon

Multiscale Vision Transformer With Deep Clustering-Guided Refinement for Weakly Supervised Object Localization

Dec 15, 2023
David Kim, Sinhae Cha, Byeongkeun Kang

Viaarxiv icon

Towards Transfer Learning for Large-Scale Image Classification Using Annealing-based Quantum Boltzmann Machines

Nov 27, 2023
Daniëlle Schuman, Leo Sünkel, Philipp Altmann, Jonas Stein, Christoph Roch, Thomas Gabor, Claudia Linnhoff-Popien

Viaarxiv icon

EVCap: Retrieval-Augmented Image Captioning with External Visual-Name Memory for Open-World Comprehension

Nov 27, 2023
Jiaxuan Li, Duc Minh Vo, Akihiro Sugimoto, Hideki Nakayama

Viaarxiv icon

VILA: On Pre-training for Visual Language Models

Dec 14, 2023
Ji Lin, Hongxu Yin, Wei Ping, Yao Lu, Pavlo Molchanov, Andrew Tao, Huizi Mao, Jan Kautz, Mohammad Shoeybi, Song Han

Figure 1 for VILA: On Pre-training for Visual Language Models
Figure 2 for VILA: On Pre-training for Visual Language Models
Figure 3 for VILA: On Pre-training for Visual Language Models
Figure 4 for VILA: On Pre-training for Visual Language Models
Viaarxiv icon

Exploring the Naturalness of AI-Generated Images

Dec 14, 2023
Zijian Chen, Wei Sun, Haoning Wu, Zicheng Zhang, Jun Jia, Xiongkuo Min, Guangtao Zhai, Wenjun Zhang

Viaarxiv icon

Domain Transfer in Latent Space (DTLS) Wins on Image Super-Resolution -- a Non-Denoising Model

Nov 20, 2023
Chun-Chuen Hui, Wan-Chi Siu, Ngai-Fong Law

Viaarxiv icon

Enhancing Object Coherence in Layout-to-Image Synthesis

Nov 17, 2023
Yibin Wang, Weizhong Zhang, Jianwei Zheng, Cheng Jin

Viaarxiv icon