Alert button

"Image": models, code, and papers
Alert button

WeatherProof: Leveraging Language Guidance for Semantic Segmentation in Adverse Weather

Mar 21, 2024
Blake Gella, Howard Zhang, Rishi Upadhyay, Tiffany Chang, Nathan Wei, Matthew Waliman, Yunhao Bao, Celso de Melo, Alex Wong, Achuta Kadambi

Viaarxiv icon

Zero-Shot Image Feature Consensus with Deep Functional Maps

Mar 18, 2024
Xinle Cheng, Congyue Deng, Adam Harley, Yixin Zhu, Leonidas Guibas

Figure 1 for Zero-Shot Image Feature Consensus with Deep Functional Maps
Figure 2 for Zero-Shot Image Feature Consensus with Deep Functional Maps
Figure 3 for Zero-Shot Image Feature Consensus with Deep Functional Maps
Figure 4 for Zero-Shot Image Feature Consensus with Deep Functional Maps
Viaarxiv icon

Towards Understanding Cross and Self-Attention in Stable Diffusion for Text-Guided Image Editing

Add code
Bookmark button
Alert button
Mar 06, 2024
Bingyan Liu, Chengyu Wang, Tingfeng Cao, Kui Jia, Jun Huang

Figure 1 for Towards Understanding Cross and Self-Attention in Stable Diffusion for Text-Guided Image Editing
Figure 2 for Towards Understanding Cross and Self-Attention in Stable Diffusion for Text-Guided Image Editing
Figure 3 for Towards Understanding Cross and Self-Attention in Stable Diffusion for Text-Guided Image Editing
Figure 4 for Towards Understanding Cross and Self-Attention in Stable Diffusion for Text-Guided Image Editing
Viaarxiv icon

Total Disentanglement of Font Images into Style and Character Class Features

Mar 19, 2024
Daichi Haraguchi, Wataru Shimoda, Kota Yamaguchi, Seiichi Uchida

Figure 1 for Total Disentanglement of Font Images into Style and Character Class Features
Figure 2 for Total Disentanglement of Font Images into Style and Character Class Features
Figure 3 for Total Disentanglement of Font Images into Style and Character Class Features
Figure 4 for Total Disentanglement of Font Images into Style and Character Class Features
Viaarxiv icon

PromptCharm: Text-to-Image Generation through Multi-modal Prompting and Refinement

Add code
Bookmark button
Alert button
Mar 06, 2024
Zhijie Wang, Yuheng Huang, Da Song, Lei Ma, Tianyi Zhang

Figure 1 for PromptCharm: Text-to-Image Generation through Multi-modal Prompting and Refinement
Figure 2 for PromptCharm: Text-to-Image Generation through Multi-modal Prompting and Refinement
Figure 3 for PromptCharm: Text-to-Image Generation through Multi-modal Prompting and Refinement
Figure 4 for PromptCharm: Text-to-Image Generation through Multi-modal Prompting and Refinement
Viaarxiv icon

An Upload-Efficient Scheme for Transferring Knowledge From a Server-Side Pre-trained Generator to Clients in Heterogeneous Federated Learning

Add code
Bookmark button
Alert button
Mar 23, 2024
Jianqing Zhang, Yang Liu, Yang Hua, Jian Cao

Viaarxiv icon

Open-Vocabulary Scene Text Recognition via Pseudo-Image Labeling and Margin Loss

Mar 12, 2024
Xuhua Ren, Hengcan Shi, Jin Li

Figure 1 for Open-Vocabulary Scene Text Recognition via Pseudo-Image Labeling and Margin Loss
Figure 2 for Open-Vocabulary Scene Text Recognition via Pseudo-Image Labeling and Margin Loss
Figure 3 for Open-Vocabulary Scene Text Recognition via Pseudo-Image Labeling and Margin Loss
Figure 4 for Open-Vocabulary Scene Text Recognition via Pseudo-Image Labeling and Margin Loss
Viaarxiv icon

Scaling Rectified Flow Transformers for High-Resolution Image Synthesis

Add code
Bookmark button
Alert button
Mar 05, 2024
Patrick Esser, Sumith Kulal, Andreas Blattmann, Rahim Entezari, Jonas Müller, Harry Saini, Yam Levi, Dominik Lorenz, Axel Sauer, Frederic Boesel, Dustin Podell, Tim Dockhorn, Zion English, Kyle Lacey, Alex Goodwin, Yannik Marek, Robin Rombach

Figure 1 for Scaling Rectified Flow Transformers for High-Resolution Image Synthesis
Figure 2 for Scaling Rectified Flow Transformers for High-Resolution Image Synthesis
Figure 3 for Scaling Rectified Flow Transformers for High-Resolution Image Synthesis
Figure 4 for Scaling Rectified Flow Transformers for High-Resolution Image Synthesis
Viaarxiv icon

RAR: Retrieving And Ranking Augmented MLLMs for Visual Recognition

Add code
Bookmark button
Alert button
Mar 20, 2024
Ziyu Liu, Zeyi Sun, Yuhang Zang, Wei Li, Pan Zhang, Xiaoyi Dong, Yuanjun Xiong, Dahua Lin, Jiaqi Wang

Figure 1 for RAR: Retrieving And Ranking Augmented MLLMs for Visual Recognition
Figure 2 for RAR: Retrieving And Ranking Augmented MLLMs for Visual Recognition
Figure 3 for RAR: Retrieving And Ranking Augmented MLLMs for Visual Recognition
Figure 4 for RAR: Retrieving And Ranking Augmented MLLMs for Visual Recognition
Viaarxiv icon

Average Calibration Error: A Differentiable Loss for Improved Reliability in Image Segmentation

Add code
Bookmark button
Alert button
Mar 11, 2024
Theodore Barfoot, Luis Garcia-Peraza-Herrera, Ben Glocker, Tom Vercauteren

Figure 1 for Average Calibration Error: A Differentiable Loss for Improved Reliability in Image Segmentation
Figure 2 for Average Calibration Error: A Differentiable Loss for Improved Reliability in Image Segmentation
Figure 3 for Average Calibration Error: A Differentiable Loss for Improved Reliability in Image Segmentation
Figure 4 for Average Calibration Error: A Differentiable Loss for Improved Reliability in Image Segmentation
Viaarxiv icon