Alert button

"Text": models, code, and papers
Alert button

Image Clustering Conditioned on Text Criteria

Oct 30, 2023
Sehyun Kwon, Jaeseung Park, Minkyu Kim, Jaewoong Cho, Ernest K. Ryu, Kangwook Lee

Figure 1 for Image Clustering Conditioned on Text Criteria
Figure 2 for Image Clustering Conditioned on Text Criteria
Figure 3 for Image Clustering Conditioned on Text Criteria
Figure 4 for Image Clustering Conditioned on Text Criteria
Viaarxiv icon

Alpha-CLIP: A CLIP Model Focusing on Wherever You Want

Dec 13, 2023
Zeyi Sun, Ye Fang, Tong Wu, Pan Zhang, Yuhang Zang, Shu Kong, Yuanjun Xiong, Dahua Lin, Jiaqi Wang

Viaarxiv icon

Cross-modal Prominent Fragments Enhancement Aligning Network for Image-text Retrieval

Nov 03, 2023
Yang Zhang

Viaarxiv icon

IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks

Dec 04, 2023
Jiarui Xu, Yossi Gandelsman, Amir Bar, Jianwei Yang, Jianfeng Gao, Trevor Darrell, Xiaolong Wang

Figure 1 for IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks
Figure 2 for IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks
Figure 3 for IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks
Figure 4 for IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks
Viaarxiv icon

X-Adapter: Adding Universal Compatibility of Plugins for Upgraded Diffusion Model

Dec 04, 2023
Lingmin Ran, Xiaodong Cun, JiaWei Liu, Rui Zhao, Song Zijie, Xintao Wang, Jussi Keppo, Mike Zheng Shou

Viaarxiv icon

MagicStick: Controllable Video Editing via Control Handle Transformations

Dec 05, 2023
Yue Ma, Xiaodong Cun, Yingqing He, Chenyang Qi, Xintao Wang, Ying Shan, Xiu Li, Qifeng Chen

Figure 1 for MagicStick: Controllable Video Editing via Control Handle Transformations
Figure 2 for MagicStick: Controllable Video Editing via Control Handle Transformations
Figure 3 for MagicStick: Controllable Video Editing via Control Handle Transformations
Figure 4 for MagicStick: Controllable Video Editing via Control Handle Transformations
Viaarxiv icon

Adaptive Compression of the Latent Space in Variational Autoencoders

Dec 11, 2023
Gabriela Sejnova, Michal Vavrecka, Karla Stepanova

Viaarxiv icon

A Pipeline For Discourse Circuits From CCG

Nov 29, 2023
Jonathon Liu, Razin A. Shaikh, Benjamin Rodatz, Richie Yeung, Bob Coecke

Viaarxiv icon

Exploring the Consistency, Quality and Challenges in Manual and Automated Coding of Free-text Diagnoses from Hospital Outpatient Letters

Nov 17, 2023
Warren Del-Pinto, George Demetriou, Meghna Jani, Rikesh Patel, Leanne Gray, Alex Bulcock, Niels Peek, Andrew S. Kanter, William G Dixon, Goran Nenadic

Viaarxiv icon

Text-to-Sticker: Style Tailoring Latent Diffusion Models for Human Expression

Nov 17, 2023
Animesh Sinha, Bo Sun, Anmol Kalia, Arantxa Casanova, Elliot Blanchard, David Yan, Winnie Zhang, Tony Nelli, Jiahui Chen, Hardik Shah, Licheng Yu, Mitesh Kumar Singh, Ankit Ramchandani, Maziar Sanjabi, Sonal Gupta, Amy Bearman, Dhruv Mahajan

Viaarxiv icon