"Text": models, code, and papers

Lost in Translation: When GPT-4V(ision) Can't See Eye to Eye with Text. A Vision-Language-Consistency Analysis of VLLMs and Beyond

Oct 19, 2023
Xiang Zhang, Senyu Li, Zijun Wu, Ning Shi

Image-Text Pre-Training for Logo Recognition

Sep 18, 2023
Mark Hubenthal, Suren Kumar

CLN-VC: Text-Free Voice Conversion Based on Fine-Grained Style Control and Contrastive Learning with Negative Samples Augmentation

Nov 15, 2023
Yimin Deng, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao

Can We Utilize Pre-trained Language Models within Causal Discovery Algorithms?

Nov 19, 2023
Chanhui Lee, Juhyeon Kim, Yongjun Jeong, Juhyun Lyu, Junghee Kim, Sangmin Lee, Sangjun Han, Hyeokjun Choe, Soyeon Park, Woohyung Lim, Sungbin Lim, Sanghack Lee

Large-Scale Korean Text Dataset for Classifying Biased Speech in Real-World Online Services

Oct 06, 2023
Dasol Choi, Jooyoung Song, Eunsun Lee, Jinwoo Seo, Heejune Park, Dongbin Na

Fuse to Forget: Bias Reduction and Selective Memorization through Model Fusion

Nov 13, 2023
Kerem Zaman, Leshem Choshen, Shashank Srivastava

Legend at ArAIEval Shared Task: Persuasion Technique Detection using a Language-Agnostic Text Representation Model

Oct 14, 2023
Olumide E. Ojo, Olaronke O. Adebanji, Hiram Calvo, Damian O. Dieke, Olumuyiwa E. Ojo, Seye E. Akinsanya, Tolulope O. Abiola, Anna Feldman

EMIT-Diff: Enhancing Medical Image Segmentation via Text-Guided Diffusion Model

Oct 19, 2023
Zheyuan Zhang, Lanhong Yao, Bin Wang, Debesh Jha, Elif Keles, Alpay Medetalibeyoglu, Ulas Bagci

Battle of the Large Language Models: Dolly vs LLaMA vs Vicuna vs Guanaco vs Bard vs ChatGPT -- A Text-to-SQL Parsing Comparison

Oct 16, 2023
Shuo Sun, Yuchen Zhang, Jiahuan Yan, Yuze Gao, Donovan Ong, Bin Chen, Jian Su

FT-Shield: A Watermark Against Unauthorized Fine-tuning in Text-to-Image Diffusion Models

Oct 03, 2023
Yingqian Cui, Jie Ren, Yuping Lin, Han Xu, Pengfei He, Yue Xing, Wenqi Fan, Hui Liu, Jiliang Tang
