Alert button

"Text": models, code, and papers
Alert button

Taiyi-Diffusion-XL: Advancing Bilingual Text-to-Image Generation with Large Vision-Language Model Support

Jan 26, 2024
Xiaojun Wu, Dixiang Zhang, Ruyi Gan, Junyu Lu, Ziwei Wu, Renliang Sun, Jiaxing Zhang, Pingjian Zhang, Yan Song

Viaarxiv icon

VIPTR: A Vision Permutable Extractor for Fast and Efficient Scene Text Recognition

Jan 24, 2024
Xianfu Cheng, Weixiao Zhou, Xiang Li, Xiaoming Chen, Jian Yang, Tongliang Li, Zhoujun Li

Viaarxiv icon

Detecting Racist Text in Bengali: An Ensemble Deep Learning Framework

Jan 30, 2024
S. S. Saruar, Nusrat, Sadia

Viaarxiv icon

TrICy: Trigger-guided Data-to-text Generation with Intent aware Attention-Copy

Jan 25, 2024
Vibhav Agarwal, Sourav Ghosh, Harichandana BSS, Himanshu Arora, Barath Raj Kandur Raja

Viaarxiv icon

Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs

Jan 22, 2024
Ling Yang, Zhaochen Yu, Chenlin Meng, Minkai Xu, Stefano Ermon, Bin Cui

Viaarxiv icon

AraSpider: Democratizing Arabic-to-SQL

Feb 12, 2024
Ahmed Heakl, Youssef Mohamed, Ahmed B. Zaky

Viaarxiv icon

Health Text Simplification: An Annotated Corpus for Digestive Cancer Education and Novel Strategies for Reinforcement Learning

Jan 26, 2024
Md Mushfiqur Rahman, Mohammad Sabik Irbaz, Kai North, Michelle S. Williams, Marcos Zampieri, Kevin Lybarger

Viaarxiv icon

Language Model Sentence Completion with a Parser-Driven Rhetorical Control Method

Feb 09, 2024
Joshua Zingale, Jugal Kalita

Viaarxiv icon

EFUF: Efficient Fine-grained Unlearning Framework for Mitigating Hallucinations in Multimodal Large Language Models

Feb 15, 2024
Shangyu Xing, Fei Zhao, Zhen Wu, Tuo An, Weihao Chen, Chunhui Li, Jianbing Zhang, Xinyu Dai

Viaarxiv icon

Exploring the Adversarial Capabilities of Large Language Models

Feb 15, 2024
Lukas Struppek, Minh Hieu Le, Dominik Hintersdorf, Kristian Kersting

Viaarxiv icon