"Text": models, code, and papers

Emage: Non-Autoregressive Text-to-Image Generation

Dec 22, 2023
Zhangyin Feng, Runyi Hu, Liangxin Liu, Fan Zhang, Duyu Tang, Yong Dai, Xiaocheng Feng, Jiwei Li, Bing Qin, Shuming Shi

Prompt Decoupling for Text-to-Image Person Re-identification

Jan 04, 2024
Weihao Li, Lei Tan, Pingyang Dai, Yan Zhang

MLLM-Protector: Ensuring MLLM's Safety without Hurting Performance

Jan 17, 2024
Renjie Pi, Tianyang Han, Yueqi Xie, Rui Pan, Qing Lian, Hanze Dong, Jipeng Zhang, Tong Zhang

MLLM-Tool: A Multimodal Large Language Model For Tool Agent Learning

Jan 24, 2024
Chenyu Wang, Weixin Luo, Qianyu Chen, Haonan Mai, Jindi Guo, Sixun Dong, Xiaohua Xuan, Zhengxin Li, Lin Ma, Shenghua Gao

Machine Translation Models are Zero-Shot Detectors of Translation Direction

Jan 12, 2024
Michelle Wastl, Jannis Vamvas, Rico Sennrich

Your Student is Better Than Expected: Adaptive Teacher-Student Collaboration for Text-Conditional Diffusion Models

Dec 28, 2023
Nikita Starodubcev, Artem Fedorov, Artem Babenko, Dmitry Baranchuk

MCMChaos: Improvising Rap Music with MCMC Methods and Chaos Theory

Jan 15, 2024
Robert G. Kimelman

MM-TTS: Multi-modal Prompt based Style Transfer for Expressive Text-to-Speech Synthesis

Dec 28, 2023
Wenhao Guan, Yishuang Li, Tao Li, Hukai Huang, Feng Wang, Jiayan Lin, Lingyan Huang, Lin Li, Qingyang Hong

Research on Multilingual Natural Scene Text Detection Algorithm

Dec 18, 2023
Tao Wang

Prototype-Guided Text-based Person Search based on Rich Chinese Descriptions

Dec 22, 2023
Ziqiang Wu, Bingpeng Ma