Alert button

"Text": models, code, and papers
Alert button

An Item is Worth a Prompt: Versatile Image Editing with Disentangled Control

Mar 07, 2024
Aosong Feng, Weikang Qiu, Jinbin Bai, Kaicheng Zhou, Zhen Dong, Xiao Zhang, Rex Ying, Leandros Tassiulas

Viaarxiv icon

VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis

Mar 13, 2024
Enric Corona, Andrei Zanfir, Eduard Gabriel Bazavan, Nikos Kolotouros, Thiemo Alldieck, Cristian Sminchisescu

Viaarxiv icon

Debiasing Text-to-Image Diffusion Models

Feb 22, 2024
Ruifei He, Chuhui Xue, Haoru Tan, Wenqing Zhang, Yingchen Yu, Song Bai, Xiaojuan Qi

Viaarxiv icon

Text Diffusion with Reinforced Conditioning

Feb 19, 2024
Yuxuan Liu, Tianchi Yang, Shaohan Huang, Zihan Zhang, Haizhen Huang, Furu Wei, Weiwei Deng, Feng Sun, Qi Zhang

Viaarxiv icon

Medical Speech Symptoms Classification via Disentangled Representation

Mar 08, 2024
Jianzong Wang, Pengcheng Li, Xulong Zhang, Ning Cheng, Jing Xiao

Figure 1 for Medical Speech Symptoms Classification via Disentangled Representation
Figure 2 for Medical Speech Symptoms Classification via Disentangled Representation
Figure 3 for Medical Speech Symptoms Classification via Disentangled Representation
Figure 4 for Medical Speech Symptoms Classification via Disentangled Representation
Viaarxiv icon

A$^{3}$lign-DFER: Pioneering Comprehensive Dynamic Affective Alignment for Dynamic Facial Expression Recognition with CLIP

Mar 07, 2024
Zeng Tao, Yan Wang, Junxiong Lin, Haoran Wang, Xinji Mai, Jiawen Yu, Xuan Tong, Ziheng Zhou, Shaoqi Yan, Qing Zhao, Liyuan Han, Wenqiang Zhang

Figure 1 for A$^{3}$lign-DFER: Pioneering Comprehensive Dynamic Affective Alignment for Dynamic Facial Expression Recognition with CLIP
Figure 2 for A$^{3}$lign-DFER: Pioneering Comprehensive Dynamic Affective Alignment for Dynamic Facial Expression Recognition with CLIP
Figure 3 for A$^{3}$lign-DFER: Pioneering Comprehensive Dynamic Affective Alignment for Dynamic Facial Expression Recognition with CLIP
Figure 4 for A$^{3}$lign-DFER: Pioneering Comprehensive Dynamic Affective Alignment for Dynamic Facial Expression Recognition with CLIP
Viaarxiv icon

Contrastive Prompts Improve Disentanglement in Text-to-Image Diffusion Models

Feb 21, 2024
Chen Wu, Fernando De la Torre

Viaarxiv icon

Machine-generated Text Localization

Feb 19, 2024
Zhongping Zhang, Wenda Qin, Bryan A. Plummer

Viaarxiv icon

A Systematic Review of Data-to-Text NLG

Feb 20, 2024
Chinonso Cynthia Osuji, Thiago Castro Ferreira, Brian Davis

Viaarxiv icon

What makes an image realistic?

Mar 11, 2024
Lucas Theis

Viaarxiv icon