Alert button

"Text": models, code, and papers
Alert button

Towards Zero-shot Human-Object Interaction Detection via Vision-Language Integration

Mar 12, 2024
Weiying Xue, Qi Liu, Qiwei Xiong, Yuxiao Wang, Zhenao Wei, Xiaofen Xing, Xiangmin Xu

Viaarxiv icon

VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis

Mar 13, 2024
Enric Corona, Andrei Zanfir, Eduard Gabriel Bazavan, Nikos Kolotouros, Thiemo Alldieck, Cristian Sminchisescu

Viaarxiv icon

Debiasing Text-to-Image Diffusion Models

Feb 22, 2024
Ruifei He, Chuhui Xue, Haoru Tan, Wenqing Zhang, Yingchen Yu, Song Bai, Xiaojuan Qi

Viaarxiv icon

Text Diffusion with Reinforced Conditioning

Feb 19, 2024
Yuxuan Liu, Tianchi Yang, Shaohan Huang, Zihan Zhang, Haizhen Huang, Furu Wei, Weiwei Deng, Feng Sun, Qi Zhang

Viaarxiv icon

Medical Speech Symptoms Classification via Disentangled Representation

Mar 08, 2024
Jianzong Wang, Pengcheng Li, Xulong Zhang, Ning Cheng, Jing Xiao

Figure 1 for Medical Speech Symptoms Classification via Disentangled Representation
Figure 2 for Medical Speech Symptoms Classification via Disentangled Representation
Figure 3 for Medical Speech Symptoms Classification via Disentangled Representation
Figure 4 for Medical Speech Symptoms Classification via Disentangled Representation
Viaarxiv icon

A$^{3}$lign-DFER: Pioneering Comprehensive Dynamic Affective Alignment for Dynamic Facial Expression Recognition with CLIP

Mar 07, 2024
Zeng Tao, Yan Wang, Junxiong Lin, Haoran Wang, Xinji Mai, Jiawen Yu, Xuan Tong, Ziheng Zhou, Shaoqi Yan, Qing Zhao, Liyuan Han, Wenqiang Zhang

Figure 1 for A$^{3}$lign-DFER: Pioneering Comprehensive Dynamic Affective Alignment for Dynamic Facial Expression Recognition with CLIP
Figure 2 for A$^{3}$lign-DFER: Pioneering Comprehensive Dynamic Affective Alignment for Dynamic Facial Expression Recognition with CLIP
Figure 3 for A$^{3}$lign-DFER: Pioneering Comprehensive Dynamic Affective Alignment for Dynamic Facial Expression Recognition with CLIP
Figure 4 for A$^{3}$lign-DFER: Pioneering Comprehensive Dynamic Affective Alignment for Dynamic Facial Expression Recognition with CLIP
Viaarxiv icon

Attack Deterministic Conditional Image Generative Models for Diverse and Controllable Generation

Mar 13, 2024
Tianyi Chu, Wei Xing, Jiafu Chen, Zhizhong Wang, Jiakai Sun, Lei Zhao, Haibo Chen, Huaizhong Lin

Viaarxiv icon

Contrastive Prompts Improve Disentanglement in Text-to-Image Diffusion Models

Feb 21, 2024
Chen Wu, Fernando De la Torre

Viaarxiv icon

Stable-Makeup: When Real-World Makeup Transfer Meets Diffusion Model

Mar 12, 2024
Yuxuan Zhang, Lifu Wei, Qing Zhang, Yiren Song, Jiaming Liu, Huaxia Li, Xu Tang, Yao Hu, Haibo Zhao

Viaarxiv icon

Large, Small or Both: A Novel Data Augmentation Framework Based on Language Models for Debiasing Opinion Summarization

Mar 12, 2024
Yanyue Zhang, Pengfei Li, Yilong Lai, Deyu Zhou

Viaarxiv icon