"Text": models, code, and papers

Finetuning Text-to-Image Diffusion Models for Fairness

Nov 11, 2023
Xudong Shen, Chao Du, Tianyu Pang, Min Lin, Yongkang Wong, Mohan Kankanhalli

SmartEdit: Exploring Complex Instruction-based Image Editing with Multimodal Large Language Models

Dec 11, 2023
Yuzhou Huang, Liangbin Xie, Xintao Wang, Ziyang Yuan, Xiaodong Cun, Yixiao Ge, Jiantao Zhou, Chao Dong, Rui Huang, Ruimao Zhang, Ying Shan

GTA: Gated Toxicity Avoidance for LM Performance Preservation

Dec 11, 2023
Heegyu Kim, Hyunsouk Cho

Assessing LLMs for Moral Value Pluralism

Dec 08, 2023
Noam Benkler, Drisana Mosaphir, Scott Friedman, Andrew Smart, Sonja Schmer-Galunder

FETV: A Benchmark for Fine-Grained Evaluation of Open-Domain Text-to-Video Generation

Nov 08, 2023
Yuanxin Liu, Lei Li, Shuhuai Ren, Rundong Gao, Shicheng Li, Sishuo Chen, Xu Sun, Lu Hou

KOALA: Self-Attention Matters in Knowledge Distillation of Latent Diffusion Models for Memory-Efficient and Fast Image Synthesis

Dec 07, 2023
Youngwan Lee, Kwanyong Park, Yoorhim Cho, Yong-Ju Lee, Sung Ju Hwang

Image Content Generation with Causal Reasoning

Dec 12, 2023
Xiaochuan Li, Baoyu Fan, Runze Zhang, Liang Jin, Di Wang, Zhenhua Guo, Yaqian Zhao, Rengang Li

Multilingual large language models leak human stereotypes across language boundaries

Dec 12, 2023
Yang Trista Cao, Anna Sotnikova, Jieyu Zhao, Linda X. Zou, Rachel Rudinger, Hal Daumé III

MedXChat: Bridging CXR Modalities with a Unified Multimodal Large Model

Dec 04, 2023
Ling Yang, Zhanyu Wang, Luping Zhou

Multimodal Speech Emotion Recognition Using Modality-specific Self-Supervised Frameworks

Dec 04, 2023
Rutherford Agbeshi Patamia, Paulo E. Santos, Kingsley Nketia Acheampong, Favour Ekong, Kwabena Sarpong, She Kun
