Alert button

"Text": models, code, and papers
Alert button

Balancing Act: Distribution-Guided Debiasing in Diffusion Models

Feb 28, 2024
Rishubh Parihar, Abhijnya Bhat, Saswat Mallick, Abhipsa Basu, Jogendra Nath Kundu, R. Venkatesh Babu

Viaarxiv icon

Visual Hallucinations of Multi-modal Large Language Models

Feb 22, 2024
Wen Huang, Hongbin Liu, Minxin Guo, Neil Zhenqiang Gong

Viaarxiv icon

Training-Free Consistent Text-to-Image Generation

Feb 05, 2024
Yoad Tewel, Omri Kaduri, Rinon Gal, Yoni Kasten, Lior Wolf, Gal Chechik, Yuval Atzmon

Viaarxiv icon

A Machine Learning Approach to Detect Customer Satisfaction From Multiple Tweet Parameters

Feb 25, 2024
Md Mahmudul Hasan, Dr. Shaikh Anowarul Fattah

Viaarxiv icon

Get What You Want, Not What You Don't: Image Content Suppression for Text-to-Image Diffusion Models

Feb 08, 2024
Senmao Li, Joost van de Weijer, Taihang Hu, Fahad Shahbaz Khan, Qibin Hou, Yaxing Wang, Jian Yang

Viaarxiv icon

Improving Cross-Domain Low-Resource Text Generation through LLM Post-Editing: A Programmer-Interpreter Approach

Feb 07, 2024
Zhuang Li, Levon Haroutunian, Raj Tumuluri, Philip Cohen, Gholamreza Haffari

Viaarxiv icon

Increasing SAM Zero-Shot Performance on Multimodal Medical Images Using GPT-4 Generated Descriptive Prompts Without Human Annotation

Feb 24, 2024
Zekun Jiang, Dongjie Cheng, Ziyuan Qin, Jun Gao, Qicheng Lao, Kang Li, Le Zhang

Viaarxiv icon

Datasets for Large Language Models: A Comprehensive Survey

Feb 28, 2024
Yang Liu, Jiahuan Cao, Chongyu Liu, Kai Ding, Lianwen Jin

Viaarxiv icon

GraphTranslator: Aligning Graph Model to Large Language Model for Open-ended Tasks

Feb 28, 2024
Mengmei Zhang, Mingwei Sun, Peng Wang, Shen Fan, Yanhu Mo, Xiaoxiao Xu, Hong Liu, Cheng Yang, Chuan Shi

Viaarxiv icon

UniVS: Unified and Universal Video Segmentation with Prompts as Queries

Feb 28, 2024
Minghan Li, Shuai Li, Xindong Zhang, Lei Zhang

Viaarxiv icon