Alert button

"Text": models, code, and papers
Alert button

Less is More: Mitigating Multimodal Hallucination from an EOS Decision Perspective

Feb 22, 2024
Zihao Yue, Liang Zhang, Qin Jin

Viaarxiv icon

Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments

Feb 22, 2024
Yu Gu, Yiheng Shu, Hao Yu, Xiao Liu, Yuxiao Dong, Jie Tang, Jayanth Srinivasa, Hugo Latapie, Yu Su

Viaarxiv icon

UMAIR-FPS: User-aware Multi-modal Animation Illustration Recommendation Fusion with Painting Style

Feb 16, 2024
Yan Kang, Hao Lin, Mingjian Yang, Shin-Jye Lee

Viaarxiv icon

Do Large Language Models Understand Logic or Just Mimick Context?

Feb 19, 2024
Junbing Yan, Chengyu Wang, Jun Huang, Wei Zhang

Viaarxiv icon

User-LLM: Efficient LLM Contextualization with User Embeddings

Feb 21, 2024
Lin Ning, Luyang Liu, Jiaxing Wu, Neo Wu, Devora Berlowitz, Sushant Prakash, Bradley Green, Shawn O'Banion, Jun Xie

Viaarxiv icon

Punctuation Restoration Improves Structure Understanding without Supervision

Feb 21, 2024
Junghyun Min, Minho Lee, Woochul Lee, Yeonsoo Lee

Viaarxiv icon

Human Aesthetic Preference-Based Large Text-to-Image Model Personalization: Kandinsky Generation as an Example

Feb 09, 2024
Aven-Le Zhou, Yu-Ao Wang, Wei Wu, Kang Zhang

Viaarxiv icon

VQAttack: Transferable Adversarial Attacks on Visual Question Answering via Pre-trained Models

Feb 16, 2024
Ziyi Yin, Muchao Ye, Tianrong Zhang, Jiaqi Wang, Han Liu, Jinghui Chen, Ting Wang, Fenglong Ma

Viaarxiv icon

BSPA: Exploring Black-box Stealthy Prompt Attacks against Image Generators

Feb 23, 2024
Yu Tian, Xiao Yang, Yinpeng Dong, Heming Yang, Hang Su, Jun Zhu

Viaarxiv icon

Enhancing Systematic Decompositional Natural Language Inference Using Informal Logic

Feb 22, 2024
Nathaniel Weir, Kate Sanders, Orion Weller, Shreya Sharma, Dongwei Jiang, Zhengping Zhang, Bhavana Dalvi Mishra, Oyvind Tafjord, Peter Jansen, Peter Clark, Benjamin Van Durme

Viaarxiv icon