Alert button

"Text": models, code, and papers
Alert button

Cliqueful graphs as a means of calculating the maximal number of maximum cliques of simple graphs

Jul 26, 2023
Dániel Pfeifer

Viaarxiv icon

Enhancing CLIP with GPT-4: Harnessing Visual Descriptions as Prompts

Jul 21, 2023
Mayug Maniparambil, Chris Vorster, Derek Molloy, Noel Murphy, Kevin McGuinness, Noel E. O'Connor

Figure 1 for Enhancing CLIP with GPT-4: Harnessing Visual Descriptions as Prompts
Figure 2 for Enhancing CLIP with GPT-4: Harnessing Visual Descriptions as Prompts
Figure 3 for Enhancing CLIP with GPT-4: Harnessing Visual Descriptions as Prompts
Figure 4 for Enhancing CLIP with GPT-4: Harnessing Visual Descriptions as Prompts
Viaarxiv icon

OxfordTVG-HIC: Can Machine Make Humorous Captions from Images?

Jul 21, 2023
Runjia Li, Shuyang Sun, Mohamed Elhoseiny, Philip Torr

Figure 1 for OxfordTVG-HIC: Can Machine Make Humorous Captions from Images?
Figure 2 for OxfordTVG-HIC: Can Machine Make Humorous Captions from Images?
Figure 3 for OxfordTVG-HIC: Can Machine Make Humorous Captions from Images?
Figure 4 for OxfordTVG-HIC: Can Machine Make Humorous Captions from Images?
Viaarxiv icon

Kosmos-2: Grounding Multimodal Large Language Models to the World

Jul 13, 2023
Zhiliang Peng, Wenhui Wang, Li Dong, Yaru Hao, Shaohan Huang, Shuming Ma, Furu Wei

Figure 1 for Kosmos-2: Grounding Multimodal Large Language Models to the World
Figure 2 for Kosmos-2: Grounding Multimodal Large Language Models to the World
Figure 3 for Kosmos-2: Grounding Multimodal Large Language Models to the World
Figure 4 for Kosmos-2: Grounding Multimodal Large Language Models to the World
Viaarxiv icon

Animate-A-Story: Storytelling with Retrieval-Augmented Video Generation

Jul 13, 2023
Yingqing He, Menghan Xia, Haoxin Chen, Xiaodong Cun, Yuan Gong, Jinbo Xing, Yong Zhang, Xintao Wang, Chao Weng, Ying Shan, Qifeng Chen

Figure 1 for Animate-A-Story: Storytelling with Retrieval-Augmented Video Generation
Figure 2 for Animate-A-Story: Storytelling with Retrieval-Augmented Video Generation
Figure 3 for Animate-A-Story: Storytelling with Retrieval-Augmented Video Generation
Figure 4 for Animate-A-Story: Storytelling with Retrieval-Augmented Video Generation
Viaarxiv icon

ARC-NLP at PAN 2023: Transition-Focused Natural Language Inference for Writing Style Detection

Jul 27, 2023
Izzet Emre Kucukkaya, Umitcan Sahin, Cagri Toraman

Figure 1 for ARC-NLP at PAN 2023: Transition-Focused Natural Language Inference for Writing Style Detection
Figure 2 for ARC-NLP at PAN 2023: Transition-Focused Natural Language Inference for Writing Style Detection
Figure 3 for ARC-NLP at PAN 2023: Transition-Focused Natural Language Inference for Writing Style Detection
Viaarxiv icon

X-IQE: eXplainable Image Quality Evaluation for Text-to-Image Generation with Visual Large Language Models

May 18, 2023
Yixiong Chen

Figure 1 for X-IQE: eXplainable Image Quality Evaluation for Text-to-Image Generation with Visual Large Language Models
Figure 2 for X-IQE: eXplainable Image Quality Evaluation for Text-to-Image Generation with Visual Large Language Models
Figure 3 for X-IQE: eXplainable Image Quality Evaluation for Text-to-Image Generation with Visual Large Language Models
Figure 4 for X-IQE: eXplainable Image Quality Evaluation for Text-to-Image Generation with Visual Large Language Models
Viaarxiv icon

Language Models for German Text Simplification: Overcoming Parallel Data Scarcity through Style-specific Pre-training

May 22, 2023
Miriam Anschütz, Joshua Oehms, Thomas Wimmer, Bartłomiej Jezierski, Georg Groh

Figure 1 for Language Models for German Text Simplification: Overcoming Parallel Data Scarcity through Style-specific Pre-training
Figure 2 for Language Models for German Text Simplification: Overcoming Parallel Data Scarcity through Style-specific Pre-training
Figure 3 for Language Models for German Text Simplification: Overcoming Parallel Data Scarcity through Style-specific Pre-training
Figure 4 for Language Models for German Text Simplification: Overcoming Parallel Data Scarcity through Style-specific Pre-training
Viaarxiv icon

Artificial Artificial Artificial Intelligence: Crowd Workers Widely Use Large Language Models for Text Production Tasks

Jun 13, 2023
Veniamin Veselovsky, Manoel Horta Ribeiro, Robert West

Figure 1 for Artificial Artificial Artificial Intelligence: Crowd Workers Widely Use Large Language Models for Text Production Tasks
Figure 2 for Artificial Artificial Artificial Intelligence: Crowd Workers Widely Use Large Language Models for Text Production Tasks
Figure 3 for Artificial Artificial Artificial Intelligence: Crowd Workers Widely Use Large Language Models for Text Production Tasks
Figure 4 for Artificial Artificial Artificial Intelligence: Crowd Workers Widely Use Large Language Models for Text Production Tasks
Viaarxiv icon

(Ab)using Images and Sounds for Indirect Instruction Injection in Multi-Modal LLMs

Jul 24, 2023
Eugene Bagdasaryan, Tsung-Yin Hsieh, Ben Nassi, Vitaly Shmatikov

Figure 1 for (Ab)using Images and Sounds for Indirect Instruction Injection in Multi-Modal LLMs
Figure 2 for (Ab)using Images and Sounds for Indirect Instruction Injection in Multi-Modal LLMs
Figure 3 for (Ab)using Images and Sounds for Indirect Instruction Injection in Multi-Modal LLMs
Figure 4 for (Ab)using Images and Sounds for Indirect Instruction Injection in Multi-Modal LLMs
Viaarxiv icon