Alert button

"Text": models, code, and papers
Alert button

Text-Driven Foley Sound Generation With Latent Diffusion Model

Jun 17, 2023
Yi Yuan, Haohe Liu, Xubo Liu, Xiyuan Kang, Peipei Wu, Mark D. Plumbley, Wenwu Wang

Figure 1 for Text-Driven Foley Sound Generation With Latent Diffusion Model
Figure 2 for Text-Driven Foley Sound Generation With Latent Diffusion Model
Figure 3 for Text-Driven Foley Sound Generation With Latent Diffusion Model
Figure 4 for Text-Driven Foley Sound Generation With Latent Diffusion Model
Viaarxiv icon

Language Prompt for Autonomous Driving

Sep 08, 2023
Dongming Wu, Wencheng Han, Tiancai Wang, Yingfei Liu, Xiangyu Zhang, Jianbing Shen

Figure 1 for Language Prompt for Autonomous Driving
Figure 2 for Language Prompt for Autonomous Driving
Figure 3 for Language Prompt for Autonomous Driving
Figure 4 for Language Prompt for Autonomous Driving
Viaarxiv icon

Classification of Human- and AI-Generated Texts: Investigating Features for ChatGPT

Aug 10, 2023
Lorenz Mindner, Tim Schlippe, Kristina Schaaff

Viaarxiv icon

PREADD: Prefix-Adaptive Decoding for Controlled Text Generation

Jul 06, 2023
Jonathan Pei, Kevin Yang, Dan Klein

Figure 1 for PREADD: Prefix-Adaptive Decoding for Controlled Text Generation
Figure 2 for PREADD: Prefix-Adaptive Decoding for Controlled Text Generation
Figure 3 for PREADD: Prefix-Adaptive Decoding for Controlled Text Generation
Figure 4 for PREADD: Prefix-Adaptive Decoding for Controlled Text Generation
Viaarxiv icon

Extracting Mathematical Concepts with Large Language Models

Aug 29, 2023
Valeria de Paiva, Qiyue Gao, Pavel Kovalev, Lawrence S. Moss

Viaarxiv icon

Augmenters at SemEval-2023 Task 1: Enhancing CLIP in Handling Compositionality and Ambiguity for Zero-Shot Visual WSD through Prompt Augmentation and Text-To-Image Diffusion

Jul 09, 2023
Jie S. Li, Yow-Ting Shiue, Yong-Siang Shih, Jonas Geiping

Figure 1 for Augmenters at SemEval-2023 Task 1: Enhancing CLIP in Handling Compositionality and Ambiguity for Zero-Shot Visual WSD through Prompt Augmentation and Text-To-Image Diffusion
Figure 2 for Augmenters at SemEval-2023 Task 1: Enhancing CLIP in Handling Compositionality and Ambiguity for Zero-Shot Visual WSD through Prompt Augmentation and Text-To-Image Diffusion
Figure 3 for Augmenters at SemEval-2023 Task 1: Enhancing CLIP in Handling Compositionality and Ambiguity for Zero-Shot Visual WSD through Prompt Augmentation and Text-To-Image Diffusion
Figure 4 for Augmenters at SemEval-2023 Task 1: Enhancing CLIP in Handling Compositionality and Ambiguity for Zero-Shot Visual WSD through Prompt Augmentation and Text-To-Image Diffusion
Viaarxiv icon

CLIPSonic: Text-to-Audio Synthesis with Unlabeled Videos and Pretrained Language-Vision Models

Jun 16, 2023
Hao-Wen Dong, Xiaoyu Liu, Jordi Pons, Gautam Bhattacharya, Santiago Pascual, Joan Serrà, Taylor Berg-Kirkpatrick, Julian McAuley

Figure 1 for CLIPSonic: Text-to-Audio Synthesis with Unlabeled Videos and Pretrained Language-Vision Models
Figure 2 for CLIPSonic: Text-to-Audio Synthesis with Unlabeled Videos and Pretrained Language-Vision Models
Figure 3 for CLIPSonic: Text-to-Audio Synthesis with Unlabeled Videos and Pretrained Language-Vision Models
Figure 4 for CLIPSonic: Text-to-Audio Synthesis with Unlabeled Videos and Pretrained Language-Vision Models
Viaarxiv icon

Age Recommendation from Texts and Sentences for Children

Aug 21, 2023
Rashedur Rahman, Gwénolé Lecorvé, Nicolas Béchet

Figure 1 for Age Recommendation from Texts and Sentences for Children
Figure 2 for Age Recommendation from Texts and Sentences for Children
Figure 3 for Age Recommendation from Texts and Sentences for Children
Figure 4 for Age Recommendation from Texts and Sentences for Children
Viaarxiv icon

Bridging the KB-Text Gap: Leveraging Structured Knowledge-aware Pre-training for KBQA

Aug 28, 2023
Guanting Dong, Rumei Li, Sirui Wang, Yupeng Zhang, Yunsen Xian, Weiran Xu

Figure 1 for Bridging the KB-Text Gap: Leveraging Structured Knowledge-aware Pre-training for KBQA
Figure 2 for Bridging the KB-Text Gap: Leveraging Structured Knowledge-aware Pre-training for KBQA
Figure 3 for Bridging the KB-Text Gap: Leveraging Structured Knowledge-aware Pre-training for KBQA
Figure 4 for Bridging the KB-Text Gap: Leveraging Structured Knowledge-aware Pre-training for KBQA
Viaarxiv icon

Quilt-1M: One Million Image-Text Pairs for Histopathology

Jun 22, 2023
Wisdom Oluchi Ikezogwo, Mehmet Saygin Seyfioglu, Fatemeh Ghezloo, Dylan Stefan Chan Geva, Fatwir Sheikh Mohammed, Pavan Kumar Anand, Ranjay Krishna, Linda Shapiro

Figure 1 for Quilt-1M: One Million Image-Text Pairs for Histopathology
Figure 2 for Quilt-1M: One Million Image-Text Pairs for Histopathology
Figure 3 for Quilt-1M: One Million Image-Text Pairs for Histopathology
Figure 4 for Quilt-1M: One Million Image-Text Pairs for Histopathology
Viaarxiv icon