"Text": models, code, and papers

CapText: Large Language Model-based Caption Generation From Image Context and Description

Jun 06, 2023
Shinjini Ghosh, Sagnik Anupam

Via arXiv

shs-nlp at RadSum23: Domain-Adaptive Pre-training of Instruction-tuned LLMs for Radiology Report Impression Generation

Jun 05, 2023
Sanjeev Kumar Karn, Rikhiya Ghosh, Kusuma P, Oladimeji Farri

Via arXiv

Large Language Models as Sous Chefs: Revising Recipes with GPT-3

Jun 24, 2023
Alyssa Hwang, Bryan Li, Zhaoyi Hou, Dan Roth

Via arXiv

Incorporating Graph Information in Transformer-based AMR Parsing

Jun 23, 2023
Pavlo Vasylenko, Pere-Lluís Huguet Cabot, Abelardo Carlos Martínez Lorenzo, Roberto Navigli

Via arXiv

Exploiting Summarization Data to Help Text Simplification

Feb 14, 2023
Renliang Sun, Zhixian Yang, Xiaojun Wan

Via arXiv

Text-To-4D Dynamic Scene Generation

Jan 26, 2023
Uriel Singer, Shelly Sheynin, Adam Polyak, Oron Ashual, Iurii Makarov, Filippos Kokkinos, Naman Goyal, Andrea Vedaldi, Devi Parikh, Justin Johnson, Yaniv Taigman

Via arXiv

Sounding Video Generator: A Unified Framework for Text-guided Sounding Video Generation

Mar 29, 2023
Jiawei Liu, Weining Wang, Sihan Chen, Xinxin Zhu, Jing Liu

Via arXiv

A Survey on Audio Diffusion Models: Text To Speech Synthesis and Enhancement in Generative AI

Apr 02, 2023
Chenshuang Zhang, Chaoning Zhang, Sheng Zheng, Mengchun Zhang, Maryam Qamar, Sung-Ho Bae, In So Kweon

Via arXiv

Multi-modal Representation Learning for Social Post Location Inference

Jun 11, 2023
Ruiting Dai, Jiayi Luo, Xucheng Luo, Lisi Mo, Wanlun Ma, Fan Zhou

Via arXiv

Vision + Language Applications: A Survey

May 24, 2023
Yutong Zhou, Nobutaka Shimada

Via arXiv