Alert button

"Text": models, code, and papers
Alert button

IteraTTA: An interface for exploring both text prompts and audio priors in generating music with text-to-audio models

Jul 24, 2023
Hiromu Yakura, Masataka Goto

Figure 1 for IteraTTA: An interface for exploring both text prompts and audio priors in generating music with text-to-audio models
Figure 2 for IteraTTA: An interface for exploring both text prompts and audio priors in generating music with text-to-audio models
Figure 3 for IteraTTA: An interface for exploring both text prompts and audio priors in generating music with text-to-audio models
Figure 4 for IteraTTA: An interface for exploring both text prompts and audio priors in generating music with text-to-audio models
Viaarxiv icon

Evaluating Explanation Methods for Vision-and-Language Navigation

Oct 10, 2023
Guanqi Chen, Lei Yang, Guanhua Chen, Jia Pan

Viaarxiv icon

Document-Level Supervision for Multi-Aspect Sentiment Analysis Without Fine-grained Labels

Oct 10, 2023
Kasturi Bhattacharjee, Rashmi Gangadharaiah

Figure 1 for Document-Level Supervision for Multi-Aspect Sentiment Analysis Without Fine-grained Labels
Figure 2 for Document-Level Supervision for Multi-Aspect Sentiment Analysis Without Fine-grained Labels
Figure 3 for Document-Level Supervision for Multi-Aspect Sentiment Analysis Without Fine-grained Labels
Figure 4 for Document-Level Supervision for Multi-Aspect Sentiment Analysis Without Fine-grained Labels
Viaarxiv icon

Words into Action: Learning Diverse Humanoid Robot Behaviors using Language Guided Iterative Motion Refinement

Oct 10, 2023
K. Niranjan Kumar, Irfan Essa, Sehoon Ha

Viaarxiv icon

Latent Diffusion Counterfactual Explanations

Oct 10, 2023
Karim Farid, Simon Schrodi, Max Argus, Thomas Brox

Figure 1 for Latent Diffusion Counterfactual Explanations
Figure 2 for Latent Diffusion Counterfactual Explanations
Figure 3 for Latent Diffusion Counterfactual Explanations
Figure 4 for Latent Diffusion Counterfactual Explanations
Viaarxiv icon

On the Depth between Beam Search and Exhaustive Search for Text Generation

Aug 25, 2023
Yuu Jinnai, Tetsuro Morimura, Ukyo Honda

Figure 1 for On the Depth between Beam Search and Exhaustive Search for Text Generation
Figure 2 for On the Depth between Beam Search and Exhaustive Search for Text Generation
Figure 3 for On the Depth between Beam Search and Exhaustive Search for Text Generation
Figure 4 for On the Depth between Beam Search and Exhaustive Search for Text Generation
Viaarxiv icon

Towards Robust Real-Time Scene Text Detection: From Semantic to Instance Representation Learning

Aug 14, 2023
Xugong Qin, Pengyuan Lyu, Chengquan Zhang, Yu Zhou, Kun Yao, Peng Zhang, Hailun Lin, Weiping Wang

Figure 1 for Towards Robust Real-Time Scene Text Detection: From Semantic to Instance Representation Learning
Figure 2 for Towards Robust Real-Time Scene Text Detection: From Semantic to Instance Representation Learning
Figure 3 for Towards Robust Real-Time Scene Text Detection: From Semantic to Instance Representation Learning
Figure 4 for Towards Robust Real-Time Scene Text Detection: From Semantic to Instance Representation Learning
Viaarxiv icon

The Impact of Artificial Intelligence on the Evolution of Digital Education: A Comparative Study of OpenAI Text Generation Tools including ChatGPT, Bing Chat, Bard, and Ernie

Sep 05, 2023
Negin Yazdani Motlagh, Matin Khajavi, Abbas Sharifi, Mohsen Ahmadi

Viaarxiv icon

Training Audio Captioning Models without Audio

Sep 14, 2023
Soham Deshmukh, Benjamin Elizalde, Dimitra Emmanouilidou, Bhiksha Raj, Rita Singh, Huaming Wang

Figure 1 for Training Audio Captioning Models without Audio
Figure 2 for Training Audio Captioning Models without Audio
Figure 3 for Training Audio Captioning Models without Audio
Figure 4 for Training Audio Captioning Models without Audio
Viaarxiv icon

LAVIE: High-Quality Video Generation with Cascaded Latent Diffusion Models

Sep 27, 2023
Yaohui Wang, Xinyuan Chen, Xin Ma, Shangchen Zhou, Ziqi Huang, Yi Wang, Ceyuan Yang, Yinan He, Jiashuo Yu, Peiqing Yang, Yuwei Guo, Tianxing Wu, Chenyang Si, Yuming Jiang, Cunjian Chen, Chen Change Loy, Bo Dai, Dahua Lin, Yu Qiao, Ziwei Liu

Figure 1 for LAVIE: High-Quality Video Generation with Cascaded Latent Diffusion Models
Figure 2 for LAVIE: High-Quality Video Generation with Cascaded Latent Diffusion Models
Figure 3 for LAVIE: High-Quality Video Generation with Cascaded Latent Diffusion Models
Figure 4 for LAVIE: High-Quality Video Generation with Cascaded Latent Diffusion Models
Viaarxiv icon