Alert button

"Text": models, code, and papers
Alert button

V2Meow: Meowing to the Visual Beat via Music Generation

May 11, 2023
Kun Su, Judith Yue Li, Qingqing Huang, Dima Kuzmin, Joonseok Lee, Chris Donahue, Fei Sha, Aren Jansen, Yu Wang, Mauro Verzetti, Timo I. Denk

Figure 1 for V2Meow: Meowing to the Visual Beat via Music Generation
Figure 2 for V2Meow: Meowing to the Visual Beat via Music Generation
Figure 3 for V2Meow: Meowing to the Visual Beat via Music Generation
Figure 4 for V2Meow: Meowing to the Visual Beat via Music Generation
Viaarxiv icon

IslamicPCQA: A Dataset for Persian Multi-hop Complex Question Answering in Islamic Text Resources

Apr 23, 2023
Arash Ghafouri, Hasan Naderi, Mohammad Aghajani asl, Mahdi Firouzmandi

Viaarxiv icon

Enhancing Indic Handwritten Text Recognition Using Global Semantic Information

Dec 15, 2022
Ajoy Mondal, C. V. Jawahar

Viaarxiv icon

Plan, Eliminate, and Track -- Language Models are Good Teachers for Embodied Agents

May 07, 2023
Yue Wu, So Yeon Min, Yonatan Bisk, Ruslan Salakhutdinov, Amos Azaria, Yuanzhi Li, Tom Mitchell, Shrimai Prabhumoye

Figure 1 for Plan, Eliminate, and Track -- Language Models are Good Teachers for Embodied Agents
Figure 2 for Plan, Eliminate, and Track -- Language Models are Good Teachers for Embodied Agents
Figure 3 for Plan, Eliminate, and Track -- Language Models are Good Teachers for Embodied Agents
Figure 4 for Plan, Eliminate, and Track -- Language Models are Good Teachers for Embodied Agents
Viaarxiv icon

FrugalGPT: How to Use Large Language Models While Reducing Cost and Improving Performance

May 09, 2023
Lingjiao Chen, Matei Zaharia, James Zou

Figure 1 for FrugalGPT: How to Use Large Language Models While Reducing Cost and Improving Performance
Figure 2 for FrugalGPT: How to Use Large Language Models While Reducing Cost and Improving Performance
Figure 3 for FrugalGPT: How to Use Large Language Models While Reducing Cost and Improving Performance
Figure 4 for FrugalGPT: How to Use Large Language Models While Reducing Cost and Improving Performance
Viaarxiv icon

Prompt as Triggers for Backdoor Attack: Examining the Vulnerability in Language Models

May 09, 2023
Shuai Zhao, Jinming Wen, Luu Anh Tuan, Junbo Zhao, Jie Fu

Figure 1 for Prompt as Triggers for Backdoor Attack: Examining the Vulnerability in Language Models
Figure 2 for Prompt as Triggers for Backdoor Attack: Examining the Vulnerability in Language Models
Figure 3 for Prompt as Triggers for Backdoor Attack: Examining the Vulnerability in Language Models
Figure 4 for Prompt as Triggers for Backdoor Attack: Examining the Vulnerability in Language Models
Viaarxiv icon

Hybrid-SD ($\text{H}_{\text{SD}}$) : A new hybrid evaluation metric for automatic speech recognition tasks

Nov 03, 2022
Zitha Sasindran, Harsha Yelchuri, Supreeth Rao, T. V. Prabhakar

Figure 1 for Hybrid-SD ($\text{H}_{\text{SD}}$) : A new hybrid evaluation metric for automatic speech recognition tasks
Figure 2 for Hybrid-SD ($\text{H}_{\text{SD}}$) : A new hybrid evaluation metric for automatic speech recognition tasks
Figure 3 for Hybrid-SD ($\text{H}_{\text{SD}}$) : A new hybrid evaluation metric for automatic speech recognition tasks
Figure 4 for Hybrid-SD ($\text{H}_{\text{SD}}$) : A new hybrid evaluation metric for automatic speech recognition tasks
Viaarxiv icon

BLAT: Bootstrapping Language-Audio Pre-training based on AudioSet Tag-guided Synthetic Data

Mar 14, 2023
Xuenan Xu, Zhiling Zhang, Zelin Zhou, Pingyue Zhang, Zeyu Xie, Mengyue Wu, Kenny Q. Zhu

Figure 1 for BLAT: Bootstrapping Language-Audio Pre-training based on AudioSet Tag-guided Synthetic Data
Figure 2 for BLAT: Bootstrapping Language-Audio Pre-training based on AudioSet Tag-guided Synthetic Data
Figure 3 for BLAT: Bootstrapping Language-Audio Pre-training based on AudioSet Tag-guided Synthetic Data
Figure 4 for BLAT: Bootstrapping Language-Audio Pre-training based on AudioSet Tag-guided Synthetic Data
Viaarxiv icon

Importance of Synthesizing High-quality Data for Text-to-SQL Parsing

Dec 17, 2022
Yiyun Zhao, Jiarong Jiang, Yiqun Hu, Wuwei Lan, Henry Zhu, Anuj Chauhan, Alexander Li, Lin Pan, Jun Wang, Chung-Wei Hang, Sheng Zhang, Marvin Dong, Joe Lilien, Patrick Ng, Zhiguo Wang, Vittorio Castelli, Bing Xiang

Figure 1 for Importance of Synthesizing High-quality Data for Text-to-SQL Parsing
Figure 2 for Importance of Synthesizing High-quality Data for Text-to-SQL Parsing
Figure 3 for Importance of Synthesizing High-quality Data for Text-to-SQL Parsing
Figure 4 for Importance of Synthesizing High-quality Data for Text-to-SQL Parsing
Viaarxiv icon

Using External Off-Policy Speech-To-Text Mappings in Contextual End-To-End Automated Speech Recognition

Jan 06, 2023
David M. Chan, Shalini Ghosh, Ariya Rastrow, Björn Hoffmeister

Figure 1 for Using External Off-Policy Speech-To-Text Mappings in Contextual End-To-End Automated Speech Recognition
Figure 2 for Using External Off-Policy Speech-To-Text Mappings in Contextual End-To-End Automated Speech Recognition
Figure 3 for Using External Off-Policy Speech-To-Text Mappings in Contextual End-To-End Automated Speech Recognition
Figure 4 for Using External Off-Policy Speech-To-Text Mappings in Contextual End-To-End Automated Speech Recognition
Viaarxiv icon