Alert button

"Text": models, code, and papers
Alert button

Direct Inversion: Boosting Diffusion-based Editing with 3 Lines of Code

Oct 02, 2023
Xuan Ju, Ailing Zeng, Yuxuan Bian, Shaoteng Liu, Qiang Xu

Figure 1 for Direct Inversion: Boosting Diffusion-based Editing with 3 Lines of Code
Figure 2 for Direct Inversion: Boosting Diffusion-based Editing with 3 Lines of Code
Figure 3 for Direct Inversion: Boosting Diffusion-based Editing with 3 Lines of Code
Figure 4 for Direct Inversion: Boosting Diffusion-based Editing with 3 Lines of Code
Viaarxiv icon

A Transformer-based Approach for Arabic Offline Handwritten Text Recognition

Jul 27, 2023
Saleh Momeni, Bagher BabaAli

Figure 1 for A Transformer-based Approach for Arabic Offline Handwritten Text Recognition
Figure 2 for A Transformer-based Approach for Arabic Offline Handwritten Text Recognition
Figure 3 for A Transformer-based Approach for Arabic Offline Handwritten Text Recognition
Figure 4 for A Transformer-based Approach for Arabic Offline Handwritten Text Recognition
Viaarxiv icon

LAE-ST-MoE: Boosted Language-Aware Encoder Using Speech Translation Auxiliary Task for E2E Code-switching ASR

Sep 28, 2023
Guodong Ma, Wenxuan Wang, Yuke Li, Yuting Yang, Binbin Du, Haoran Fu

Figure 1 for LAE-ST-MoE: Boosted Language-Aware Encoder Using Speech Translation Auxiliary Task for E2E Code-switching ASR
Figure 2 for LAE-ST-MoE: Boosted Language-Aware Encoder Using Speech Translation Auxiliary Task for E2E Code-switching ASR
Figure 3 for LAE-ST-MoE: Boosted Language-Aware Encoder Using Speech Translation Auxiliary Task for E2E Code-switching ASR
Figure 4 for LAE-ST-MoE: Boosted Language-Aware Encoder Using Speech Translation Auxiliary Task for E2E Code-switching ASR
Viaarxiv icon

Test-Time Training for Speech

Sep 28, 2023
Sri Harsha Dumpala, Chandramouli Sastry, Sageev Oore

Figure 1 for Test-Time Training for Speech
Figure 2 for Test-Time Training for Speech
Figure 3 for Test-Time Training for Speech
Figure 4 for Test-Time Training for Speech
Viaarxiv icon

Using Large Language Models for Qualitative Analysis can Introduce Serious Bias

Sep 29, 2023
Julian Ashwin, Aditya Chhabra, Vijayendra Rao

Figure 1 for Using Large Language Models for Qualitative Analysis can Introduce Serious Bias
Figure 2 for Using Large Language Models for Qualitative Analysis can Introduce Serious Bias
Figure 3 for Using Large Language Models for Qualitative Analysis can Introduce Serious Bias
Figure 4 for Using Large Language Models for Qualitative Analysis can Introduce Serious Bias
Viaarxiv icon

Synthetic Speech Detection Based on Temporal Consistency and Distribution of Speaker Features

Sep 29, 2023
Yuxiang Zhang, Zhuo Li, Jingze Lu, Wenchao Wang, Pengyuan Zhang

Viaarxiv icon

GAIA-1: A Generative World Model for Autonomous Driving

Sep 29, 2023
Anthony Hu, Lloyd Russell, Hudson Yeo, Zak Murez, George Fedoseev, Alex Kendall, Jamie Shotton, Gianluca Corrado

Figure 1 for GAIA-1: A Generative World Model for Autonomous Driving
Figure 2 for GAIA-1: A Generative World Model for Autonomous Driving
Figure 3 for GAIA-1: A Generative World Model for Autonomous Driving
Figure 4 for GAIA-1: A Generative World Model for Autonomous Driving
Viaarxiv icon

Revolutionizing Mobile Interaction: Enabling a 3 Billion Parameter GPT LLM on Mobile

Sep 29, 2023
Samuel Carreira, Tomás Marques, José Ribeiro, Carlos Grilo

Figure 1 for Revolutionizing Mobile Interaction: Enabling a 3 Billion Parameter GPT LLM on Mobile
Figure 2 for Revolutionizing Mobile Interaction: Enabling a 3 Billion Parameter GPT LLM on Mobile
Figure 3 for Revolutionizing Mobile Interaction: Enabling a 3 Billion Parameter GPT LLM on Mobile
Figure 4 for Revolutionizing Mobile Interaction: Enabling a 3 Billion Parameter GPT LLM on Mobile
Viaarxiv icon

Domain-Controlled Prompt Learning

Sep 30, 2023
Qinglong Cao, Zhengqin Xu, Yuantian Chen, Chao Ma, Xiaokang Yang

Viaarxiv icon

A Brief History of Prompt: Leveraging Language Models

Sep 30, 2023
Golam Md Muktadir

Viaarxiv icon