"Text": models, code, and papers

Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning

Sep 05, 2023
Lili Yu, Bowen Shi, Ramakanth Pasunuru, Benjamin Muller, Olga Golovneva, Tianlu Wang, Arun Babu, Binh Tang, Brian Karrer, Shelly Sheynin, Candace Ross, Adam Polyak, Russell Howes, Vasu Sharma, Puxin Xu, Hovhannes Tamoyan, Oron Ashual, Uriel Singer, Shang-Wen Li, Susan Zhang, Richard James, Gargi Ghosh, Yaniv Taigman, Maryam Fazel-Zarandi, Asli Celikyilmaz, Luke Zettlemoyer, Armen Aghajanyan

Via arXiv

RECAP: Retrieval-Augmented Audio Captioning

Sep 18, 2023
Sreyan Ghosh, Sonal Kumar, Chandra Kiran Reddy Evuru, Ramani Duraiswami, Dinesh Manocha

Via arXiv

Rank Your Summaries: Enhancing Bengali Text Summarization via Ranking-based Approach

Jul 14, 2023
G. M. Shahariar, Tonmoy Talukder, Rafin Alam Khan Sotez, Md. Tanvir Rouf Shawon

Via arXiv

Data Filtering Networks

Oct 02, 2023
Alex Fang, Albin Madappally Jose, Amit Jain, Ludwig Schmidt, Alexander Toshev, Vaishaal Shankar

Via arXiv

CPPF: A contextual and post-processing-free model for automatic speech recognition

Sep 14, 2023
Lei Zhang, Zhengkun Tian, Xiang Chen, Jiaming Sun, Hongyu Xiang, Ke Ding, Guanglu Wan

Via arXiv

Testing of Detection Tools for AI-Generated Text

Jul 10, 2023
Debora Weber-Wulff, Alla Anohina-Naumeca, Sonja Bjelobaba, Tomáš Foltýnek, Jean Guerrero-Dib, Olumide Popoola, Petr Šigut, Lorna Waddington

Via arXiv

FaceCLIPNeRF: Text-driven 3D Face Manipulation using Deformable Neural Radiance Fields

Jul 21, 2023
Sungwon Hwang, Junha Hyung, Daejin Kim, Min-Jung Kim, Jaegul Choo

Via arXiv

Few-shot medical image classification with simple shape and texture text descriptors using vision-language models

Aug 08, 2023
Michal Byra, Muhammad Febrian Rachmadi, Henrik Skibbe

Via arXiv

Faithful Explanations of Black-box NLP Models Using LLM-generated Counterfactuals

Oct 01, 2023
Yair Gat, Nitay Calderon, Amir Feder, Alexander Chapanin, Amit Sharma, Roi Reichart

Via arXiv

Steered Diffusion: A Generalized Framework for Plug-and-Play Conditional Image Synthesis

Sep 30, 2023
Nithin Gopalakrishnan Nair, Anoop Cherian, Suhas Lohit, Ye Wang, Toshiaki Koike-Akino, Vishal M. Patel, Tim K. Marks

Via arXiv