"Text": models, code, and papers

Reinforcement Learning for Generative AI: A Survey

Aug 29, 2023
Yuanjiang Cao, Quan Z. Sheng, Julian McAuley, Lina Yao

Via arXiv

Advancing Natural-Language Based Audio Retrieval with PaSST and Large Audio-Caption Data Sets

Aug 08, 2023
Paul Primus, Khaled Koutini, Gerhard Widmer

Via arXiv

Homological Convolutional Neural Networks

Aug 26, 2023
Antonio Briola, Yuanrong Wang, Silvia Bartolucci, Tomaso Aste

Via arXiv

Let's Give a Voice to Conversational Agents in Virtual Reality

Aug 04, 2023
Michele Yin, Gabriel Roccabruna, Abhinav Azad, Giuseppe Riccardi

Via arXiv

Learning the Visualness of Text Using Large Vision-Language Models

May 11, 2023
Gaurav Verma, Ryan A. Rossi, Christopher Tensmeyer, Jiuxiang Gu, Ani Nenkova

Via arXiv

Rank-Aware Negative Training for Semi-Supervised Text Classification

Jun 13, 2023
Ahmed Murtadha, Shengfeng Pan, Wen Bo, Jianlin Su, Xinxin Cao, Wenze Zhang, Yunfeng Liu

Via arXiv

Uncertainty Estimation of Transformers' Predictions via Topological Analysis of the Attention Matrices

Aug 22, 2023
Elizaveta Kostenok, Daniil Cherniavskii, Alexey Zaytsev

Via arXiv

bbOCR: An Open-source Multi-domain OCR Pipeline for Bengali Documents

Aug 22, 2023
Imam Mohammad Zulkarnain, Shayekh Bin Islam, Md. Zami Al Zunaed Farabe, Md. Mehedi Hasan Shawon, Jawaril Munshad Abedin, Beig Rajibul Hasan, Marsia Haque, Istiak Shihab, Syed Mobassir, MD. Nazmuddoha Ansary, Asif Sushmit, Farig Sadeque

Via arXiv

An Overview of Challenges in Egocentric Text-Video Retrieval

Jun 07, 2023
Burak Satar, Hongyuan Zhu, Hanwang Zhang, Joo Hwee Lim

Via arXiv

Imbalanced Multi-label Classification for Business-related Text with Moderately Large Label Spaces

Jun 12, 2023
Muhammad Arslan, Christophe Cruz

Via arXiv