"Text": models, code, and papers

CSTR: A Classification Perspective on Scene Text Recognition

Feb 22, 2021
Hongxiang Cai, Jun Sun, Yichao Xiong

EdiTTS: Score-based Editing for Controllable Text-to-Speech

Oct 06, 2021
Jaesung Tae, Hyeongju Kim, Taesu Kim

Simple Open-Vocabulary Object Detection with Vision Transformers

May 12, 2022
Matthias Minderer, Alexey Gritsenko, Austin Stone, Maxim Neumann, Dirk Weissenborn, Alexey Dosovitskiy, Aravindh Mahendran, Anurag Arnab, Mostafa Dehghani, Zhuoran Shen, Xiao Wang, Xiaohua Zhai, Thomas Kipf, Neil Houlsby

Visual Subtitle Feature Enhanced Video Outline Generation

Sep 01, 2022
Qi Lv, Ziqiang Cao, Wenrui Xie, Derui Wang, Jingwen Wang, Zhiwei Hu, Tangkun Zhang, Ba Yuan, Yuanhang Li, Min Cao, Wenjie Li, Sujian Li, Guohong Fu

Transformer over Pre-trained Transformer for Neural Text Segmentation with Enhanced Topic Coherence

Oct 14, 2021
Kelvin Lo, Yuan Jin, Weicong Tan, Ming Liu, Lan Du, Wray Buntine

Towards Robustness of Text-to-SQL Models against Synonym Substitution

Jun 19, 2021
Yujian Gan, Xinyun Chen, Qiuping Huang, Matthew Purver, John R. Woodward, Jinxia Xie, Pengsheng Huang

Unconstrained Text Detection in Manga

Oct 07, 2020
Julián Del Gobbo, Rosana Matuk Herrera

Joint Visual Semantic Reasoning: Multi-Stage Decoder for Text Recognition

Jul 27, 2021
Ayan Kumar Bhunia, Aneeshan Sain, Amandeep Kumar, Shuvozit Ghose, Pinaki Nath Chowdhury, Yi-Zhe Song

Video Activity Localisation with Uncertainties in Temporal Boundary

Jun 26, 2022
Jiabo Huang, Hailin Jin, Shaogang Gong, Yang Liu

Use of Transformer-Based Models for Word-Level Transliteration of the Book of the Dean of Lismore

May 31, 2022
Edward Gow-Smith, Mark McConville, William Gillies, Jade Scott, Roibeard Ó Maolalaigh
