Alert button

"Text": models, code, and papers
Alert button

Video + CLIP Baseline for Ego4D Long-term Action Anticipation

Jul 01, 2022
Srijan Das, Michael S. Ryoo

Figure 1 for Video + CLIP Baseline for Ego4D Long-term Action Anticipation
Figure 2 for Video + CLIP Baseline for Ego4D Long-term Action Anticipation
Figure 3 for Video + CLIP Baseline for Ego4D Long-term Action Anticipation
Viaarxiv icon

I3CL:Intra- and Inter-Instance Collaborative Learning for Arbitrary-shaped Scene Text Detection

Aug 16, 2021
Jian Ye, Jing Zhang, Juhua Liu, Bo Du, Dacheng Tao

Figure 1 for I3CL:Intra- and Inter-Instance Collaborative Learning for Arbitrary-shaped Scene Text Detection
Figure 2 for I3CL:Intra- and Inter-Instance Collaborative Learning for Arbitrary-shaped Scene Text Detection
Figure 3 for I3CL:Intra- and Inter-Instance Collaborative Learning for Arbitrary-shaped Scene Text Detection
Figure 4 for I3CL:Intra- and Inter-Instance Collaborative Learning for Arbitrary-shaped Scene Text Detection
Viaarxiv icon

Text AutoAugment: Learning Compositional Augmentation Policy for Text Classification

Sep 01, 2021
Shuhuai Ren, Jinchao Zhang, Lei Li, Xu Sun, Jie Zhou

Figure 1 for Text AutoAugment: Learning Compositional Augmentation Policy for Text Classification
Figure 2 for Text AutoAugment: Learning Compositional Augmentation Policy for Text Classification
Figure 3 for Text AutoAugment: Learning Compositional Augmentation Policy for Text Classification
Figure 4 for Text AutoAugment: Learning Compositional Augmentation Policy for Text Classification
Viaarxiv icon

BibleTTS: a large, high-fidelity, multilingual, and uniquely African speech corpus

Jul 07, 2022
Josh Meyer, David Ifeoluwa Adelani, Edresson Casanova, Alp Öktem, Daniel Whitenack Julian Weber, Salomon Kabongo, Elizabeth Salesky, Iroro Orife, Colin Leong, Perez Ogayo, Chris Emezue, Jonathan Mukiibi, Salomey Osei, Apelete Agbolo, Victor Akinode, Bernard Opoku, Samuel Olanrewaju, Jesujoba Alabi, Shamsuddeen Muhammad

Figure 1 for BibleTTS: a large, high-fidelity, multilingual, and uniquely African speech corpus
Figure 2 for BibleTTS: a large, high-fidelity, multilingual, and uniquely African speech corpus
Figure 3 for BibleTTS: a large, high-fidelity, multilingual, and uniquely African speech corpus
Figure 4 for BibleTTS: a large, high-fidelity, multilingual, and uniquely African speech corpus
Viaarxiv icon

Efficient Training of Language Models to Fill in the Middle

Jul 28, 2022
Mohammad Bavarian, Heewoo Jun, Nikolas Tezak, John Schulman, Christine McLeavey, Jerry Tworek, Mark Chen

Figure 1 for Efficient Training of Language Models to Fill in the Middle
Figure 2 for Efficient Training of Language Models to Fill in the Middle
Figure 3 for Efficient Training of Language Models to Fill in the Middle
Figure 4 for Efficient Training of Language Models to Fill in the Middle
Viaarxiv icon

Cross-speaker style transfer for text-to-speech using data augmentation

Feb 10, 2022
Manuel Sam Ribeiro, Julian Roth, Giulia Comini, Goeric Huybrechts, Adam Gabrys, Jaime Lorenzo-Trueba

Figure 1 for Cross-speaker style transfer for text-to-speech using data augmentation
Figure 2 for Cross-speaker style transfer for text-to-speech using data augmentation
Figure 3 for Cross-speaker style transfer for text-to-speech using data augmentation
Figure 4 for Cross-speaker style transfer for text-to-speech using data augmentation
Viaarxiv icon

Open-Vocabulary Multi-Label Classification via Multi-modal Knowledge Transfer

Jul 05, 2022
Sunan He, Taian Guo, Tao Dai, Ruizhi Qiao, Bo Ren, Shu-Tao Xia

Figure 1 for Open-Vocabulary Multi-Label Classification via Multi-modal Knowledge Transfer
Figure 2 for Open-Vocabulary Multi-Label Classification via Multi-modal Knowledge Transfer
Figure 3 for Open-Vocabulary Multi-Label Classification via Multi-modal Knowledge Transfer
Figure 4 for Open-Vocabulary Multi-Label Classification via Multi-modal Knowledge Transfer
Viaarxiv icon

PP-OCRv3: More Attempts for the Improvement of Ultra Lightweight OCR System

Jun 07, 2022
Chenxia Li, Weiwei Liu, Ruoyu Guo, Xiaoting Yin, Kaitao Jiang, Yongkun Du, Yuning Du, Lingfeng Zhu, Baohua Lai, Xiaoguang Hu, Dianhai Yu, Yanjun Ma

Figure 1 for PP-OCRv3: More Attempts for the Improvement of Ultra Lightweight OCR System
Figure 2 for PP-OCRv3: More Attempts for the Improvement of Ultra Lightweight OCR System
Figure 3 for PP-OCRv3: More Attempts for the Improvement of Ultra Lightweight OCR System
Figure 4 for PP-OCRv3: More Attempts for the Improvement of Ultra Lightweight OCR System
Viaarxiv icon

Vocabulary Transfer for Medical Texts

Aug 04, 2022
Vladislav D. Mosin, Ivan P. Yamshchikov

Figure 1 for Vocabulary Transfer for Medical Texts
Figure 2 for Vocabulary Transfer for Medical Texts
Figure 3 for Vocabulary Transfer for Medical Texts
Figure 4 for Vocabulary Transfer for Medical Texts
Viaarxiv icon

R2D2: Relational Text Decoding with Transformers

May 10, 2021
Aryan Arbabi, Mingqiu Wang, Laurent El Shafey, Nan Du, Izhak Shafran

Figure 1 for R2D2: Relational Text Decoding with Transformers
Figure 2 for R2D2: Relational Text Decoding with Transformers
Figure 3 for R2D2: Relational Text Decoding with Transformers
Figure 4 for R2D2: Relational Text Decoding with Transformers
Viaarxiv icon