Alert button

"Text": models, code, and papers
Alert button

SantaCoder: don't reach for the stars!

Jan 09, 2023
Loubna Ben Allal, Raymond Li, Denis Kocetkov, Chenghao Mou, Christopher Akiki, Carlos Munoz Ferrandis, Niklas Muennighoff, Mayank Mishra, Alex Gu, Manan Dey, Logesh Kumar Umapathi, Carolyn Jane Anderson, Yangtian Zi, Joel Lamy Poirier, Hailey Schoelkopf, Sergey Troshin, Dmitry Abulkhanov, Manuel Romero, Michael Lappert, Francesco De Toni, Bernardo García del Río, Qian Liu, Shamik Bose, Urvashi Bhattacharyya, Terry Yue Zhuo, Ian Yu, Paulo Villegas, Marco Zocca, Sourab Mangrulkar, David Lansky, Huu Nguyen, Danish Contractor, Luis Villa, Jia Li, Dzmitry Bahdanau, Yacine Jernite, Sean Hughes, Daniel Fried, Arjun Guha, Harm de Vries, Leandro von Werra

Figure 1 for SantaCoder: don't reach for the stars!
Figure 2 for SantaCoder: don't reach for the stars!
Figure 3 for SantaCoder: don't reach for the stars!
Figure 4 for SantaCoder: don't reach for the stars!
Viaarxiv icon

muBoost: An Effective Method for Solving Indic Multilingual Text Classification Problem

Jun 21, 2022
Manish Pathak, Aditya Jain

Figure 1 for muBoost: An Effective Method for Solving Indic Multilingual Text Classification Problem
Figure 2 for muBoost: An Effective Method for Solving Indic Multilingual Text Classification Problem
Figure 3 for muBoost: An Effective Method for Solving Indic Multilingual Text Classification Problem
Figure 4 for muBoost: An Effective Method for Solving Indic Multilingual Text Classification Problem
Viaarxiv icon

The Text Anonymization Benchmark (TAB): A Dedicated Corpus and Evaluation Framework for Text Anonymization

Jan 25, 2022
Ildikó Pilán, Pierre Lison, Lilja Øvrelid, Anthi Papadopoulou, David Sánchez, Montserrat Batet

Figure 1 for The Text Anonymization Benchmark (TAB): A Dedicated Corpus and Evaluation Framework for Text Anonymization
Figure 2 for The Text Anonymization Benchmark (TAB): A Dedicated Corpus and Evaluation Framework for Text Anonymization
Figure 3 for The Text Anonymization Benchmark (TAB): A Dedicated Corpus and Evaluation Framework for Text Anonymization
Figure 4 for The Text Anonymization Benchmark (TAB): A Dedicated Corpus and Evaluation Framework for Text Anonymization
Viaarxiv icon

M3ST: Mix at Three Levels for Speech Translation

Dec 07, 2022
Xuxin Cheng, Qianqian Dong, Fengpeng Yue, Tom Ko, Mingxuan Wang, Yuexian Zou

Figure 1 for M3ST: Mix at Three Levels for Speech Translation
Figure 2 for M3ST: Mix at Three Levels for Speech Translation
Figure 3 for M3ST: Mix at Three Levels for Speech Translation
Figure 4 for M3ST: Mix at Three Levels for Speech Translation
Viaarxiv icon

Text-Based Automatic Personality Prediction Using KGrAt-Net; A Knowledge Graph Attention Network Classifier

May 27, 2022
Majid Ramezani, Mohammad-Reza Feizi-Derakhshi, Mohammad-Ali Balafar

Figure 1 for Text-Based Automatic Personality Prediction Using KGrAt-Net; A Knowledge Graph Attention Network Classifier
Figure 2 for Text-Based Automatic Personality Prediction Using KGrAt-Net; A Knowledge Graph Attention Network Classifier
Figure 3 for Text-Based Automatic Personality Prediction Using KGrAt-Net; A Knowledge Graph Attention Network Classifier
Figure 4 for Text-Based Automatic Personality Prediction Using KGrAt-Net; A Knowledge Graph Attention Network Classifier
Viaarxiv icon

Why do Nearest Neighbor Language Models Work?

Jan 17, 2023
Frank F. Xu, Uri Alon, Graham Neubig

Figure 1 for Why do Nearest Neighbor Language Models Work?
Figure 2 for Why do Nearest Neighbor Language Models Work?
Figure 3 for Why do Nearest Neighbor Language Models Work?
Figure 4 for Why do Nearest Neighbor Language Models Work?
Viaarxiv icon

Speaker consistency loss and step-wise optimization for semi-supervised joint training of TTS and ASR using unpaired text data

Jul 11, 2022
Naoki Makishima, Satoshi Suzuki, Atsushi Ando, Ryo Masumura

Figure 1 for Speaker consistency loss and step-wise optimization for semi-supervised joint training of TTS and ASR using unpaired text data
Figure 2 for Speaker consistency loss and step-wise optimization for semi-supervised joint training of TTS and ASR using unpaired text data
Figure 3 for Speaker consistency loss and step-wise optimization for semi-supervised joint training of TTS and ASR using unpaired text data
Viaarxiv icon

A Survey of Knowledge-Enhanced Pre-trained Language Models

Nov 18, 2022
Linmei Hu, Zeyi Liu, Ziwang Zhao, Lei Hou, Liqiang Nie, Juanzi Li

Figure 1 for A Survey of Knowledge-Enhanced Pre-trained Language Models
Viaarxiv icon

Exploring Discrete Diffusion Models for Image Captioning

Nov 21, 2022
Zixin Zhu, Yixuan Wei, Jianfeng Wang, Zhe Gan, Zheng Zhang, Le Wang, Gang Hua, Lijuan Wang, Zicheng Liu, Han Hu

Figure 1 for Exploring Discrete Diffusion Models for Image Captioning
Figure 2 for Exploring Discrete Diffusion Models for Image Captioning
Figure 3 for Exploring Discrete Diffusion Models for Image Captioning
Figure 4 for Exploring Discrete Diffusion Models for Image Captioning
Viaarxiv icon

Critical Perspectives: A Benchmark Revealing Pitfalls in PerspectiveAPI

Jan 05, 2023
Lorena Piedras, Lucas Rosenblatt, Julia Wilkins

Figure 1 for Critical Perspectives: A Benchmark Revealing Pitfalls in PerspectiveAPI
Figure 2 for Critical Perspectives: A Benchmark Revealing Pitfalls in PerspectiveAPI
Figure 3 for Critical Perspectives: A Benchmark Revealing Pitfalls in PerspectiveAPI
Figure 4 for Critical Perspectives: A Benchmark Revealing Pitfalls in PerspectiveAPI
Viaarxiv icon