Pedro Javier Ortiz Suárez

ALMAnaCH, SU

Quality at a Glance: An Audit of Web-Crawled Multilingual Datasets

Mar 22, 2021
Isaac Caswell, Julia Kreutzer, Lisa Wang, Ahsan Wahab, Daan van Esch, Nasanbayar Ulzii-Orshikh, Allahsera Tapo, Nishant Subramani, Artem Sokolov, Claytone Sikasote, Monang Setyawan, Supheakmungkol Sarin, Sokhar Samb, Benoît Sagot, Clara Rivera, Annette Rios, Isabel Papadimitriou, Salomey Osei, Pedro Javier Ortiz Suárez, Iroro Orife, Kelechi Ogueji, Rubungo Andre Niyongabo, Toan Q. Nguyen, Mathias Müller, André Müller, Shamsuddeen Hassan Muhammad, Nanda Muhammad, Ayanda Mnyakeni, Jamshidbek Mirzakhalov, Tapiwanashe Matangira, Colin Leong, Nze Lawson, Sneha Kudugunta, Yacine Jernite, Mathias Jenny, Orhan Firat, Bonaventure F. P. Dossou, Sakhile Dlamini, Nisansa de Silva, Sakine Çabuk Ballı, Stella Biderman, Alessia Battisti, Ahmed Baruwa, Ankur Bapna, Pallavi Baljekar, Israel Abebe Azime, Ayodele Awokoya, Duygu Ataman, Orevaoghene Ahia, Oghenefego Ahia, Sweta Agrawal, Mofetoluwa Adeyemi

A Monolingual Approach to Contextualized Word Embeddings for Mid-Resource Languages

Jun 18, 2020
Pedro Javier Ortiz Suárez, Laurent Romary, Benoît Sagot

Establishing a New State-of-the-Art for French Named Entity Recognition

May 27, 2020
Pedro Javier Ortiz Suárez, Yoann Dupont, Benjamin Muller, Laurent Romary, Benoît Sagot

CamemBERT: a Tasty French Language Model

Nov 10, 2019
Louis Martin, Benjamin Muller, Pedro Javier Ortiz Suárez, Yoann Dupont, Laurent Romary, Éric Villemonte de la Clergerie, Djamé Seddah, Benoît Sagot
