Picture for Sampo Pyysalo

Sampo Pyysalo

LIPN

Poro 34B and the Blessing of Multilinguality

Add code
Apr 02, 2024
Viaarxiv icon

Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order

Add code
Mar 30, 2024
Viaarxiv icon

A New Massive Multilingual Dataset for High-Performance Language Technologies

Add code
Mar 20, 2024
Figure 1 for A New Massive Multilingual Dataset for High-Performance Language Technologies
Figure 2 for A New Massive Multilingual Dataset for High-Performance Language Technologies
Figure 3 for A New Massive Multilingual Dataset for High-Performance Language Technologies
Figure 4 for A New Massive Multilingual Dataset for High-Performance Language Technologies
Viaarxiv icon

FinGPT: Large Generative Models for a Small Language

Add code
Nov 03, 2023
Figure 1 for FinGPT: Large Generative Models for a Small Language
Figure 2 for FinGPT: Large Generative Models for a Small Language
Figure 3 for FinGPT: Large Generative Models for a Small Language
Figure 4 for FinGPT: Large Generative Models for a Small Language
Viaarxiv icon

Scaling Data-Constrained Language Models

Add code
May 25, 2023
Figure 1 for Scaling Data-Constrained Language Models
Figure 2 for Scaling Data-Constrained Language Models
Figure 3 for Scaling Data-Constrained Language Models
Figure 4 for Scaling Data-Constrained Language Models
Viaarxiv icon

Silver Syntax Pre-training for Cross-Domain Relation Extraction

Add code
May 18, 2023
Figure 1 for Silver Syntax Pre-training for Cross-Domain Relation Extraction
Figure 2 for Silver Syntax Pre-training for Cross-Domain Relation Extraction
Figure 3 for Silver Syntax Pre-training for Cross-Domain Relation Extraction
Figure 4 for Silver Syntax Pre-training for Cross-Domain Relation Extraction
Viaarxiv icon

Multi-CrossRE A Multi-Lingual Multi-Domain Dataset for Relation Extraction

Add code
May 18, 2023
Figure 1 for Multi-CrossRE A Multi-Lingual Multi-Domain Dataset for Relation Extraction
Figure 2 for Multi-CrossRE A Multi-Lingual Multi-Domain Dataset for Relation Extraction
Figure 3 for Multi-CrossRE A Multi-Lingual Multi-Domain Dataset for Relation Extraction
Figure 4 for Multi-CrossRE A Multi-Lingual Multi-Domain Dataset for Relation Extraction
Viaarxiv icon

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Add code
Nov 09, 2022
Viaarxiv icon

Explaining Classes through Word Attribution

Add code
Aug 31, 2021
Figure 1 for Explaining Classes through Word Attribution
Figure 2 for Explaining Classes through Word Attribution
Viaarxiv icon

Quantitative Evaluation of Alternative Translations in a Corpus of Highly Dissimilar Finnish Paraphrases

Add code
May 06, 2021
Figure 1 for Quantitative Evaluation of Alternative Translations in a Corpus of Highly Dissimilar Finnish Paraphrases
Figure 2 for Quantitative Evaluation of Alternative Translations in a Corpus of Highly Dissimilar Finnish Paraphrases
Figure 3 for Quantitative Evaluation of Alternative Translations in a Corpus of Highly Dissimilar Finnish Paraphrases
Figure 4 for Quantitative Evaluation of Alternative Translations in a Corpus of Highly Dissimilar Finnish Paraphrases
Viaarxiv icon