Leo Gao

An Empirical Exploration in Quality Filtering of Text Data

Sep 02, 2021
Leo Gao

The Pile: An 800GB Dataset of Diverse Text for Language Modeling

Dec 31, 2020
Leo Gao, Stella Biderman, Sid Black, Laurence Golding, Travis Hoppe, Charles Foster, Jason Phang, Horace He, Anish Thite, Noa Nabeshima, Shawn Presser, Connor Leahy

Collaborative Storytelling with Large-scale Neural Language Models

Nov 20, 2020
Eric Nichols, Leo Gao, Randy Gomez
