Alert button
Picture for Brian Roark

Brian Roark

Alert button

XTREME-UP: A User-Centric Scarce-Data Benchmark for Under-Represented Languages

May 24, 2023
Sebastian Ruder, Jonathan H. Clark, Alexander Gutkin, Mihir Kale, Min Ma, Massimo Nicosia, Shruti Rijhwani, Parker Riley, Jean-Michel A. Sarr, Xinyi Wang, John Wieting, Nitish Gupta, Anna Katanova, Christo Kirov, Dana L. Dickinson, Brian Roark, Bidisha Samanta, Connie Tao, David I. Adelani, Vera Axelrod, Isaac Caswell, Colin Cherry, Dan Garrette, Reeve Ingle, Melvin Johnson, Dmitry Panteleev, Partha Talukdar

Figure 1 for XTREME-UP: A User-Centric Scarce-Data Benchmark for Under-Represented Languages
Figure 2 for XTREME-UP: A User-Centric Scarce-Data Benchmark for Under-Represented Languages
Figure 3 for XTREME-UP: A User-Centric Scarce-Data Benchmark for Under-Represented Languages
Figure 4 for XTREME-UP: A User-Centric Scarce-Data Benchmark for Under-Represented Languages
Viaarxiv icon

Spelling convention sensitivity in neural language models

Mar 06, 2023
Elizabeth Nielsen, Christo Kirov, Brian Roark

Figure 1 for Spelling convention sensitivity in neural language models
Figure 2 for Spelling convention sensitivity in neural language models
Figure 3 for Spelling convention sensitivity in neural language models
Figure 4 for Spelling convention sensitivity in neural language models
Viaarxiv icon

Beyond Arabic: Software for Perso-Arabic Script Manipulation

Jan 26, 2023
Alexander Gutkin, Cibu Johny, Raiomond Doctor, Brian Roark, Richard Sproat

Figure 1 for Beyond Arabic: Software for Perso-Arabic Script Manipulation
Figure 2 for Beyond Arabic: Software for Perso-Arabic Script Manipulation
Figure 3 for Beyond Arabic: Software for Perso-Arabic Script Manipulation
Figure 4 for Beyond Arabic: Software for Perso-Arabic Script Manipulation
Viaarxiv icon

Structured abbreviation expansion in context

Oct 04, 2021
Kyle Gorman, Christo Kirov, Brian Roark, Richard Sproat

Figure 1 for Structured abbreviation expansion in context
Figure 2 for Structured abbreviation expansion in context
Figure 3 for Structured abbreviation expansion in context
Figure 4 for Structured abbreviation expansion in context
Viaarxiv icon

Finding Concept-specific Biases in Form--Meaning Associations

Apr 29, 2021
Tiago Pimentel, Brian Roark, Søren Wichmann, Ryan Cotterell, Damián Blasi

Figure 1 for Finding Concept-specific Biases in Form--Meaning Associations
Figure 2 for Finding Concept-specific Biases in Form--Meaning Associations
Figure 3 for Finding Concept-specific Biases in Form--Meaning Associations
Figure 4 for Finding Concept-specific Biases in Form--Meaning Associations
Viaarxiv icon

Disambiguatory Signals are Stronger in Word-initial Positions

Feb 03, 2021
Tiago Pimentel, Ryan Cotterell, Brian Roark

Figure 1 for Disambiguatory Signals are Stronger in Word-initial Positions
Viaarxiv icon

Processing South Asian Languages Written in the Latin Script: the Dakshina Dataset

Jul 02, 2020
Brian Roark, Lawrence Wolf-Sonkin, Christo Kirov, Sabrina J. Mielke, Cibu Johny, Isin Demirsahin, Keith Hall

Figure 1 for Processing South Asian Languages Written in the Latin Script: the Dakshina Dataset
Figure 2 for Processing South Asian Languages Written in the Latin Script: the Dakshina Dataset
Figure 3 for Processing South Asian Languages Written in the Latin Script: the Dakshina Dataset
Figure 4 for Processing South Asian Languages Written in the Latin Script: the Dakshina Dataset
Viaarxiv icon

Phonotactic Complexity and its Trade-offs

May 07, 2020
Tiago Pimentel, Brian Roark, Ryan Cotterell

Viaarxiv icon

Language-agnostic Multilingual Modeling

Apr 20, 2020
Arindrima Datta, Bhuvana Ramabhadran, Jesse Emond, Anjuli Kannan, Brian Roark

Figure 1 for Language-agnostic Multilingual Modeling
Figure 2 for Language-agnostic Multilingual Modeling
Figure 3 for Language-agnostic Multilingual Modeling
Figure 4 for Language-agnostic Multilingual Modeling
Viaarxiv icon