Alert button
Picture for Nikolay Bogoychev

Nikolay Bogoychev

Alert button

OpusCleaner and OpusTrainer, open source toolkits for training Machine Translation and Large language models

Add code
Bookmark button
Alert button
Nov 24, 2023
Nikolay Bogoychev, Jelmer van der Linde, Graeme Nail, Barry Haddow, Jaume Zaragoza-Bernabeu, Gema Ramírez-Sánchez, Lukas Weymann, Tudor Nicolae Mateiu, Jindřich Helcl, Mikko Aulamo

Viaarxiv icon

Large Language Model Inference with Lexical Shortlisting

Add code
Bookmark button
Alert button
Nov 16, 2023
Nikolay Bogoychev, Pinzhen Chen, Barry Haddow, Alexandra Birch

Viaarxiv icon

Terminology-Aware Translation with Constrained Decoding and Large Language Model Prompting

Add code
Bookmark button
Alert button
Oct 09, 2023
Nikolay Bogoychev, Pinzhen Chen

Figure 1 for Terminology-Aware Translation with Constrained Decoding and Large Language Model Prompting
Figure 2 for Terminology-Aware Translation with Constrained Decoding and Large Language Model Prompting
Figure 3 for Terminology-Aware Translation with Constrained Decoding and Large Language Model Prompting
Figure 4 for Terminology-Aware Translation with Constrained Decoding and Large Language Model Prompting
Viaarxiv icon

Monolingual or Multilingual Instruction Tuning: Which Makes a Better Alpaca

Add code
Bookmark button
Alert button
Sep 16, 2023
Pinzhen Chen, Shaoxiong Ji, Nikolay Bogoychev, Barry Haddow, Kenneth Heafield

Figure 1 for Monolingual or Multilingual Instruction Tuning: Which Makes a Better Alpaca
Figure 2 for Monolingual or Multilingual Instruction Tuning: Which Makes a Better Alpaca
Figure 3 for Monolingual or Multilingual Instruction Tuning: Which Makes a Better Alpaca
Figure 4 for Monolingual or Multilingual Instruction Tuning: Which Makes a Better Alpaca
Viaarxiv icon

An Open Dataset and Model for Language Identification

Add code
Bookmark button
Alert button
May 23, 2023
Laurie Burchell, Alexandra Birch, Nikolay Bogoychev, Kenneth Heafield

Figure 1 for An Open Dataset and Model for Language Identification
Figure 2 for An Open Dataset and Model for Language Identification
Figure 3 for An Open Dataset and Model for Language Identification
Viaarxiv icon

The Edinburgh International Accents of English Corpus: Towards the Democratization of English ASR

Add code
Bookmark button
Alert button
Mar 31, 2023
Ramon Sanabria, Nikolay Bogoychev, Nina Markl, Andrea Carmantini, Ondrej Klejch, Peter Bell

Figure 1 for The Edinburgh International Accents of English Corpus: Towards the Democratization of English ASR
Figure 2 for The Edinburgh International Accents of English Corpus: Towards the Democratization of English ASR
Figure 3 for The Edinburgh International Accents of English Corpus: Towards the Democratization of English ASR
Viaarxiv icon

Low-Rank Softmax Can Have Unargmaxable Classes in Theory but Rarely in Practice

Add code
Bookmark button
Alert button
Mar 21, 2022
Andreas Grivas, Nikolay Bogoychev, Adam Lopez

Figure 1 for Low-Rank Softmax Can Have Unargmaxable Classes in Theory but Rarely in Practice
Figure 2 for Low-Rank Softmax Can Have Unargmaxable Classes in Theory but Rarely in Practice
Figure 3 for Low-Rank Softmax Can Have Unargmaxable Classes in Theory but Rarely in Practice
Figure 4 for Low-Rank Softmax Can Have Unargmaxable Classes in Theory but Rarely in Practice
Viaarxiv icon

TranslateLocally: Blazing-fast translation running on the local CPU

Add code
Bookmark button
Alert button
Sep 21, 2021
Nikolay Bogoychev, Jelmer Van der Linde, Kenneth Heafield

Figure 1 for TranslateLocally: Blazing-fast translation running on the local CPU
Figure 2 for TranslateLocally: Blazing-fast translation running on the local CPU
Figure 3 for TranslateLocally: Blazing-fast translation running on the local CPU
Figure 4 for TranslateLocally: Blazing-fast translation running on the local CPU
Viaarxiv icon

Decoding Time Lexical Domain Adaptationfor Neural Machine Translation

Add code
Bookmark button
Alert button
Jan 02, 2021
Nikolay Bogoychev, Pinzhen Chen

Figure 1 for Decoding Time Lexical Domain Adaptationfor Neural Machine Translation
Figure 2 for Decoding Time Lexical Domain Adaptationfor Neural Machine Translation
Figure 3 for Decoding Time Lexical Domain Adaptationfor Neural Machine Translation
Figure 4 for Decoding Time Lexical Domain Adaptationfor Neural Machine Translation
Viaarxiv icon