Picture for Yuri Kuratov

Yuri Kuratov

Associative Recurrent Memory Transformer

Add code
Jul 05, 2024
Viaarxiv icon

BABILong: Testing the Limits of LLMs with Long Context Reasoning-in-a-Haystack

Add code
Jun 14, 2024
Viaarxiv icon

In Search of Needles in a 11M Haystack: Recurrent Memory Finds What LLMs Miss

Add code
Feb 21, 2024
Viaarxiv icon

Better Together: Enhancing Generative Knowledge Graph Completion with Language Models and Neighborhood Information

Add code
Nov 02, 2023
Viaarxiv icon

Scaling Transformer to 1M tokens and beyond with RMT

Add code
Apr 19, 2023
Figure 1 for Scaling Transformer to 1M tokens and beyond with RMT
Figure 2 for Scaling Transformer to 1M tokens and beyond with RMT
Figure 3 for Scaling Transformer to 1M tokens and beyond with RMT
Figure 4 for Scaling Transformer to 1M tokens and beyond with RMT
Viaarxiv icon

Recurrent Memory Transformer

Add code
Jul 14, 2022
Figure 1 for Recurrent Memory Transformer
Figure 2 for Recurrent Memory Transformer
Figure 3 for Recurrent Memory Transformer
Figure 4 for Recurrent Memory Transformer
Viaarxiv icon

Knowledge Distillation of Russian Language Models with Reduction of Vocabulary

Add code
May 04, 2022
Figure 1 for Knowledge Distillation of Russian Language Models with Reduction of Vocabulary
Figure 2 for Knowledge Distillation of Russian Language Models with Reduction of Vocabulary
Figure 3 for Knowledge Distillation of Russian Language Models with Reduction of Vocabulary
Figure 4 for Knowledge Distillation of Russian Language Models with Reduction of Vocabulary
Viaarxiv icon

Goal-Oriented Multi-Task BERT-Based Dialogue State Tracker

Add code
Feb 05, 2020
Figure 1 for Goal-Oriented Multi-Task BERT-Based Dialogue State Tracker
Figure 2 for Goal-Oriented Multi-Task BERT-Based Dialogue State Tracker
Figure 3 for Goal-Oriented Multi-Task BERT-Based Dialogue State Tracker
Figure 4 for Goal-Oriented Multi-Task BERT-Based Dialogue State Tracker
Viaarxiv icon

Adaptation of Deep Bidirectional Multilingual Transformers for Russian Language

Add code
May 17, 2019
Figure 1 for Adaptation of Deep Bidirectional Multilingual Transformers for Russian Language
Figure 2 for Adaptation of Deep Bidirectional Multilingual Transformers for Russian Language
Figure 3 for Adaptation of Deep Bidirectional Multilingual Transformers for Russian Language
Figure 4 for Adaptation of Deep Bidirectional Multilingual Transformers for Russian Language
Viaarxiv icon