Ivan Rodkin

Diagonal Batching Unlocks Parallelism in Recurrent Memory Transformers for Long Contexts

Jun 05, 2025
Via arXiv

Associative Recurrent Memory Transformer

Jul 05, 2024
Via arXiv

BABILong: Testing the Limits of LLMs with Long Context Reasoning-in-a-Haystack

Jun 14, 2024
Via arXiv