Picture for Ivan Rodkin

Ivan Rodkin

GradMem: Learning to Write Context into Memory with Test-Time Gradient Descent

Add code
Mar 14, 2026
Viaarxiv icon

Diagonal Batching Unlocks Parallelism in Recurrent Memory Transformers for Long Contexts

Add code
Jun 05, 2025
Viaarxiv icon

Associative Recurrent Memory Transformer

Add code
Jul 05, 2024
Figure 1 for Associative Recurrent Memory Transformer
Figure 2 for Associative Recurrent Memory Transformer
Figure 3 for Associative Recurrent Memory Transformer
Figure 4 for Associative Recurrent Memory Transformer
Viaarxiv icon

BABILong: Testing the Limits of LLMs with Long Context Reasoning-in-a-Haystack

Add code
Jun 14, 2024
Figure 1 for BABILong: Testing the Limits of LLMs with Long Context Reasoning-in-a-Haystack
Figure 2 for BABILong: Testing the Limits of LLMs with Long Context Reasoning-in-a-Haystack
Figure 3 for BABILong: Testing the Limits of LLMs with Long Context Reasoning-in-a-Haystack
Figure 4 for BABILong: Testing the Limits of LLMs with Long Context Reasoning-in-a-Haystack
Viaarxiv icon