Picture for Dmitry Sorokin

Dmitry Sorokin

BABILong: Testing the Limits of LLMs with Long Context Reasoning-in-a-Haystack

Add code
Jun 14, 2024
Viaarxiv icon

In Search of Needles in a 11M Haystack: Recurrent Memory Finds What LLMs Miss

Add code
Feb 21, 2024
Viaarxiv icon

TreeDQN: Learning to minimize Branch-and-Bound tree

Add code
Jun 09, 2023
Figure 1 for TreeDQN: Learning to minimize Branch-and-Bound tree
Figure 2 for TreeDQN: Learning to minimize Branch-and-Bound tree
Figure 3 for TreeDQN: Learning to minimize Branch-and-Bound tree
Figure 4 for TreeDQN: Learning to minimize Branch-and-Bound tree
Viaarxiv icon

Insights From the NeurIPS 2021 NetHack Challenge

Add code
Mar 22, 2022
Figure 1 for Insights From the NeurIPS 2021 NetHack Challenge
Figure 2 for Insights From the NeurIPS 2021 NetHack Challenge
Figure 3 for Insights From the NeurIPS 2021 NetHack Challenge
Figure 4 for Insights From the NeurIPS 2021 NetHack Challenge
Viaarxiv icon

Aligning an optical interferometer with beam divergence control and continuous action space

Add code
Jul 09, 2021
Figure 1 for Aligning an optical interferometer with beam divergence control and continuous action space
Figure 2 for Aligning an optical interferometer with beam divergence control and continuous action space
Figure 3 for Aligning an optical interferometer with beam divergence control and continuous action space
Figure 4 for Aligning an optical interferometer with beam divergence control and continuous action space
Viaarxiv icon

Adaptation of Quadruped Robot Locomotion with Meta-Learning

Add code
Jul 08, 2021
Figure 1 for Adaptation of Quadruped Robot Locomotion with Meta-Learning
Figure 2 for Adaptation of Quadruped Robot Locomotion with Meta-Learning
Figure 3 for Adaptation of Quadruped Robot Locomotion with Meta-Learning
Figure 4 for Adaptation of Quadruped Robot Locomotion with Meta-Learning
Viaarxiv icon

Interferobot: aligning an optical interferometer by a reinforcement learning agent

Add code
Jun 03, 2020
Figure 1 for Interferobot: aligning an optical interferometer by a reinforcement learning agent
Figure 2 for Interferobot: aligning an optical interferometer by a reinforcement learning agent
Figure 3 for Interferobot: aligning an optical interferometer by a reinforcement learning agent
Figure 4 for Interferobot: aligning an optical interferometer by a reinforcement learning agent
Viaarxiv icon