Picture for Viacheslav Sinii

Viacheslav Sinii

Steering LLM Reasoning Through Bias-Only Adaptation

Add code
May 24, 2025
Viaarxiv icon

You Do Not Fully Utilize Transformer's Representation Capacity

Add code
Feb 13, 2025
Viaarxiv icon

The Differences Between Direct Alignment Algorithms are a Blur

Add code
Feb 03, 2025
Viaarxiv icon

XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning

Add code
Jun 13, 2024
Figure 1 for XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning
Figure 2 for XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning
Figure 3 for XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning
Figure 4 for XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning
Viaarxiv icon

In-Context Reinforcement Learning for Variable Action Spaces

Add code
Dec 20, 2023
Figure 1 for In-Context Reinforcement Learning for Variable Action Spaces
Figure 2 for In-Context Reinforcement Learning for Variable Action Spaces
Viaarxiv icon

XLand-MiniGrid: Scalable Meta-Reinforcement Learning Environments in JAX

Add code
Dec 19, 2023
Viaarxiv icon

Emergence of In-Context Reinforcement Learning from Noise Distillation

Add code
Dec 19, 2023
Viaarxiv icon