Picture for Nikolay Malkin

Nikolay Malkin

A Comedy of Estimators: On KL Regularization in RL Training of LLMs

Add code
Dec 26, 2025
Viaarxiv icon

Forgetting is Everywhere

Add code
Nov 06, 2025
Viaarxiv icon

Recursive Self-Aggregation Unlocks Deep Thinking in Large Language Models

Add code
Sep 30, 2025
Viaarxiv icon

Fast Flow-based Visuomotor Policies via Conditional Optimal Transport Couplings

Add code
May 02, 2025
Viaarxiv icon

Solving Bayesian inverse problems with diffusion priors and off-policy RL

Add code
Mar 12, 2025
Viaarxiv icon

Learning Decision Trees as Amortized Structure Inference

Add code
Mar 10, 2025
Viaarxiv icon

In-Context Parametric Inference: Point or Distribution Estimators?

Add code
Feb 17, 2025
Viaarxiv icon

Outsourced diffusion sampling: Efficient posterior inference in latent spaces of generative models

Add code
Feb 10, 2025
Figure 1 for Outsourced diffusion sampling: Efficient posterior inference in latent spaces of generative models
Figure 2 for Outsourced diffusion sampling: Efficient posterior inference in latent spaces of generative models
Figure 3 for Outsourced diffusion sampling: Efficient posterior inference in latent spaces of generative models
Figure 4 for Outsourced diffusion sampling: Efficient posterior inference in latent spaces of generative models
Viaarxiv icon

From discrete-time policies to continuous-time diffusion samplers: Asymptotic equivalences and faster training

Add code
Jan 10, 2025
Viaarxiv icon

Mixtures of In-Context Learners

Add code
Nov 05, 2024
Figure 1 for Mixtures of In-Context Learners
Figure 2 for Mixtures of In-Context Learners
Figure 3 for Mixtures of In-Context Learners
Figure 4 for Mixtures of In-Context Learners
Viaarxiv icon