Picture for Razvan Pascanu

Razvan Pascanu

Google DeepMind

The Illusion of Stochasticity in LLMs

Add code
Apr 08, 2026
Viaarxiv icon

Understanding Performance Gap Between Parallel and Sequential Sampling in Large Reasoning Models

Add code
Apr 07, 2026
Viaarxiv icon

Mining Generalizable Activation Functions

Add code
Feb 05, 2026
Viaarxiv icon

Perplexity Cannot Always Tell Right from Wrong

Add code
Jan 30, 2026
Viaarxiv icon

MS-SSM: A Multi-Scale State Space Model for Efficient Sequence Modeling

Add code
Dec 29, 2025
Viaarxiv icon

Fine-Tuned In-Context Learners for Efficient Adaptation

Add code
Dec 22, 2025
Viaarxiv icon

What Can Grokking Teach Us About Learning Under Nonstationarity?

Add code
Jul 26, 2025
Viaarxiv icon

Optimizers Qualitatively Alter Solutions And We Should Leverage This

Add code
Jul 16, 2025
Viaarxiv icon

Meta-learning how to Share Credit among Macro-Actions

Add code
Jun 16, 2025
Viaarxiv icon

MesaNet: Sequence Modeling by Locally Optimal Test-Time Training

Add code
Jun 05, 2025
Viaarxiv icon