Picture for Edoardo Cetin

Edoardo Cetin

Reinforcement Learning Teachers of Test Time Scaling

Add code
Jun 10, 2025
Viaarxiv icon

Text-to-LoRA: Instant Transformer Adaption

Add code
Jun 06, 2025
Viaarxiv icon

Sudoku-Bench: Evaluating creative reasoning with Sudoku variants

Add code
May 22, 2025
Viaarxiv icon

Large Language Models to Diffusion Finetuning

Add code
Jan 27, 2025
Viaarxiv icon

$\text{Transformer}^2$: Self-adaptive LLMs

Add code
Jan 14, 2025
Viaarxiv icon

Finer Behavioral Foundation Models via Auto-Regressive Features and Advantage Weighting

Add code
Dec 05, 2024
Figure 1 for Finer Behavioral Foundation Models via Auto-Regressive Features and Advantage Weighting
Figure 2 for Finer Behavioral Foundation Models via Auto-Regressive Features and Advantage Weighting
Figure 3 for Finer Behavioral Foundation Models via Auto-Regressive Features and Advantage Weighting
Figure 4 for Finer Behavioral Foundation Models via Auto-Regressive Features and Advantage Weighting
Viaarxiv icon

An Evolved Universal Transformer Memory

Add code
Oct 17, 2024
Figure 1 for An Evolved Universal Transformer Memory
Figure 2 for An Evolved Universal Transformer Memory
Figure 3 for An Evolved Universal Transformer Memory
Figure 4 for An Evolved Universal Transformer Memory
Viaarxiv icon

Simple Ingredients for Offline Reinforcement Learning

Add code
Mar 19, 2024
Figure 1 for Simple Ingredients for Offline Reinforcement Learning
Figure 2 for Simple Ingredients for Offline Reinforcement Learning
Figure 3 for Simple Ingredients for Offline Reinforcement Learning
Figure 4 for Simple Ingredients for Offline Reinforcement Learning
Viaarxiv icon

Policy Gradient With Serial Markov Chain Reasoning

Add code
Oct 13, 2022
Figure 1 for Policy Gradient With Serial Markov Chain Reasoning
Figure 2 for Policy Gradient With Serial Markov Chain Reasoning
Figure 3 for Policy Gradient With Serial Markov Chain Reasoning
Figure 4 for Policy Gradient With Serial Markov Chain Reasoning
Viaarxiv icon

Hyperbolic Deep Reinforcement Learning

Add code
Oct 04, 2022
Figure 1 for Hyperbolic Deep Reinforcement Learning
Figure 2 for Hyperbolic Deep Reinforcement Learning
Figure 3 for Hyperbolic Deep Reinforcement Learning
Figure 4 for Hyperbolic Deep Reinforcement Learning
Viaarxiv icon