Vivien Cabannes

Efficient RL Training for LLMs with Experience Replay

Apr 09, 2026
Via arXiv

Automatic Textbook Formalization

Apr 03, 2026
Via arXiv

Provable Benefits of In-Tool Learning for Large Language Models

Aug 28, 2025
Via arXiv

Easing Optimization Paths: a Circuit Perspective

Jan 04, 2025
Via arXiv

Scaling Laws with Hidden Structure

Nov 05, 2024
Via arXiv

A Visual Case Study of the Training Dynamics in Neural Networks

Oct 31, 2024
Via arXiv

$\mathbb{X}$-Sample Contrastive Loss: Improving Contrastive Learning with Sample Similarity Graphs

Jul 25, 2024
Via arXiv

Iteration Head: A Mechanistic Study of Chain-of-Thought

Jun 04, 2024
Via arXiv

Learning Associative Memories with Gradient Descent

Feb 28, 2024
Via arXiv

Mode Estimation with Partial Feedback

Feb 20, 2024
Via arXiv