Alert button
Picture for Ted Moskovitz

Ted Moskovitz

Alert button

What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation

Add code
Bookmark button
Alert button
Apr 10, 2024
Aaditya K. Singh, Ted Moskovitz, Felix Hill, Stephanie C. Y. Chan, Andrew M. Saxe

Viaarxiv icon

The Transient Nature of Emergent In-Context Learning in Transformers

Add code
Bookmark button
Alert button
Nov 15, 2023
Aaditya K. Singh, Stephanie C. Y. Chan, Ted Moskovitz, Erin Grant, Andrew M. Saxe, Felix Hill

Viaarxiv icon

Confronting Reward Model Overoptimization with Constrained RLHF

Add code
Bookmark button
Alert button
Oct 10, 2023
Ted Moskovitz, Aaditya K. Singh, DJ Strouse, Tuomas Sandholm, Ruslan Salakhutdinov, Anca D. Dragan, Stephen McAleer

Viaarxiv icon

A State Representation for Diminishing Rewards

Add code
Bookmark button
Alert button
Sep 07, 2023
Ted Moskovitz, Samo Hromadka, Ahmed Touati, Diana Borsa, Maneesh Sahani

Figure 1 for A State Representation for Diminishing Rewards
Figure 2 for A State Representation for Diminishing Rewards
Figure 3 for A State Representation for Diminishing Rewards
Figure 4 for A State Representation for Diminishing Rewards
Viaarxiv icon

ReLOAD: Reinforcement Learning with Optimistic Ascent-Descent for Last-Iterate Convergence in Constrained MDPs

Add code
Bookmark button
Alert button
Feb 02, 2023
Ted Moskovitz, Brendan O'Donoghue, Vivek Veeriah, Sebastian Flennerhag, Satinder Singh, Tom Zahavy

Figure 1 for ReLOAD: Reinforcement Learning with Optimistic Ascent-Descent for Last-Iterate Convergence in Constrained MDPs
Figure 2 for ReLOAD: Reinforcement Learning with Optimistic Ascent-Descent for Last-Iterate Convergence in Constrained MDPs
Figure 3 for ReLOAD: Reinforcement Learning with Optimistic Ascent-Descent for Last-Iterate Convergence in Constrained MDPs
Figure 4 for ReLOAD: Reinforcement Learning with Optimistic Ascent-Descent for Last-Iterate Convergence in Constrained MDPs
Viaarxiv icon

Transfer RL via the Undo Maps Formalism

Add code
Bookmark button
Alert button
Nov 26, 2022
Abhi Gupta, Ted Moskovitz, David Alvarez-Melis, Aldo Pacchiano

Figure 1 for Transfer RL via the Undo Maps Formalism
Figure 2 for Transfer RL via the Undo Maps Formalism
Figure 3 for Transfer RL via the Undo Maps Formalism
Viaarxiv icon

Minimum Description Length Control

Add code
Bookmark button
Alert button
Jul 24, 2022
Ted Moskovitz, Ta-Chu Kao, Maneesh Sahani, Matthew M. Botvinick

Figure 1 for Minimum Description Length Control
Figure 2 for Minimum Description Length Control
Figure 3 for Minimum Description Length Control
Figure 4 for Minimum Description Length Control
Viaarxiv icon