Picture for Rosie Zhao

Rosie Zhao

Deconstructing What Makes a Good Optimizer for Language Models

Add code
Jul 10, 2024
Viaarxiv icon

Feature emergence via margin maximization: case studies in algebraic tasks

Add code
Nov 13, 2023
Figure 1 for Feature emergence via margin maximization: case studies in algebraic tasks
Figure 2 for Feature emergence via margin maximization: case studies in algebraic tasks
Figure 3 for Feature emergence via margin maximization: case studies in algebraic tasks
Figure 4 for Feature emergence via margin maximization: case studies in algebraic tasks
Viaarxiv icon

Beyond Implicit Bias: The Insignificance of SGD Noise in Online Learning

Add code
Jun 14, 2023
Figure 1 for Beyond Implicit Bias: The Insignificance of SGD Noise in Online Learning
Figure 2 for Beyond Implicit Bias: The Insignificance of SGD Noise in Online Learning
Figure 3 for Beyond Implicit Bias: The Insignificance of SGD Noise in Online Learning
Figure 4 for Beyond Implicit Bias: The Insignificance of SGD Noise in Online Learning
Viaarxiv icon

Policy Gradient Methods in the Presence of Symmetries and State Abstractions

Add code
May 09, 2023
Figure 1 for Policy Gradient Methods in the Presence of Symmetries and State Abstractions
Figure 2 for Policy Gradient Methods in the Presence of Symmetries and State Abstractions
Figure 3 for Policy Gradient Methods in the Presence of Symmetries and State Abstractions
Figure 4 for Policy Gradient Methods in the Presence of Symmetries and State Abstractions
Viaarxiv icon

Loss of Plasticity in Continual Deep Reinforcement Learning

Add code
Mar 13, 2023
Figure 1 for Loss of Plasticity in Continual Deep Reinforcement Learning
Figure 2 for Loss of Plasticity in Continual Deep Reinforcement Learning
Figure 3 for Loss of Plasticity in Continual Deep Reinforcement Learning
Figure 4 for Loss of Plasticity in Continual Deep Reinforcement Learning
Viaarxiv icon

Continuous MDP Homomorphisms and Homomorphic Policy Gradient

Add code
Sep 15, 2022
Figure 1 for Continuous MDP Homomorphisms and Homomorphic Policy Gradient
Figure 2 for Continuous MDP Homomorphisms and Homomorphic Policy Gradient
Figure 3 for Continuous MDP Homomorphisms and Homomorphic Policy Gradient
Figure 4 for Continuous MDP Homomorphisms and Homomorphic Policy Gradient
Viaarxiv icon

Bridging the gap between supervised classification and unsupervised topic modelling for social-media assisted crisis management

Add code
Mar 22, 2021
Figure 1 for Bridging the gap between supervised classification and unsupervised topic modelling for social-media assisted crisis management
Figure 2 for Bridging the gap between supervised classification and unsupervised topic modelling for social-media assisted crisis management
Figure 3 for Bridging the gap between supervised classification and unsupervised topic modelling for social-media assisted crisis management
Figure 4 for Bridging the gap between supervised classification and unsupervised topic modelling for social-media assisted crisis management
Viaarxiv icon

A Study of Policy Gradient on a Class of Exactly Solvable Models

Add code
Nov 03, 2020
Figure 1 for A Study of Policy Gradient on a Class of Exactly Solvable Models
Figure 2 for A Study of Policy Gradient on a Class of Exactly Solvable Models
Figure 3 for A Study of Policy Gradient on a Class of Exactly Solvable Models
Figure 4 for A Study of Policy Gradient on a Class of Exactly Solvable Models
Viaarxiv icon