Picture for Rosie Zhao

Rosie Zhao

SOAP: Improving and Stabilizing Shampoo using Adam

Add code
Sep 17, 2024
Viaarxiv icon

Deconstructing What Makes a Good Optimizer for Language Models

Add code
Jul 10, 2024
Viaarxiv icon

Feature emergence via margin maximization: case studies in algebraic tasks

Add code
Nov 13, 2023
Viaarxiv icon

Beyond Implicit Bias: The Insignificance of SGD Noise in Online Learning

Add code
Jun 14, 2023
Viaarxiv icon

Policy Gradient Methods in the Presence of Symmetries and State Abstractions

Add code
May 09, 2023
Figure 1 for Policy Gradient Methods in the Presence of Symmetries and State Abstractions
Figure 2 for Policy Gradient Methods in the Presence of Symmetries and State Abstractions
Figure 3 for Policy Gradient Methods in the Presence of Symmetries and State Abstractions
Figure 4 for Policy Gradient Methods in the Presence of Symmetries and State Abstractions
Viaarxiv icon

Loss of Plasticity in Continual Deep Reinforcement Learning

Add code
Mar 13, 2023
Viaarxiv icon

Continuous MDP Homomorphisms and Homomorphic Policy Gradient

Add code
Sep 15, 2022
Figure 1 for Continuous MDP Homomorphisms and Homomorphic Policy Gradient
Figure 2 for Continuous MDP Homomorphisms and Homomorphic Policy Gradient
Figure 3 for Continuous MDP Homomorphisms and Homomorphic Policy Gradient
Figure 4 for Continuous MDP Homomorphisms and Homomorphic Policy Gradient
Viaarxiv icon

Bridging the gap between supervised classification and unsupervised topic modelling for social-media assisted crisis management

Add code
Mar 22, 2021
Figure 1 for Bridging the gap between supervised classification and unsupervised topic modelling for social-media assisted crisis management
Figure 2 for Bridging the gap between supervised classification and unsupervised topic modelling for social-media assisted crisis management
Figure 3 for Bridging the gap between supervised classification and unsupervised topic modelling for social-media assisted crisis management
Figure 4 for Bridging the gap between supervised classification and unsupervised topic modelling for social-media assisted crisis management
Viaarxiv icon

A Study of Policy Gradient on a Class of Exactly Solvable Models

Add code
Nov 03, 2020
Figure 1 for A Study of Policy Gradient on a Class of Exactly Solvable Models
Figure 2 for A Study of Policy Gradient on a Class of Exactly Solvable Models
Figure 3 for A Study of Policy Gradient on a Class of Exactly Solvable Models
Figure 4 for A Study of Policy Gradient on a Class of Exactly Solvable Models
Viaarxiv icon