Alex Olshevsky

Sample Complexity of the Linear Quadratic Regulator: A Reinforcement Learning Lens

Apr 18, 2024
Amirreza Neshaei Moghaddam, Alex Olshevsky, Bahman Gharesifard

One-Shot Averaging for Distributed TD($λ$) Under Markov Sampling

Mar 13, 2024
Haoxing Tian, Ioannis Ch. Paschalidis, Alex Olshevsky

Convex SGD: Generalization Without Early Stopping

Jan 08, 2024
Julien Hendrickx, Alex Olshevsky

On the Performance of Temporal Difference Learning With Neural Networks

Dec 08, 2023
Haoxing Tian, Ioannis Ch. Paschalidis, Alex Olshevsky

Distributed TD(0) with Almost No Communication

May 25, 2023
Rui Liu, Alex Olshevsky

Closing the gap between SVRG and TD-SVRG with Gradient Splitting

Nov 29, 2022
Arsenii Mustafin, Alex Olshevsky, Ioannis Ch. Paschalidis

A Small Gain Analysis of Single Timescale Actor Critic

Mar 08, 2022
Alex Olshevsky, Bahman Gharesifard

Communication-efficient SGD: From Local SGD to One-Shot Averaging

Jun 09, 2021
Artin Spiridonoff, Alex Olshevsky, Ioannis Ch. Paschalidis

Temporal Difference Learning as Gradient Splitting

Oct 27, 2020
Rui Liu, Alex Olshevsky
