Semih Cayci

Convergence of Gradient Descent for Recurrent Neural Networks: A Nonasymptotic Analysis

Feb 19, 2024
Semih Cayci, Atilla Eryilmaz

Provably Robust Temporal Difference Learning for Heavy-Tailed Rewards

Jun 20, 2023
Semih Cayci, Atilla Eryilmaz

Policy Mirror Ascent for Efficient and Independent Learning in Mean Field Games

Dec 29, 2022
Batuhan Yardim, Semih Cayci, Matthieu Geist, Niao He

Finite-Time Analysis of Entropy-Regularized Neural Natural Actor-Critic Algorithm

Jun 02, 2022
Semih Cayci, Niao He, R. Srikant

Learning to Control Partially Observed Systems with Finite Memory

Feb 22, 2022
Semih Cayci, Niao He, R. Srikant

A Lyapunov-Based Methodology for Constrained Optimization with Bandit Feedback

Jun 09, 2021
Semih Cayci, Yilin Zheng, Atilla Eryilmaz

Linear Convergence of Entropy-Regularized Natural Policy Gradient with Linear Function Approximation

Jun 08, 2021
Semih Cayci, Niao He, R. Srikant

Sample Complexity and Overparameterization Bounds for Projection-Free Neural TD Learning

Mar 02, 2021
Semih Cayci, Siddhartha Satpathi, Niao He, R. Srikant

Continuous-Time Multi-Armed Bandits with Controlled Restarts

Jun 30, 2020
Semih Cayci, Atilla Eryilmaz, R. Srikant

Group-Fair Online Allocation in Continuous Time

Jun 11, 2020
Semih Cayci, Swati Gupta, Atilla Eryilmaz
