
Yuhao Ding

Tempo Adaptation in Non-stationary Reinforcement Learning

Sep 26, 2023
Hyunin Lee, Yuhao Ding, Jongmin Lee, Ming Jin, Javad Lavaei, Somayeh Sojoudi

Scalable Primal-Dual Actor-Critic Method for Safe Multi-Agent RL with General Utilities

May 27, 2023
Donghao Ying, Yunkai Zhang, Yuhao Ding, Alec Koppel, Javad Lavaei

DyBit: Dynamic Bit-Precision Numbers for Efficient Quantized Neural Network Inference

Feb 24, 2023
Jiajun Zhou, Jiajun Wu, Yizhao Gao, Yuhao Ding, Chaofan Tao, Boyu Li, Fengbin Tu, Kwang-Ting Cheng, Hayden Kwok-Hay So, Ngai Wong

Scalable Multi-Agent Reinforcement Learning with General Utilities

Feb 15, 2023
Donghao Ying, Yuhao Ding, Alec Koppel, Javad Lavaei

Non-stationary Risk-sensitive Reinforcement Learning: Near-optimal Dynamic Regret, Adaptive Detection, and Separation Design

Nov 19, 2022
Yuhao Ding, Ming Jin, Javad Lavaei

Policy-based Primal-Dual Methods for Convex Constrained Markov Decision Processes

May 22, 2022
Donghao Ying, Mengzi Guo, Yuhao Ding, Javad Lavaei, Zuo-Jun Max Shen

Provably Efficient Primal-Dual Reinforcement Learning for CMDPs with Non-stationary Objectives and Constraints

Jan 28, 2022
Yuhao Ding, Javad Lavaei

Beyond Exact Gradients: Convergence of Stochastic Soft-Max Policy Gradient Methods with Entropy Regularization

Oct 19, 2021
Yuhao Ding, Junzi Zhang, Javad Lavaei

On the Global Convergence of Momentum-based Policy Gradient

Oct 19, 2021
Yuhao Ding, Junzi Zhang, Javad Lavaei

A Dual Approach to Constrained Markov Decision Processes with Entropy Regularization

Oct 17, 2021
Donghao Ying, Yuhao Ding, Javad Lavaei
