Alert button
Picture for Han Shen

Han Shen

Alert button

Principled Penalty-based Methods for Bilevel Reinforcement Learning and RLHF

Add code
Bookmark button
Alert button
Feb 10, 2024
Han Shen, Zhuoran Yang, Tianyi Chen

Viaarxiv icon

Joint Unsupervised and Supervised Training for Automatic Speech Recognition via Bilevel Optimization

Add code
Bookmark button
Alert button
Jan 13, 2024
A F M Saif, Xiaodong Cui, Han Shen, Songtao Lu, Brian Kingsbury, Tianyi Chen

Viaarxiv icon

On Penalty-based Bilevel Gradient Descent Method

Add code
Bookmark button
Alert button
Feb 10, 2023
Han Shen, Tianyi Chen

Figure 1 for On Penalty-based Bilevel Gradient Descent Method
Figure 2 for On Penalty-based Bilevel Gradient Descent Method
Figure 3 for On Penalty-based Bilevel Gradient Descent Method
Figure 4 for On Penalty-based Bilevel Gradient Descent Method
Viaarxiv icon

Alternating Implicit Projected SGD and Its Efficient Variants for Equality-constrained Bilevel Optimization

Add code
Bookmark button
Alert button
Nov 14, 2022
Quan Xiao, Han Shen, Wotao Yin, Tianyi Chen

Figure 1 for Alternating Implicit Projected SGD and Its Efficient Variants for Equality-constrained Bilevel Optimization
Figure 2 for Alternating Implicit Projected SGD and Its Efficient Variants for Equality-constrained Bilevel Optimization
Figure 3 for Alternating Implicit Projected SGD and Its Efficient Variants for Equality-constrained Bilevel Optimization
Figure 4 for Alternating Implicit Projected SGD and Its Efficient Variants for Equality-constrained Bilevel Optimization
Viaarxiv icon

Mitigating Gradient Bias in Multi-objective Learning: A Provably Convergent Stochastic Approach

Add code
Bookmark button
Alert button
Oct 23, 2022
Heshan Fernando, Han Shen, Miao Liu, Subhajit Chaudhury, Keerthiram Murugesan, Tianyi Chen

Figure 1 for Mitigating Gradient Bias in Multi-objective Learning: A Provably Convergent Stochastic Approach
Figure 2 for Mitigating Gradient Bias in Multi-objective Learning: A Provably Convergent Stochastic Approach
Figure 3 for Mitigating Gradient Bias in Multi-objective Learning: A Provably Convergent Stochastic Approach
Figure 4 for Mitigating Gradient Bias in Multi-objective Learning: A Provably Convergent Stochastic Approach
Viaarxiv icon

A Single-Timescale Analysis For Stochastic Approximation With Multiple Coupled Sequences

Add code
Bookmark button
Alert button
Jun 21, 2022
Han Shen, Tianyi Chen

Figure 1 for A Single-Timescale Analysis For Stochastic Approximation With Multiple Coupled Sequences
Figure 2 for A Single-Timescale Analysis For Stochastic Approximation With Multiple Coupled Sequences
Viaarxiv icon

Asynchronous Advantage Actor Critic: Non-asymptotic Analysis and Linear Speedup

Add code
Bookmark button
Alert button
Dec 31, 2020
Han Shen, Kaiqing Zhang, Mingyi Hong, Tianyi Chen

Figure 1 for Asynchronous Advantage Actor Critic: Non-asymptotic Analysis and Linear Speedup
Figure 2 for Asynchronous Advantage Actor Critic: Non-asymptotic Analysis and Linear Speedup
Figure 3 for Asynchronous Advantage Actor Critic: Non-asymptotic Analysis and Linear Speedup
Figure 4 for Asynchronous Advantage Actor Critic: Non-asymptotic Analysis and Linear Speedup
Viaarxiv icon

Multi-object Tracking via End-to-end Tracklet Searching and Ranking

Add code
Bookmark button
Alert button
Mar 04, 2020
Tao Hu, Lichao Huang, Han Shen

Figure 1 for Multi-object Tracking via End-to-end Tracklet Searching and Ranking
Figure 2 for Multi-object Tracking via End-to-end Tracklet Searching and Ranking
Figure 3 for Multi-object Tracking via End-to-end Tracklet Searching and Ranking
Figure 4 for Multi-object Tracking via End-to-end Tracklet Searching and Ranking
Viaarxiv icon

Adaptive Temporal Difference Learning with Linear Function Approximation

Add code
Bookmark button
Alert button
Feb 20, 2020
Tao Sun, Han Shen, Tianyi Chen, Dongsheng Li

Figure 1 for Adaptive Temporal Difference Learning with Linear Function Approximation
Figure 2 for Adaptive Temporal Difference Learning with Linear Function Approximation
Figure 3 for Adaptive Temporal Difference Learning with Linear Function Approximation
Figure 4 for Adaptive Temporal Difference Learning with Linear Function Approximation
Viaarxiv icon

Real Time Visual Tracking using Spatial-Aware Temporal Aggregation Network

Add code
Bookmark button
Alert button
Aug 02, 2019
Tao Hu, Lichao Huang, Xianming Liu, Han Shen

Figure 1 for Real Time Visual Tracking using Spatial-Aware Temporal Aggregation Network
Figure 2 for Real Time Visual Tracking using Spatial-Aware Temporal Aggregation Network
Figure 3 for Real Time Visual Tracking using Spatial-Aware Temporal Aggregation Network
Figure 4 for Real Time Visual Tracking using Spatial-Aware Temporal Aggregation Network
Viaarxiv icon