Alert button

Actor-critic is implicitly biased towards high entropy optimal policies

Oct 21, 2021
Yuzheng Hu, Ziwei Ji, Matus Telgarsky

Share this with someone who'll enjoy it:

View paper onarxiv iconopen_review iconOpenReview

Share this with someone who'll enjoy it: