Alert button
Picture for Tom Zahavy

Tom Zahavy

Alert button

Balancing Constraints and Rewards with Meta-Gradient D4PG

Add code
Bookmark button
Alert button
Oct 13, 2020
Dan A. Calian, Daniel J. Mankowitz, Tom Zahavy, Zhongwen Xu, Junhyuk Oh, Nir Levine, Timothy Mann

Figure 1 for Balancing Constraints and Rewards with Meta-Gradient D4PG
Figure 2 for Balancing Constraints and Rewards with Meta-Gradient D4PG
Figure 3 for Balancing Constraints and Rewards with Meta-Gradient D4PG
Figure 4 for Balancing Constraints and Rewards with Meta-Gradient D4PG
Viaarxiv icon

Learning to Ask Medical Questions using Reinforcement Learning

Add code
Bookmark button
Alert button
Mar 31, 2020
Uri Shaham, Tom Zahavy, Cesar Caraballo, Shiwani Mahajan, Daisy Massey, Harlan Krumholz

Figure 1 for Learning to Ask Medical Questions using Reinforcement Learning
Figure 2 for Learning to Ask Medical Questions using Reinforcement Learning
Figure 3 for Learning to Ask Medical Questions using Reinforcement Learning
Figure 4 for Learning to Ask Medical Questions using Reinforcement Learning
Viaarxiv icon

Self-Tuning Deep Reinforcement Learning

Add code
Bookmark button
Alert button
Mar 02, 2020
Tom Zahavy, Zhongwen Xu, Vivek Veeriah, Matteo Hessel, Junhyuk Oh, Hado van Hasselt, David Silver, Satinder Singh

Figure 1 for Self-Tuning Deep Reinforcement Learning
Figure 2 for Self-Tuning Deep Reinforcement Learning
Figure 3 for Self-Tuning Deep Reinforcement Learning
Figure 4 for Self-Tuning Deep Reinforcement Learning
Viaarxiv icon

Deep learning reconstruction of ultrashort pulses from 2D spatial intensity patterns recorded by an all-in-line system in a single-shot

Add code
Bookmark button
Alert button
Nov 23, 2019
Ron Ziv, Alex Dikopoltsev, Tom Zahavy, Ittai Rubinstein, Pavel Sidorenko, Oren Cohen, Mordechai Segev

Figure 1 for Deep learning reconstruction of ultrashort pulses from 2D spatial intensity patterns recorded by an all-in-line system in a single-shot
Figure 2 for Deep learning reconstruction of ultrashort pulses from 2D spatial intensity patterns recorded by an all-in-line system in a single-shot
Figure 3 for Deep learning reconstruction of ultrashort pulses from 2D spatial intensity patterns recorded by an all-in-line system in a single-shot
Figure 4 for Deep learning reconstruction of ultrashort pulses from 2D spatial intensity patterns recorded by an all-in-line system in a single-shot
Viaarxiv icon

Apprenticeship Learning via Frank-Wolfe

Add code
Bookmark button
Alert button
Nov 20, 2019
Tom Zahavy, Alon Cohen, Haim Kaplan, Yishay Mansour

Figure 1 for Apprenticeship Learning via Frank-Wolfe
Figure 2 for Apprenticeship Learning via Frank-Wolfe
Viaarxiv icon

Inverse Reinforcement Learning in Contextual MDPs

Add code
Bookmark button
Alert button
May 29, 2019
Philip Korsunsky, Stav Belogolovsky, Tom Zahavy, Chen Tessler, Shie Mannor

Figure 1 for Inverse Reinforcement Learning in Contextual MDPs
Figure 2 for Inverse Reinforcement Learning in Contextual MDPs
Figure 3 for Inverse Reinforcement Learning in Contextual MDPs
Figure 4 for Inverse Reinforcement Learning in Contextual MDPs
Viaarxiv icon

Average reward reinforcement learning with unknown mixing times

Add code
Bookmark button
Alert button
May 23, 2019
Tom Zahavy, Alon Cohen, Haim Kaplan, Yishay Mansour

Figure 1 for Average reward reinforcement learning with unknown mixing times
Viaarxiv icon