Alert button
Picture for Martha White

Martha White

Alert button

Understanding Feature Transfer Through Representation Alignment

Dec 15, 2021
Ehsan Imani, Wei Hu, Martha White

Figure 1 for Understanding Feature Transfer Through Representation Alignment
Figure 2 for Understanding Feature Transfer Through Representation Alignment
Figure 3 for Understanding Feature Transfer Through Representation Alignment
Figure 4 for Understanding Feature Transfer Through Representation Alignment
Viaarxiv icon

Off-Policy Actor-Critic with Emphatic Weightings

Nov 16, 2021
Eric Graves, Ehsan Imani, Raksha Kumaraswamy, Martha White

Figure 1 for Off-Policy Actor-Critic with Emphatic Weightings
Figure 2 for Off-Policy Actor-Critic with Emphatic Weightings
Figure 3 for Off-Policy Actor-Critic with Emphatic Weightings
Figure 4 for Off-Policy Actor-Critic with Emphatic Weightings
Viaarxiv icon

Exploiting Action Impact Regularity and Partially Known Models for Offline Reinforcement Learning

Nov 15, 2021
Vincent Liu, James Wright, Martha White

Figure 1 for Exploiting Action Impact Regularity and Partially Known Models for Offline Reinforcement Learning
Figure 2 for Exploiting Action Impact Regularity and Partially Known Models for Offline Reinforcement Learning
Figure 3 for Exploiting Action Impact Regularity and Partially Known Models for Offline Reinforcement Learning
Viaarxiv icon

Greedification Operators for Policy Optimization: Investigating Forward and Reverse KL Divergences

Jul 17, 2021
Alan Chan, Hugo Silva, Sungsu Lim, Tadashi Kozuno, A. Rupam Mahmood, Martha White

Figure 1 for Greedification Operators for Policy Optimization: Investigating Forward and Reverse KL Divergences
Figure 2 for Greedification Operators for Policy Optimization: Investigating Forward and Reverse KL Divergences
Figure 3 for Greedification Operators for Policy Optimization: Investigating Forward and Reverse KL Divergences
Figure 4 for Greedification Operators for Policy Optimization: Investigating Forward and Reverse KL Divergences
Viaarxiv icon

Predictive Representation Learning for Language Modeling

May 29, 2021
Qingfeng Lan, Luke Kumar, Martha White, Alona Fyshe

Figure 1 for Predictive Representation Learning for Language Modeling
Figure 2 for Predictive Representation Learning for Language Modeling
Figure 3 for Predictive Representation Learning for Language Modeling
Figure 4 for Predictive Representation Learning for Language Modeling
Viaarxiv icon

A Generalized Projected Bellman Error for Off-policy Value Estimation in Reinforcement Learning

Apr 28, 2021
Andrew Patterson, Adam White, Sina Ghiassian, Martha White

Figure 1 for A Generalized Projected Bellman Error for Off-policy Value Estimation in Reinforcement Learning
Figure 2 for A Generalized Projected Bellman Error for Off-policy Value Estimation in Reinforcement Learning
Figure 3 for A Generalized Projected Bellman Error for Off-policy Value Estimation in Reinforcement Learning
Figure 4 for A Generalized Projected Bellman Error for Off-policy Value Estimation in Reinforcement Learning
Viaarxiv icon

Scalable Online Recurrent Learning Using Columnar Neural Networks

Mar 09, 2021
Khurram Javed, Martha White, Rich Sutton

Figure 1 for Scalable Online Recurrent Learning Using Columnar Neural Networks
Figure 2 for Scalable Online Recurrent Learning Using Columnar Neural Networks
Figure 3 for Scalable Online Recurrent Learning Using Columnar Neural Networks
Figure 4 for Scalable Online Recurrent Learning Using Columnar Neural Networks
Viaarxiv icon

Perspectives on Sim2Real Transfer for Robotics: A Summary of the R:SS 2020 Workshop

Dec 07, 2020
Sebastian Höfer, Kostas Bekris, Ankur Handa, Juan Camilo Gamboa, Florian Golemo, Melissa Mozifian, Chris Atkeson, Dieter Fox, Ken Goldberg, John Leonard, C. Karen Liu, Jan Peters, Shuran Song, Peter Welinder, Martha White

Figure 1 for Perspectives on Sim2Real Transfer for Robotics: A Summary of the R:SS 2020 Workshop
Viaarxiv icon

Towards Safe Policy Improvement for Non-Stationary MDPs

Oct 23, 2020
Yash Chandak, Scott M. Jordan, Georgios Theocharous, Martha White, Philip S. Thomas

Figure 1 for Towards Safe Policy Improvement for Non-Stationary MDPs
Figure 2 for Towards Safe Policy Improvement for Non-Stationary MDPs
Figure 3 for Towards Safe Policy Improvement for Non-Stationary MDPs
Figure 4 for Towards Safe Policy Improvement for Non-Stationary MDPs
Viaarxiv icon

From Language to Language-ish: How Brain-Like is an LSTM's Representation of Nonsensical Language Stimuli?

Oct 14, 2020
Maryam Hashemzadeh, Greta Kaufeld, Martha White, Andrea E. Martin, Alona Fyshe

Figure 1 for From Language to Language-ish: How Brain-Like is an LSTM's Representation of Nonsensical Language Stimuli?
Figure 2 for From Language to Language-ish: How Brain-Like is an LSTM's Representation of Nonsensical Language Stimuli?
Figure 3 for From Language to Language-ish: How Brain-Like is an LSTM's Representation of Nonsensical Language Stimuli?
Figure 4 for From Language to Language-ish: How Brain-Like is an LSTM's Representation of Nonsensical Language Stimuli?
Viaarxiv icon