Jihwan Jeong


Factual and Personalized Recommendations using Language Models and Reinforcement Learning

Oct 09, 2023
Jihwan Jeong, Yinlam Chow, Guy Tennenholtz, Chih-Wei Hsu, Azamat Tulepbergenov, Mohammad Ghavamzadeh, Craig Boutilier

Demystifying Embedding Spaces using Large Language Models

Oct 06, 2023
Guy Tennenholtz, Yinlam Chow, Chih-Wei Hsu, Jihwan Jeong, Lior Shani, Azamat Tulepbergenov, Deepak Ramachandran, Martin Mladenov, Craig Boutilier

Conservative Bayesian Model-Based Value Expansion for Offline Policy Optimization

Oct 07, 2022
Jihwan Jeong, Xiaoyu Wang, Michael Gimelfarb, Hyunwoo Kim, Baher Abdulhai, Scott Sanner

RAPTOR: End-to-end Risk-Aware MDP Planning and Policy Learning by Backpropagation

Jun 14, 2021
Noah Patton, Jihwan Jeong, Michael Gimelfarb, Scott Sanner

Online Continual Learning in Image Classification: An Empirical Survey

Jan 25, 2021
Zheda Mai, Ruiwen Li, Jihwan Jeong, David Quispe, Hyunwoo Kim, Scott Sanner

Adversarial Shapley Value Experience Replay for Task-Free Continual Learning

Aug 31, 2020
Zheda Mai, Dongsub Shim, Jihwan Jeong, Scott Sanner, Hyunwoo Kim, Jongseong Jang

Batch-level Experience Replay with Review for Continual Learning

Jul 11, 2020
Zheda Mai, Hyunwoo Kim, Jihwan Jeong, Scott Sanner
