Alert button
Picture for Tengyang Xie

Tengyang Xie

Alert button

Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences

Add code
Bookmark button
Alert button
Apr 04, 2024
Corby Rosset, Ching-An Cheng, Arindam Mitra, Michael Santacroce, Ahmed Awadallah, Tengyang Xie

Viaarxiv icon

Towards Principled Representation Learning from Videos for Reinforcement Learning

Add code
Bookmark button
Alert button
Mar 20, 2024
Dipendra Misra, Akanksha Saran, Tengyang Xie, Alex Lamb, John Langford

Figure 1 for Towards Principled Representation Learning from Videos for Reinforcement Learning
Figure 2 for Towards Principled Representation Learning from Videos for Reinforcement Learning
Figure 3 for Towards Principled Representation Learning from Videos for Reinforcement Learning
Figure 4 for Towards Principled Representation Learning from Videos for Reinforcement Learning
Viaarxiv icon

CounterCurate: Enhancing Physical and Semantic Visio-Linguistic Compositional Reasoning via Counterfactual Examples

Add code
Bookmark button
Alert button
Feb 20, 2024
Jianrui Zhang, Mu Cai, Tengyang Xie, Yong Jae Lee

Viaarxiv icon

Harnessing Density Ratios for Online Reinforcement Learning

Add code
Bookmark button
Alert button
Jan 18, 2024
Philip Amortila, Dylan J. Foster, Nan Jiang, Ayush Sekhari, Tengyang Xie

Viaarxiv icon

Adversarial Model for Offline Reinforcement Learning

Add code
Bookmark button
Alert button
Feb 21, 2023
Mohak Bhardwaj, Tengyang Xie, Byron Boots, Nan Jiang, Ching-An Cheng

Figure 1 for Adversarial Model for Offline Reinforcement Learning
Figure 2 for Adversarial Model for Offline Reinforcement Learning
Figure 3 for Adversarial Model for Offline Reinforcement Learning
Figure 4 for Adversarial Model for Offline Reinforcement Learning
Viaarxiv icon

ARMOR: A Model-based Framework for Improving Arbitrary Baseline Policies with Offline Data

Add code
Bookmark button
Alert button
Nov 08, 2022
Tengyang Xie, Mohak Bhardwaj, Nan Jiang, Ching-An Cheng

Viaarxiv icon

The Role of Coverage in Online Reinforcement Learning

Add code
Bookmark button
Alert button
Oct 09, 2022
Tengyang Xie, Dylan J. Foster, Yu Bai, Nan Jiang, Sham M. Kakade

Figure 1 for The Role of Coverage in Online Reinforcement Learning
Viaarxiv icon

Interaction-Grounded Learning with Action-inclusive Feedback

Add code
Bookmark button
Alert button
Jun 16, 2022
Tengyang Xie, Akanksha Saran, Dylan J. Foster, Lekan Molu, Ida Momennejad, Nan Jiang, Paul Mineiro, John Langford

Figure 1 for Interaction-Grounded Learning with Action-inclusive Feedback
Figure 2 for Interaction-Grounded Learning with Action-inclusive Feedback
Figure 3 for Interaction-Grounded Learning with Action-inclusive Feedback
Figure 4 for Interaction-Grounded Learning with Action-inclusive Feedback
Viaarxiv icon

Adversarially Trained Actor Critic for Offline Reinforcement Learning

Add code
Bookmark button
Alert button
Feb 05, 2022
Ching-An Cheng, Tengyang Xie, Nan Jiang, Alekh Agarwal

Figure 1 for Adversarially Trained Actor Critic for Offline Reinforcement Learning
Figure 2 for Adversarially Trained Actor Critic for Offline Reinforcement Learning
Figure 3 for Adversarially Trained Actor Critic for Offline Reinforcement Learning
Figure 4 for Adversarially Trained Actor Critic for Offline Reinforcement Learning
Viaarxiv icon

Bellman-consistent Pessimism for Offline Reinforcement Learning

Add code
Bookmark button
Alert button
Jul 01, 2021
Tengyang Xie, Ching-An Cheng, Nan Jiang, Paul Mineiro, Alekh Agarwal

Figure 1 for Bellman-consistent Pessimism for Offline Reinforcement Learning
Viaarxiv icon