Alert button
Picture for Ching-An Cheng

Ching-An Cheng

Alert button

Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences

Add code
Bookmark button
Alert button
Apr 04, 2024
Corby Rosset, Ching-An Cheng, Arindam Mitra, Michael Santacroce, Ahmed Awadallah, Tengyang Xie

Viaarxiv icon

PRISE: Learning Temporal Action Abstractions as a Sequence Compression Problem

Add code
Bookmark button
Alert button
Feb 16, 2024
Ruijie Zheng, Ching-An Cheng, Hal Daumé III, Furong Huang, Andrey Kolobov

Viaarxiv icon

LLF-Bench: Benchmark for Interactive Learning from Language Feedback

Add code
Bookmark button
Alert button
Dec 13, 2023
Ching-An Cheng, Andrey Kolobov, Dipendra Misra, Allen Nie, Adith Swaminathan

Figure 1 for LLF-Bench: Benchmark for Interactive Learning from Language Feedback
Figure 2 for LLF-Bench: Benchmark for Interactive Learning from Language Feedback
Figure 3 for LLF-Bench: Benchmark for Interactive Learning from Language Feedback
Figure 4 for LLF-Bench: Benchmark for Interactive Learning from Language Feedback
Viaarxiv icon

Interactive Robot Learning from Verbal Correction

Add code
Bookmark button
Alert button
Oct 26, 2023
Huihan Liu, Alice Chen, Yuke Zhu, Adith Swaminathan, Andrey Kolobov, Ching-An Cheng

Viaarxiv icon

Goal Representations for Instruction Following: A Semi-Supervised Language Interface to Control

Add code
Bookmark button
Alert button
Jun 30, 2023
Vivek Myers, Andre He, Kuan Fang, Homer Walke, Philippe Hansen-Estruch, Ching-An Cheng, Mihai Jalobeanu, Andrey Kolobov, Anca Dragan, Sergey Levine

Figure 1 for Goal Representations for Instruction Following: A Semi-Supervised Language Interface to Control
Figure 2 for Goal Representations for Instruction Following: A Semi-Supervised Language Interface to Control
Figure 3 for Goal Representations for Instruction Following: A Semi-Supervised Language Interface to Control
Figure 4 for Goal Representations for Instruction Following: A Semi-Supervised Language Interface to Control
Viaarxiv icon

Survival Instinct in Offline Reinforcement Learning

Add code
Bookmark button
Alert button
Jun 05, 2023
Anqi Li, Dipendra Misra, Andrey Kolobov, Ching-An Cheng

Figure 1 for Survival Instinct in Offline Reinforcement Learning
Figure 2 for Survival Instinct in Offline Reinforcement Learning
Figure 3 for Survival Instinct in Offline Reinforcement Learning
Figure 4 for Survival Instinct in Offline Reinforcement Learning
Viaarxiv icon

Improving Offline RL by Blending Heuristics

Add code
Bookmark button
Alert button
Jun 01, 2023
Sinong Geng, Aldo Pacchiano, Andrey Kolobov, Ching-An Cheng

Figure 1 for Improving Offline RL by Blending Heuristics
Figure 2 for Improving Offline RL by Blending Heuristics
Figure 3 for Improving Offline RL by Blending Heuristics
Figure 4 for Improving Offline RL by Blending Heuristics
Viaarxiv icon

MAHALO: Unifying Offline Reinforcement Learning and Imitation Learning from Observations

Add code
Bookmark button
Alert button
Mar 30, 2023
Anqi Li, Byron Boots, Ching-An Cheng

Figure 1 for MAHALO: Unifying Offline Reinforcement Learning and Imitation Learning from Observations
Figure 2 for MAHALO: Unifying Offline Reinforcement Learning and Imitation Learning from Observations
Figure 3 for MAHALO: Unifying Offline Reinforcement Learning and Imitation Learning from Observations
Figure 4 for MAHALO: Unifying Offline Reinforcement Learning and Imitation Learning from Observations
Viaarxiv icon

PLEX: Making the Most of the Available Data for Robotic Manipulation Pretraining

Add code
Bookmark button
Alert button
Mar 15, 2023
Garrett Thomas, Ching-An Cheng, Ricky Loynd, Vibhav Vineet, Mihai Jalobeanu, Andrey Kolobov

Figure 1 for PLEX: Making the Most of the Available Data for Robotic Manipulation Pretraining
Figure 2 for PLEX: Making the Most of the Available Data for Robotic Manipulation Pretraining
Figure 3 for PLEX: Making the Most of the Available Data for Robotic Manipulation Pretraining
Figure 4 for PLEX: Making the Most of the Available Data for Robotic Manipulation Pretraining
Viaarxiv icon

Adversarial Model for Offline Reinforcement Learning

Add code
Bookmark button
Alert button
Feb 21, 2023
Mohak Bhardwaj, Tengyang Xie, Byron Boots, Nan Jiang, Ching-An Cheng

Figure 1 for Adversarial Model for Offline Reinforcement Learning
Figure 2 for Adversarial Model for Offline Reinforcement Learning
Figure 3 for Adversarial Model for Offline Reinforcement Learning
Figure 4 for Adversarial Model for Offline Reinforcement Learning
Viaarxiv icon