Joey Hejna

From $r$ to $Q^*$: Your Language Model is Secretly a Q-Function

Apr 18, 2024
Rafael Rafailov, Joey Hejna, Ryan Park, Chelsea Finn

DROID: A Large-Scale In-The-Wild Robot Manipulation Dataset

Mar 19, 2024
Alexander Khazatsky, Karl Pertsch, Suraj Nair, Ashwin Balakrishna, Sudeep Dasari, Siddharth Karamcheti, Soroush Nasiriany, Mohan Kumar Srirama, Lawrence Yunliang Chen, Kirsty Ellis, Peter David Fagan, Joey Hejna, Masha Itkina, Marion Lepert, Yecheng Jason Ma, Patrick Tree Miller, Jimmy Wu, Suneel Belkhale, Shivin Dass, Huy Ha, Arhan Jain, Abraham Lee, Youngwoon Lee, Marius Memmel, Sungjae Park, Ilija Radosavovic, Kaiyuan Wang, Albert Zhan, Kevin Black, Cheng Chi, Kyle Beltran Hatch, Shan Lin, Jingpei Lu, Jean Mercat, Abdul Rehman, Pannag R Sanketi, Archit Sharma, Cody Simpson, Quan Vuong, Homer Rich Walke, Blake Wulfe, Ted Xiao, Jonathan Heewon Yang, Arefeh Yavary, Tony Z. Zhao, Christopher Agia, Rohan Baijal, Mateo Guaman Castro, Daphne Chen, Qiuyu Chen, Trinity Chung, Jaimyn Drake, Ethan Paul Foster, Jensen Gao, David Antonio Herrera, Minho Heo, Kyle Hsu, Jiaheng Hu, Donovon Jackson, Charlotte Le, Yunshuang Li, Kevin Lin, Roy Lin, Zehan Ma, Abhiram Maddukuri, Suvir Mirchandani, Daniel Morton, Tony Nguyen, Abigail O'Neill, Rosario Scalise, Derick Seale, Victor Son, Stephen Tian, Emi Tran, Andrew E. Wang, Yilin Wu, Annie Xie, Jingyun Yang, Patrick Yin, Yunchu Zhang, Osbert Bastani, Glen Berseth, Jeannette Bohg, Ken Goldberg, Abhinav Gupta, Abhishek Gupta, Dinesh Jayaraman, Joseph J Lim, Jitendra Malik, Roberto Martín-Martín, Subramanian Ramamoorthy, Dorsa Sadigh, Shuran Song, Jiajun Wu, Michael C. Yip, Yuke Zhu, Thomas Kollar, Sergey Levine, Chelsea Finn

Contrastive Preference Learning: Learning from Human Feedback without RL

Oct 24, 2023
Joey Hejna, Rafael Rafailov, Harshit Sikchi, Chelsea Finn, Scott Niekum, W. Bradley Knox, Dorsa Sadigh

Contrastive Preference Learning: Learning from Human Feedback without RL

Oct 20, 2023
Joey Hejna, Rafael Rafailov, Harshit Sikchi, Chelsea Finn, Scott Niekum, W. Bradley Knox, Dorsa Sadigh

Improving Long-Horizon Imitation Through Instruction Prediction

Jun 21, 2023
Joey Hejna, Pieter Abbeel, Lerrel Pinto

Inverse Preference Learning: Preference-based RL without a Reward Function

May 24, 2023
Joey Hejna, Dorsa Sadigh

Distance Weighted Supervised Learning for Offline Interaction Data

Apr 26, 2023
Joey Hejna, Jensen Gao, Dorsa Sadigh

Extreme Q-Learning: MaxEnt RL without Entropy

Jan 05, 2023
Divyansh Garg, Joey Hejna, Matthieu Geist, Stefano Ermon

Few-Shot Preference Learning for Human-in-the-Loop RL

Dec 06, 2022
Joey Hejna, Dorsa Sadigh
