Austin Stone

Open-World Object Manipulation using Pre-trained Vision-Language Models

Mar 02, 2023
Austin Stone, Ted Xiao, Yao Lu, Keerthana Gopalakrishnan, Kuang-Huei Lee, Quan Vuong, Paul Wohlhart, Brianna Zitkovich, Fei Xia, Chelsea Finn, Karol Hausman

Via arXiv

Scaling Robot Learning with Semantically Imagined Experience

Feb 22, 2023
Tianhe Yu, Ted Xiao, Austin Stone, Jonathan Tompson, Anthony Brohan, Su Wang, Jaspiar Singh, Clayton Tan, Dee M, Jodilyn Peralta, Brian Ichter, Karol Hausman, Fei Xia

Via arXiv

RT-1: Robotics Transformer for Real-World Control at Scale

Dec 13, 2022
Anthony Brohan, Noah Brown, Justice Carbajal, Yevgen Chebotar, Joseph Dabis, Chelsea Finn, Keerthana Gopalakrishnan, Karol Hausman, Alex Herzog, Jasmine Hsu, Julian Ibarz, Brian Ichter, Alex Irpan, Tomas Jackson, Sally Jesmonth, Nikhil J Joshi, Ryan Julian, Dmitry Kalashnikov, Yuheng Kuang, Isabel Leal, Kuang-Huei Lee, Sergey Levine, Yao Lu, Utsav Malla, Deeksha Manjunath, Igor Mordatch, Ofir Nachum, Carolina Parada, Jodilyn Peralta, Emily Perez, Karl Pertsch, Jornell Quiambao, Kanishka Rao, Michael Ryoo, Grecia Salazar, Pannag Sanketi, Kevin Sayed, Jaspiar Singh, Sumedh Sontakke, Austin Stone, Clayton Tan, Huong Tran, Vincent Vanhoucke, Steve Vega, Quan Vuong, Fei Xia, Ted Xiao, Peng Xu, Sichun Xu, Tianhe Yu, Brianna Zitkovich

Via arXiv

Self-supervised AutoFlow

Dec 08, 2022
Hsin-Ping Huang, Charles Herrmann, Junhwa Hur, Erika Lu, Kyle Sargent, Austin Stone, Ming-Hsuan Yang, Deqing Sun

Via arXiv

Token Turing Machines

Nov 16, 2022
Michael S. Ryoo, Keerthana Gopalakrishnan, Kumara Kahatapitiya, Ted Xiao, Kanishka Rao, Austin Stone, Yao Lu, Julian Ibarz, Anurag Arnab

Via arXiv

Open-vocabulary Queryable Scene Representations for Real World Planning

Sep 20, 2022
Boyuan Chen, Fei Xia, Brian Ichter, Kanishka Rao, Keerthana Gopalakrishnan, Michael S. Ryoo, Austin Stone, Daniel Kappler

Via arXiv

Simple Open-Vocabulary Object Detection with Vision Transformers

May 12, 2022
Matthias Minderer, Alexey Gritsenko, Austin Stone, Maxim Neumann, Dirk Weissenborn, Alexey Dosovitskiy, Aravindh Mahendran, Anurag Arnab, Mostafa Dehghani, Zhuoran Shen, Xiao Wang, Xiaohua Zhai, Thomas Kipf, Neil Houlsby

Via arXiv

Kubric: A scalable dataset generator

Mar 07, 2022
Klaus Greff, Francois Belletti, Lucas Beyer, Carl Doersch, Yilun Du, Daniel Duckworth, David J. Fleet, Dan Gnanapragasam, Florian Golemo, Charles Herrmann, Thomas Kipf, Abhijit Kundu, Dmitry Lagun, Issam Laradji, Hsueh-Ti Liu, Henning Meyer, Yishu Miao, Derek Nowrouzezahrai, Cengiz Oztireli, Etienne Pot, Noha Radwan, Daniel Rebain, Sara Sabour, Mehdi S. M. Sajjadi, Matan Sela, Vincent Sitzmann, Austin Stone, Deqing Sun, Suhani Vora, Ziyu Wang, Tianhao Wu, Kwang Moo Yi, Fangcheng Zhong, Andrea Tagliasacchi

Via arXiv

Conditional Object-Centric Learning from Video

Nov 24, 2021
Thomas Kipf, Gamaleldin F. Elsayed, Aravindh Mahendran, Austin Stone, Sara Sabour, Georg Heigold, Rico Jonschkowski, Alexey Dosovitskiy, Klaus Greff

Via arXiv

SMURF: Self-Teaching Multi-Frame Unsupervised RAFT with Full-Image Warping

May 14, 2021
Austin Stone, Daniel Maurer, Alper Ayvaci, Anelia Angelova, Rico Jonschkowski

Via arXiv