Picture for Michael Ryoo

Michael Ryoo

SARA-RT: Scaling up Robotics Transformers with Self-Adaptive Robust Attention

Add code
Dec 04, 2023
Figure 1 for SARA-RT: Scaling up Robotics Transformers with Self-Adaptive Robust Attention
Figure 2 for SARA-RT: Scaling up Robotics Transformers with Self-Adaptive Robust Attention
Figure 3 for SARA-RT: Scaling up Robotics Transformers with Self-Adaptive Robust Attention
Figure 4 for SARA-RT: Scaling up Robotics Transformers with Self-Adaptive Robust Attention
Viaarxiv icon

Limited Data, Unlimited Potential: A Study on ViTs Augmented by Masked Autoencoders

Add code
Oct 31, 2023
Figure 1 for Limited Data, Unlimited Potential: A Study on ViTs Augmented by Masked Autoencoders
Figure 2 for Limited Data, Unlimited Potential: A Study on ViTs Augmented by Masked Autoencoders
Figure 3 for Limited Data, Unlimited Potential: A Study on ViTs Augmented by Masked Autoencoders
Figure 4 for Limited Data, Unlimited Potential: A Study on ViTs Augmented by Masked Autoencoders
Viaarxiv icon

RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control

Add code
Jul 28, 2023
Figure 1 for RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control
Figure 2 for RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control
Figure 3 for RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control
Figure 4 for RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control
Viaarxiv icon

Language-based Action Concept Spaces Improve Video Self-Supervised Learning

Add code
Jul 20, 2023
Figure 1 for Language-based Action Concept Spaces Improve Video Self-Supervised Learning
Figure 2 for Language-based Action Concept Spaces Improve Video Self-Supervised Learning
Figure 3 for Language-based Action Concept Spaces Improve Video Self-Supervised Learning
Figure 4 for Language-based Action Concept Spaces Improve Video Self-Supervised Learning
Viaarxiv icon

RT-1: Robotics Transformer for Real-World Control at Scale

Add code
Dec 13, 2022
Figure 1 for RT-1: Robotics Transformer for Real-World Control at Scale
Figure 2 for RT-1: Robotics Transformer for Real-World Control at Scale
Figure 3 for RT-1: Robotics Transformer for Real-World Control at Scale
Figure 4 for RT-1: Robotics Transformer for Real-World Control at Scale
Viaarxiv icon

Neural Neural Textures Make Sim2Real Consistent

Add code
Jun 27, 2022
Figure 1 for Neural Neural Textures Make Sim2Real Consistent
Figure 2 for Neural Neural Textures Make Sim2Real Consistent
Figure 3 for Neural Neural Textures Make Sim2Real Consistent
Figure 4 for Neural Neural Textures Make Sim2Real Consistent
Viaarxiv icon

Socratic Models: Composing Zero-Shot Multimodal Reasoning with Language

Add code
Apr 01, 2022
Figure 1 for Socratic Models: Composing Zero-Shot Multimodal Reasoning with Language
Figure 2 for Socratic Models: Composing Zero-Shot Multimodal Reasoning with Language
Figure 3 for Socratic Models: Composing Zero-Shot Multimodal Reasoning with Language
Figure 4 for Socratic Models: Composing Zero-Shot Multimodal Reasoning with Language
Viaarxiv icon

Self-supervised Video Transformer

Add code
Dec 02, 2021
Figure 1 for Self-supervised Video Transformer
Figure 2 for Self-supervised Video Transformer
Figure 3 for Self-supervised Video Transformer
Figure 4 for Self-supervised Video Transformer
Viaarxiv icon

Adaptive Intermediate Representations for Video Understanding

Add code
Apr 14, 2021
Figure 1 for Adaptive Intermediate Representations for Video Understanding
Figure 2 for Adaptive Intermediate Representations for Video Understanding
Figure 3 for Adaptive Intermediate Representations for Video Understanding
Figure 4 for Adaptive Intermediate Representations for Video Understanding
Viaarxiv icon