Michael S. Ryoo

SWAT: Spatial Structure Within and Among Tokens

Nov 26, 2021
Kumara Kahatapitiya, Michael S. Ryoo

Self-supervised Pretraining with Classification Labels for Temporal Activity Detection

Nov 26, 2021
Kumara Kahatapitiya, Zhou Ren, Haoxiang Li, Zhenyu Wu, Michael S. Ryoo

StARformer: Transformer with State-Action-Reward Representations

Oct 12, 2021
Jinghuan Shang, Michael S. Ryoo

4D-Net for Learned Multi-Modal Alignment

Sep 02, 2021
AJ Piergiovanni, Vincent Casser, Michael S. Ryoo, Anelia Angelova

Self-Supervised Disentangled Representation Learning for Third-Person Imitation Learning

Aug 02, 2021
Jinghuan Shang, Michael S. Ryoo

Unsupervised Discovery of Actions in Instructional Videos

Jun 28, 2021
AJ Piergiovanni, Anelia Angelova, Michael S. Ryoo, Irfan Essa

TokenLearner: What Can 8 Learned Tokens Do for Images and Videos?

Jun 21, 2021
Michael S. Ryoo, AJ Piergiovanni, Anurag Arnab, Mostafa Dehghani, Anelia Angelova

Unsupervised Action Segmentation for Instructional Videos

Jun 07, 2021
AJ Piergiovanni, Anelia Angelova, Michael S. Ryoo, Irfan Essa

Coarse-Fine Networks for Temporal Activity Detection in Videos

Apr 01, 2021
Kumara Kahatapitiya, Michael S. Ryoo

Recognizing Actions in Videos from Unseen Viewpoints

Mar 30, 2021
AJ Piergiovanni, Michael S. Ryoo
