Picture for Jitendra Malik

Jitendra Malik

Interactive Task Planning with Language Models

Add code
Oct 16, 2023
Viaarxiv icon

What Matters to You? Towards Visual Representation Alignment for Robot Learning

Add code
Oct 11, 2023
Figure 1 for What Matters to You? Towards Visual Representation Alignment for Robot Learning
Figure 2 for What Matters to You? Towards Visual Representation Alignment for Robot Learning
Figure 3 for What Matters to You? Towards Visual Representation Alignment for Robot Learning
Figure 4 for What Matters to You? Towards Visual Representation Alignment for Robot Learning
Viaarxiv icon

Conformal Decision Theory: Safe Autonomous Decisions from Imperfect Predictions

Add code
Oct 10, 2023
Figure 1 for Conformal Decision Theory: Safe Autonomous Decisions from Imperfect Predictions
Figure 2 for Conformal Decision Theory: Safe Autonomous Decisions from Imperfect Predictions
Figure 3 for Conformal Decision Theory: Safe Autonomous Decisions from Imperfect Predictions
Figure 4 for Conformal Decision Theory: Safe Autonomous Decisions from Imperfect Predictions
Viaarxiv icon

General In-Hand Object Rotation with Vision and Touch

Add code
Sep 28, 2023
Viaarxiv icon

Learning Vision-based Pursuit-Evasion Robot Policies

Add code
Aug 30, 2023
Viaarxiv icon

EgoSchema: A Diagnostic Benchmark for Very Long-form Video Language Understanding

Add code
Aug 17, 2023
Figure 1 for EgoSchema: A Diagnostic Benchmark for Very Long-form Video Language Understanding
Figure 2 for EgoSchema: A Diagnostic Benchmark for Very Long-form Video Language Understanding
Figure 3 for EgoSchema: A Diagnostic Benchmark for Very Long-form Video Language Understanding
Figure 4 for EgoSchema: A Diagnostic Benchmark for Very Long-form Video Language Understanding
Viaarxiv icon

Robot Learning with Sensorimotor Pre-training

Add code
Jun 16, 2023
Viaarxiv icon

Learning Space-Time Semantic Correspondences

Add code
Jun 16, 2023
Figure 1 for Learning Space-Time Semantic Correspondences
Figure 2 for Learning Space-Time Semantic Correspondences
Figure 3 for Learning Space-Time Semantic Correspondences
Figure 4 for Learning Space-Time Semantic Correspondences
Viaarxiv icon

Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles

Add code
Jun 01, 2023
Figure 1 for Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles
Figure 2 for Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles
Figure 3 for Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles
Figure 4 for Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles
Viaarxiv icon

Humans in 4D: Reconstructing and Tracking Humans with Transformers

Add code
May 31, 2023
Viaarxiv icon