Picture for Josef Sivic

Josef Sivic

Estimating 3D Motion and Forces of Human-Object Interactions from Internet Videos

Add code
Nov 02, 2021
Figure 1 for Estimating 3D Motion and Forces of Human-Object Interactions from Internet Videos
Figure 2 for Estimating 3D Motion and Forces of Human-Object Interactions from Internet Videos
Figure 3 for Estimating 3D Motion and Forces of Human-Object Interactions from Internet Videos
Figure 4 for Estimating 3D Motion and Forces of Human-Object Interactions from Internet Videos
Viaarxiv icon

Weakly Supervised Human-Object Interaction Detection in Video via Contrastive Spatiotemporal Regions

Add code
Oct 07, 2021
Figure 1 for Weakly Supervised Human-Object Interaction Detection in Video via Contrastive Spatiotemporal Regions
Figure 2 for Weakly Supervised Human-Object Interaction Detection in Video via Contrastive Spatiotemporal Regions
Figure 3 for Weakly Supervised Human-Object Interaction Detection in Video via Contrastive Spatiotemporal Regions
Figure 4 for Weakly Supervised Human-Object Interaction Detection in Video via Contrastive Spatiotemporal Regions
Viaarxiv icon

Reconstructing and grounding narrated instructional videos in 3D

Add code
Sep 10, 2021
Figure 1 for Reconstructing and grounding narrated instructional videos in 3D
Figure 2 for Reconstructing and grounding narrated instructional videos in 3D
Figure 3 for Reconstructing and grounding narrated instructional videos in 3D
Figure 4 for Reconstructing and grounding narrated instructional videos in 3D
Viaarxiv icon

Single-view robot pose and joint angle estimation via render & compare

Add code
Apr 19, 2021
Figure 1 for Single-view robot pose and joint angle estimation via render & compare
Figure 2 for Single-view robot pose and joint angle estimation via render & compare
Figure 3 for Single-view robot pose and joint angle estimation via render & compare
Figure 4 for Single-view robot pose and joint angle estimation via render & compare
Viaarxiv icon

Visualizing computation in large-scale cellular automata

Add code
Apr 01, 2021
Figure 1 for Visualizing computation in large-scale cellular automata
Figure 2 for Visualizing computation in large-scale cellular automata
Figure 3 for Visualizing computation in large-scale cellular automata
Figure 4 for Visualizing computation in large-scale cellular automata
Viaarxiv icon

Thinking Fast and Slow: Efficient Text-to-Visual Retrieval with Transformers

Add code
Mar 30, 2021
Figure 1 for Thinking Fast and Slow: Efficient Text-to-Visual Retrieval with Transformers
Figure 2 for Thinking Fast and Slow: Efficient Text-to-Visual Retrieval with Transformers
Figure 3 for Thinking Fast and Slow: Efficient Text-to-Visual Retrieval with Transformers
Figure 4 for Thinking Fast and Slow: Efficient Text-to-Visual Retrieval with Transformers
Viaarxiv icon

Just Ask: Learning to Answer Questions from Millions of Narrated Videos

Add code
Dec 01, 2020
Figure 1 for Just Ask: Learning to Answer Questions from Millions of Narrated Videos
Figure 2 for Just Ask: Learning to Answer Questions from Millions of Narrated Videos
Figure 3 for Just Ask: Learning to Answer Questions from Millions of Narrated Videos
Figure 4 for Just Ask: Learning to Answer Questions from Millions of Narrated Videos
Viaarxiv icon

Learning Object Manipulation Skills via Approximate State Estimation from Real Videos

Add code
Nov 13, 2020
Figure 1 for Learning Object Manipulation Skills via Approximate State Estimation from Real Videos
Figure 2 for Learning Object Manipulation Skills via Approximate State Estimation from Real Videos
Figure 3 for Learning Object Manipulation Skills via Approximate State Estimation from Real Videos
Figure 4 for Learning Object Manipulation Skills via Approximate State Estimation from Real Videos
Viaarxiv icon

CosyPose: Consistent multi-view multi-object 6D pose estimation

Add code
Aug 19, 2020
Figure 1 for CosyPose: Consistent multi-view multi-object 6D pose estimation
Figure 2 for CosyPose: Consistent multi-view multi-object 6D pose estimation
Figure 3 for CosyPose: Consistent multi-view multi-object 6D pose estimation
Figure 4 for CosyPose: Consistent multi-view multi-object 6D pose estimation
Viaarxiv icon

RareAct: A video dataset of unusual interactions

Add code
Aug 03, 2020
Figure 1 for RareAct: A video dataset of unusual interactions
Figure 2 for RareAct: A video dataset of unusual interactions
Figure 3 for RareAct: A video dataset of unusual interactions
Figure 4 for RareAct: A video dataset of unusual interactions
Viaarxiv icon