Alert button
Picture for Josef Sivic

Josef Sivic

Alert button

Learning to Answer Visual Questions from Web Videos

Add code
Bookmark button
Alert button
May 11, 2022
Antoine Yang, Antoine Miech, Josef Sivic, Ivan Laptev, Cordelia Schmid

Figure 1 for Learning to Answer Visual Questions from Web Videos
Figure 2 for Learning to Answer Visual Questions from Web Videos
Figure 3 for Learning to Answer Visual Questions from Web Videos
Figure 4 for Learning to Answer Visual Questions from Web Videos
Viaarxiv icon

Focal Length and Object Pose Estimation via Render and Compare

Add code
Bookmark button
Alert button
Apr 11, 2022
Georgy Ponimatkin, Yann Labbé, Bryan Russell, Mathieu Aubry, Josef Sivic

Figure 1 for Focal Length and Object Pose Estimation via Render and Compare
Figure 2 for Focal Length and Object Pose Estimation via Render and Compare
Figure 3 for Focal Length and Object Pose Estimation via Render and Compare
Figure 4 for Focal Length and Object Pose Estimation via Render and Compare
Viaarxiv icon

TubeDETR: Spatio-Temporal Video Grounding with Transformers

Add code
Bookmark button
Alert button
Mar 30, 2022
Antoine Yang, Antoine Miech, Josef Sivic, Ivan Laptev, Cordelia Schmid

Figure 1 for TubeDETR: Spatio-Temporal Video Grounding with Transformers
Figure 2 for TubeDETR: Spatio-Temporal Video Grounding with Transformers
Figure 3 for TubeDETR: Spatio-Temporal Video Grounding with Transformers
Figure 4 for TubeDETR: Spatio-Temporal Video Grounding with Transformers
Viaarxiv icon

Look for the Change: Learning Object States and State-Modifying Actions from Untrimmed Web Videos

Add code
Bookmark button
Alert button
Mar 22, 2022
Tomáš Souček, Jean-Baptiste Alayrac, Antoine Miech, Ivan Laptev, Josef Sivic

Figure 1 for Look for the Change: Learning Object States and State-Modifying Actions from Untrimmed Web Videos
Figure 2 for Look for the Change: Learning Object States and State-Modifying Actions from Untrimmed Web Videos
Figure 3 for Look for the Change: Learning Object States and State-Modifying Actions from Untrimmed Web Videos
Figure 4 for Look for the Change: Learning Object States and State-Modifying Actions from Untrimmed Web Videos
Viaarxiv icon

Drive&Segment: Unsupervised Semantic Segmentation of Urban Scenes via Cross-modal Distillation

Add code
Bookmark button
Alert button
Mar 21, 2022
Antonin Vobecky, David Hurych, Oriane Siméoni, Spyros Gidaris, Andrei Bursuc, Patrick Pérez, Josef Sivic

Figure 1 for Drive&Segment: Unsupervised Semantic Segmentation of Urban Scenes via Cross-modal Distillation
Figure 2 for Drive&Segment: Unsupervised Semantic Segmentation of Urban Scenes via Cross-modal Distillation
Figure 3 for Drive&Segment: Unsupervised Semantic Segmentation of Urban Scenes via Cross-modal Distillation
Figure 4 for Drive&Segment: Unsupervised Semantic Segmentation of Urban Scenes via Cross-modal Distillation
Viaarxiv icon

Learning to Manipulate Tools by Aligning Simulation to Video Demonstration

Add code
Bookmark button
Alert button
Nov 04, 2021
Kateryna Zorina, Justin Carpentier, Josef Sivic, Vladimír Petrík

Figure 1 for Learning to Manipulate Tools by Aligning Simulation to Video Demonstration
Figure 2 for Learning to Manipulate Tools by Aligning Simulation to Video Demonstration
Figure 3 for Learning to Manipulate Tools by Aligning Simulation to Video Demonstration
Figure 4 for Learning to Manipulate Tools by Aligning Simulation to Video Demonstration
Viaarxiv icon

Estimating 3D Motion and Forces of Human-Object Interactions from Internet Videos

Add code
Bookmark button
Alert button
Nov 02, 2021
Zongmian Li, Jiri Sedlar, Justin Carpentier, Ivan Laptev, Nicolas Mansard, Josef Sivic

Figure 1 for Estimating 3D Motion and Forces of Human-Object Interactions from Internet Videos
Figure 2 for Estimating 3D Motion and Forces of Human-Object Interactions from Internet Videos
Figure 3 for Estimating 3D Motion and Forces of Human-Object Interactions from Internet Videos
Figure 4 for Estimating 3D Motion and Forces of Human-Object Interactions from Internet Videos
Viaarxiv icon

Weakly Supervised Human-Object Interaction Detection in Video via Contrastive Spatiotemporal Regions

Add code
Bookmark button
Alert button
Oct 07, 2021
Shuang Li, Yilun Du, Antonio Torralba, Josef Sivic, Bryan Russell

Figure 1 for Weakly Supervised Human-Object Interaction Detection in Video via Contrastive Spatiotemporal Regions
Figure 2 for Weakly Supervised Human-Object Interaction Detection in Video via Contrastive Spatiotemporal Regions
Figure 3 for Weakly Supervised Human-Object Interaction Detection in Video via Contrastive Spatiotemporal Regions
Figure 4 for Weakly Supervised Human-Object Interaction Detection in Video via Contrastive Spatiotemporal Regions
Viaarxiv icon

Reconstructing and grounding narrated instructional videos in 3D

Add code
Bookmark button
Alert button
Sep 10, 2021
Dimitri Zhukov, Ignacio Rocco, Ivan Laptev, Josef Sivic, Johannes L. Schönberger, Bugra Tekin, Marc Pollefeys

Figure 1 for Reconstructing and grounding narrated instructional videos in 3D
Figure 2 for Reconstructing and grounding narrated instructional videos in 3D
Figure 3 for Reconstructing and grounding narrated instructional videos in 3D
Figure 4 for Reconstructing and grounding narrated instructional videos in 3D
Viaarxiv icon

Single-view robot pose and joint angle estimation via render & compare

Add code
Bookmark button
Alert button
Apr 19, 2021
Yann Labbé, Justin Carpentier, Mathieu Aubry, Josef Sivic

Figure 1 for Single-view robot pose and joint angle estimation via render & compare
Figure 2 for Single-view robot pose and joint angle estimation via render & compare
Figure 3 for Single-view robot pose and joint angle estimation via render & compare
Figure 4 for Single-view robot pose and joint angle estimation via render & compare
Viaarxiv icon