Picture for Matt Feiszli

Matt Feiszli

MINOTAUR: Multi-task Video Grounding From Multimodal Queries

Add code
Feb 16, 2023
Figure 1 for MINOTAUR: Multi-task Video Grounding From Multimodal Queries
Figure 2 for MINOTAUR: Multi-task Video Grounding From Multimodal Queries
Figure 3 for MINOTAUR: Multi-task Video Grounding From Multimodal Queries
Figure 4 for MINOTAUR: Multi-task Video Grounding From Multimodal Queries
Viaarxiv icon

EgoTracks: A Long-term Egocentric Visual Object Tracking Dataset

Add code
Jan 11, 2023
Viaarxiv icon

Open-World Instance Segmentation: Exploiting Pseudo Ground Truth From Learned Pairwise Affinity

Add code
Apr 12, 2022
Figure 1 for Open-World Instance Segmentation: Exploiting Pseudo Ground Truth From Learned Pairwise Affinity
Figure 2 for Open-World Instance Segmentation: Exploiting Pseudo Ground Truth From Learned Pairwise Affinity
Figure 3 for Open-World Instance Segmentation: Exploiting Pseudo Ground Truth From Learned Pairwise Affinity
Figure 4 for Open-World Instance Segmentation: Exploiting Pseudo Ground Truth From Learned Pairwise Affinity
Viaarxiv icon

GEB+: A benchmark for generic event boundary captioning, grounding and text-based retrieval

Add code
Apr 10, 2022
Figure 1 for GEB+: A benchmark for generic event boundary captioning, grounding and text-based retrieval
Figure 2 for GEB+: A benchmark for generic event boundary captioning, grounding and text-based retrieval
Figure 3 for GEB+: A benchmark for generic event boundary captioning, grounding and text-based retrieval
Figure 4 for GEB+: A benchmark for generic event boundary captioning, grounding and text-based retrieval
Viaarxiv icon

PyTorchVideo: A Deep Learning Library for Video Understanding

Add code
Nov 18, 2021
Figure 1 for PyTorchVideo: A Deep Learning Library for Video Understanding
Figure 2 for PyTorchVideo: A Deep Learning Library for Video Understanding
Figure 3 for PyTorchVideo: A Deep Learning Library for Video Understanding
Viaarxiv icon

Searching for Two-Stream Models in Multivariate Space for Video Recognition

Add code
Aug 30, 2021
Figure 1 for Searching for Two-Stream Models in Multivariate Space for Video Recognition
Figure 2 for Searching for Two-Stream Models in Multivariate Space for Video Recognition
Figure 3 for Searching for Two-Stream Models in Multivariate Space for Video Recognition
Figure 4 for Searching for Two-Stream Models in Multivariate Space for Video Recognition
Viaarxiv icon

Unidentified Video Objects: A Benchmark for Dense, Open-World Segmentation

Add code
Apr 10, 2021
Figure 1 for Unidentified Video Objects: A Benchmark for Dense, Open-World Segmentation
Figure 2 for Unidentified Video Objects: A Benchmark for Dense, Open-World Segmentation
Figure 3 for Unidentified Video Objects: A Benchmark for Dense, Open-World Segmentation
Figure 4 for Unidentified Video Objects: A Benchmark for Dense, Open-World Segmentation
Viaarxiv icon

Generic Event Boundary Detection: A Benchmark for Event Segmentation

Add code
Jan 26, 2021
Figure 1 for Generic Event Boundary Detection: A Benchmark for Event Segmentation
Figure 2 for Generic Event Boundary Detection: A Benchmark for Event Segmentation
Figure 3 for Generic Event Boundary Detection: A Benchmark for Event Segmentation
Figure 4 for Generic Event Boundary Detection: A Benchmark for Event Segmentation
Viaarxiv icon

FP-NAS: Fast Probabilistic Neural Architecture Search

Add code
Nov 24, 2020
Figure 1 for FP-NAS: Fast Probabilistic Neural Architecture Search
Figure 2 for FP-NAS: Fast Probabilistic Neural Architecture Search
Figure 3 for FP-NAS: Fast Probabilistic Neural Architecture Search
Figure 4 for FP-NAS: Fast Probabilistic Neural Architecture Search
Viaarxiv icon

SF-Net: Single-Frame Supervision for Temporal Action Localization

Add code
Mar 20, 2020
Viaarxiv icon