Picture for Fabian Caba Heilbron

Fabian Caba Heilbron

MAD: A Scalable Dataset for Language Grounding in Videos from Movie Audio Descriptions

Add code
Dec 01, 2021
Figure 1 for MAD: A Scalable Dataset for Language Grounding in Videos from Movie Audio Descriptions
Figure 2 for MAD: A Scalable Dataset for Language Grounding in Videos from Movie Audio Descriptions
Figure 3 for MAD: A Scalable Dataset for Language Grounding in Videos from Movie Audio Descriptions
Figure 4 for MAD: A Scalable Dataset for Language Grounding in Videos from Movie Audio Descriptions
Viaarxiv icon

MovieCuts: A New Dataset and Benchmark for Cut Type Recognition

Add code
Sep 19, 2021
Figure 1 for MovieCuts: A New Dataset and Benchmark for Cut Type Recognition
Figure 2 for MovieCuts: A New Dataset and Benchmark for Cut Type Recognition
Figure 3 for MovieCuts: A New Dataset and Benchmark for Cut Type Recognition
Figure 4 for MovieCuts: A New Dataset and Benchmark for Cut Type Recognition
Viaarxiv icon

Learning to Cut by Watching Movies

Add code
Aug 09, 2021
Figure 1 for Learning to Cut by Watching Movies
Figure 2 for Learning to Cut by Watching Movies
Figure 3 for Learning to Cut by Watching Movies
Figure 4 for Learning to Cut by Watching Movies
Viaarxiv icon

Transcript to Video: Efficient Clip Sequencing from Texts

Add code
Jul 25, 2021
Figure 1 for Transcript to Video: Efficient Clip Sequencing from Texts
Figure 2 for Transcript to Video: Efficient Clip Sequencing from Texts
Figure 3 for Transcript to Video: Efficient Clip Sequencing from Texts
Figure 4 for Transcript to Video: Efficient Clip Sequencing from Texts
Viaarxiv icon

APES: Audiovisual Person Search in Untrimmed Video

Add code
Jun 03, 2021
Figure 1 for APES: Audiovisual Person Search in Untrimmed Video
Figure 2 for APES: Audiovisual Person Search in Untrimmed Video
Figure 3 for APES: Audiovisual Person Search in Untrimmed Video
Figure 4 for APES: Audiovisual Person Search in Untrimmed Video
Viaarxiv icon

MAAS: Multi-modal Assignation for Active Speaker Detection

Add code
Jan 11, 2021
Figure 1 for MAAS: Multi-modal Assignation for Active Speaker Detection
Figure 2 for MAAS: Multi-modal Assignation for Active Speaker Detection
Figure 3 for MAAS: Multi-modal Assignation for Active Speaker Detection
Figure 4 for MAAS: Multi-modal Assignation for Active Speaker Detection
Viaarxiv icon

Real-time Semantic Segmentation with Fast Attention

Add code
Jul 09, 2020
Figure 1 for Real-time Semantic Segmentation with Fast Attention
Figure 2 for Real-time Semantic Segmentation with Fast Attention
Figure 3 for Real-time Semantic Segmentation with Fast Attention
Figure 4 for Real-time Semantic Segmentation with Fast Attention
Viaarxiv icon

Active Speakers in Context

Add code
May 20, 2020
Figure 1 for Active Speakers in Context
Figure 2 for Active Speakers in Context
Figure 3 for Active Speakers in Context
Figure 4 for Active Speakers in Context
Viaarxiv icon

Temporally Distributed Networks for Fast Video Semantic Segmentation

Add code
Apr 07, 2020
Figure 1 for Temporally Distributed Networks for Fast Video Semantic Segmentation
Figure 2 for Temporally Distributed Networks for Fast Video Semantic Segmentation
Figure 3 for Temporally Distributed Networks for Fast Video Semantic Segmentation
Figure 4 for Temporally Distributed Networks for Fast Video Semantic Segmentation
Viaarxiv icon

Rethinking Online Action Detection in Untrimmed Videos: A Novel Online Evaluation Protocol

Add code
Mar 26, 2020
Figure 1 for Rethinking Online Action Detection in Untrimmed Videos: A Novel Online Evaluation Protocol
Figure 2 for Rethinking Online Action Detection in Untrimmed Videos: A Novel Online Evaluation Protocol
Figure 3 for Rethinking Online Action Detection in Untrimmed Videos: A Novel Online Evaluation Protocol
Figure 4 for Rethinking Online Action Detection in Untrimmed Videos: A Novel Online Evaluation Protocol
Viaarxiv icon