Mathew Monfort

A Framework for Efficient Model Evaluation through Stratification, Sampling, and Estimation

Jun 11, 2024
Via arXiv

Spoken Moments: Learning Joint Audio-Visual Representations from Video Descriptions

May 10, 2021
Via arXiv

We Have So Much In Common: Modeling Semantic Relational Set Abstractions in Videos

Aug 12, 2020
Via arXiv

Multi-Moments in Time: Learning and Interpreting Models for Multi-Action Video Understanding

Nov 04, 2019
Via arXiv

Reasoning About Human-Object Interactions Through Dual Attention Networks

Sep 10, 2019
Via arXiv

Multi-Agent Tensor Fusion for Contextual Trajectory Prediction

Apr 09, 2019
Via arXiv

Moments in Time Dataset: one million videos for event understanding

Jan 09, 2018
Via arXiv

End to End Learning for Self-Driving Cars

Apr 25, 2016
Via arXiv