Picture for Brian Chen

Brian Chen

EgoTV: Egocentric Task Verification from Natural Language Task Descriptions

Add code
Apr 17, 2023
Figure 1 for EgoTV: Egocentric Task Verification from Natural Language Task Descriptions
Figure 2 for EgoTV: Egocentric Task Verification from Natural Language Task Descriptions
Figure 3 for EgoTV: Egocentric Task Verification from Natural Language Task Descriptions
Figure 4 for EgoTV: Egocentric Task Verification from Natural Language Task Descriptions
Viaarxiv icon

What, when, and where? -- Self-Supervised Spatio-Temporal Grounding in Untrimmed Multi-Action Videos from Narrated Instructions

Add code
Mar 29, 2023
Figure 1 for What, when, and where? -- Self-Supervised Spatio-Temporal Grounding in Untrimmed Multi-Action Videos from Narrated Instructions
Figure 2 for What, when, and where? -- Self-Supervised Spatio-Temporal Grounding in Untrimmed Multi-Action Videos from Narrated Instructions
Figure 3 for What, when, and where? -- Self-Supervised Spatio-Temporal Grounding in Untrimmed Multi-Action Videos from Narrated Instructions
Figure 4 for What, when, and where? -- Self-Supervised Spatio-Temporal Grounding in Untrimmed Multi-Action Videos from Narrated Instructions
Viaarxiv icon

Interpretable Graph Convolutional Network of Multi-Modality Brain Imaging for Alzheimer's Disease Diagnosis

Add code
Apr 27, 2022
Figure 1 for Interpretable Graph Convolutional Network of Multi-Modality Brain Imaging for Alzheimer's Disease Diagnosis
Figure 2 for Interpretable Graph Convolutional Network of Multi-Modality Brain Imaging for Alzheimer's Disease Diagnosis
Figure 3 for Interpretable Graph Convolutional Network of Multi-Modality Brain Imaging for Alzheimer's Disease Diagnosis
Figure 4 for Interpretable Graph Convolutional Network of Multi-Modality Brain Imaging for Alzheimer's Disease Diagnosis
Viaarxiv icon

Numerical and geometrical aspects of flow-based variational quantum Monte Carlo

Add code
Mar 28, 2022
Figure 1 for Numerical and geometrical aspects of flow-based variational quantum Monte Carlo
Figure 2 for Numerical and geometrical aspects of flow-based variational quantum Monte Carlo
Figure 3 for Numerical and geometrical aspects of flow-based variational quantum Monte Carlo
Figure 4 for Numerical and geometrical aspects of flow-based variational quantum Monte Carlo
Viaarxiv icon

Everything at Once -- Multi-modal Fusion Transformer for Video Retrieval

Add code
Dec 08, 2021
Figure 1 for Everything at Once -- Multi-modal Fusion Transformer for Video Retrieval
Figure 2 for Everything at Once -- Multi-modal Fusion Transformer for Video Retrieval
Figure 3 for Everything at Once -- Multi-modal Fusion Transformer for Video Retrieval
Figure 4 for Everything at Once -- Multi-modal Fusion Transformer for Video Retrieval
Viaarxiv icon

PreViTS: Contrastive Pretraining with Video Tracking Supervision

Add code
Dec 01, 2021
Figure 1 for PreViTS: Contrastive Pretraining with Video Tracking Supervision
Figure 2 for PreViTS: Contrastive Pretraining with Video Tracking Supervision
Figure 3 for PreViTS: Contrastive Pretraining with Video Tracking Supervision
Figure 4 for PreViTS: Contrastive Pretraining with Video Tracking Supervision
Viaarxiv icon

Routing with Self-Attention for Multimodal Capsule Networks

Add code
Dec 01, 2021
Figure 1 for Routing with Self-Attention for Multimodal Capsule Networks
Figure 2 for Routing with Self-Attention for Multimodal Capsule Networks
Figure 3 for Routing with Self-Attention for Multimodal Capsule Networks
Figure 4 for Routing with Self-Attention for Multimodal Capsule Networks
Viaarxiv icon

Cascaded Multilingual Audio-Visual Learning from Videos

Add code
Nov 08, 2021
Figure 1 for Cascaded Multilingual Audio-Visual Learning from Videos
Figure 2 for Cascaded Multilingual Audio-Visual Learning from Videos
Figure 3 for Cascaded Multilingual Audio-Visual Learning from Videos
Figure 4 for Cascaded Multilingual Audio-Visual Learning from Videos
Viaarxiv icon

Joint Multimedia Event Extraction from Video and Article

Add code
Sep 27, 2021
Figure 1 for Joint Multimedia Event Extraction from Video and Article
Figure 2 for Joint Multimedia Event Extraction from Video and Article
Figure 3 for Joint Multimedia Event Extraction from Video and Article
Figure 4 for Joint Multimedia Event Extraction from Video and Article
Viaarxiv icon

Multimodal Clustering Networks for Self-supervised Learning from Unlabeled Videos

Add code
May 05, 2021
Figure 1 for Multimodal Clustering Networks for Self-supervised Learning from Unlabeled Videos
Figure 2 for Multimodal Clustering Networks for Self-supervised Learning from Unlabeled Videos
Figure 3 for Multimodal Clustering Networks for Self-supervised Learning from Unlabeled Videos
Figure 4 for Multimodal Clustering Networks for Self-supervised Learning from Unlabeled Videos
Viaarxiv icon