Alert button
Picture for Chen Sun

Chen Sun

Alert button

Multi-modal Transformer for Video Retrieval

Add code
Bookmark button
Alert button
Jul 21, 2020
Valentin Gabeur, Chen Sun, Karteek Alahari, Cordelia Schmid

Figure 1 for Multi-modal Transformer for Video Retrieval
Figure 2 for Multi-modal Transformer for Video Retrieval
Figure 3 for Multi-modal Transformer for Video Retrieval
Figure 4 for Multi-modal Transformer for Video Retrieval
Viaarxiv icon

What makes for good views for contrastive learning

Add code
Bookmark button
Alert button
May 20, 2020
Yonglong Tian, Chen Sun, Ben Poole, Dilip Krishnan, Cordelia Schmid, Phillip Isola

Figure 1 for What makes for good views for contrastive learning
Figure 2 for What makes for good views for contrastive learning
Figure 3 for What makes for good views for contrastive learning
Figure 4 for What makes for good views for contrastive learning
Viaarxiv icon

VectorNet: Encoding HD Maps and Agent Dynamics from Vectorized Representation

Add code
Bookmark button
Alert button
May 08, 2020
Jiyang Gao, Chen Sun, Hang Zhao, Yi Shen, Dragomir Anguelov, Congcong Li, Cordelia Schmid

Figure 1 for VectorNet: Encoding HD Maps and Agent Dynamics from Vectorized Representation
Figure 2 for VectorNet: Encoding HD Maps and Agent Dynamics from Vectorized Representation
Figure 3 for VectorNet: Encoding HD Maps and Agent Dynamics from Vectorized Representation
Figure 4 for VectorNet: Encoding HD Maps and Agent Dynamics from Vectorized Representation
Viaarxiv icon

Speech2Action: Cross-modal Supervision for Action Recognition

Add code
Bookmark button
Alert button
Mar 30, 2020
Arsha Nagrani, Chen Sun, David Ross, Rahul Sukthankar, Cordelia Schmid, Andrew Zisserman

Figure 1 for Speech2Action: Cross-modal Supervision for Action Recognition
Figure 2 for Speech2Action: Cross-modal Supervision for Action Recognition
Figure 3 for Speech2Action: Cross-modal Supervision for Action Recognition
Figure 4 for Speech2Action: Cross-modal Supervision for Action Recognition
Viaarxiv icon

Unsupervised Learning of Object Structure and Dynamics from Videos

Add code
Bookmark button
Alert button
Jun 19, 2019
Matthias Minderer, Chen Sun, Ruben Villegas, Forrester Cole, Kevin Murphy, Honglak Lee

Figure 1 for Unsupervised Learning of Object Structure and Dynamics from Videos
Figure 2 for Unsupervised Learning of Object Structure and Dynamics from Videos
Figure 3 for Unsupervised Learning of Object Structure and Dynamics from Videos
Figure 4 for Unsupervised Learning of Object Structure and Dynamics from Videos
Viaarxiv icon

Contrastive Bidirectional Transformer for Temporal Representation Learning

Add code
Bookmark button
Alert button
Jun 13, 2019
Chen Sun, Fabien Baradel, Kevin Murphy, Cordelia Schmid

Figure 1 for Contrastive Bidirectional Transformer for Temporal Representation Learning
Figure 2 for Contrastive Bidirectional Transformer for Temporal Representation Learning
Figure 3 for Contrastive Bidirectional Transformer for Temporal Representation Learning
Figure 4 for Contrastive Bidirectional Transformer for Temporal Representation Learning
Viaarxiv icon

Intra-Ensemble in Neural Networks

Add code
Bookmark button
Alert button
Apr 09, 2019
Yuan Gao, Zixiang Cai, Yimin Chen, Wenke Chen, Kan Yang, Chen Sun, Cong Yao

Figure 1 for Intra-Ensemble in Neural Networks
Figure 2 for Intra-Ensemble in Neural Networks
Figure 3 for Intra-Ensemble in Neural Networks
Figure 4 for Intra-Ensemble in Neural Networks
Viaarxiv icon

Relational Action Forecasting

Add code
Bookmark button
Alert button
Apr 08, 2019
Chen Sun, Abhinav Shrivastava, Carl Vondrick, Rahul Sukthankar, Kevin Murphy, Cordelia Schmid

Figure 1 for Relational Action Forecasting
Figure 2 for Relational Action Forecasting
Figure 3 for Relational Action Forecasting
Figure 4 for Relational Action Forecasting
Viaarxiv icon

VideoBERT: A Joint Model for Video and Language Representation Learning

Add code
Bookmark button
Alert button
Apr 03, 2019
Chen Sun, Austin Myers, Carl Vondrick, Kevin Murphy, Cordelia Schmid

Figure 1 for VideoBERT: A Joint Model for Video and Language Representation Learning
Figure 2 for VideoBERT: A Joint Model for Video and Language Representation Learning
Figure 3 for VideoBERT: A Joint Model for Video and Language Representation Learning
Figure 4 for VideoBERT: A Joint Model for Video and Language Representation Learning
Viaarxiv icon

Affordance Learning In Direct Perception for Autonomous Driving

Add code
Bookmark button
Alert button
Mar 20, 2019
Chen Sun, Jean M. Uwabeza Vianney, Dongpu Cao

Figure 1 for Affordance Learning In Direct Perception for Autonomous Driving
Figure 2 for Affordance Learning In Direct Perception for Autonomous Driving
Figure 3 for Affordance Learning In Direct Perception for Autonomous Driving
Figure 4 for Affordance Learning In Direct Perception for Autonomous Driving
Viaarxiv icon