Picture for Linchao Zhu

Linchao Zhu

Temporal Perceiving Video-Language Pre-training

Add code
Jan 18, 2023
Viaarxiv icon

Discriminative Radial Domain Adaptation

Add code
Jan 01, 2023
Figure 1 for Discriminative Radial Domain Adaptation
Figure 2 for Discriminative Radial Domain Adaptation
Figure 3 for Discriminative Radial Domain Adaptation
Figure 4 for Discriminative Radial Domain Adaptation
Viaarxiv icon

MIST: Multi-modal Iterative Spatial-Temporal Transformer for Long-form Video Question Answering

Add code
Dec 19, 2022
Viaarxiv icon

Slimmable Networks for Contrastive Self-supervised Learning

Add code
Sep 30, 2022
Figure 1 for Slimmable Networks for Contrastive Self-supervised Learning
Figure 2 for Slimmable Networks for Contrastive Self-supervised Learning
Figure 3 for Slimmable Networks for Contrastive Self-supervised Learning
Figure 4 for Slimmable Networks for Contrastive Self-supervised Learning
Viaarxiv icon

AFE-CNN: 3D Skeleton-based Action Recognition with Action Feature Enhancement

Add code
Aug 06, 2022
Figure 1 for AFE-CNN: 3D Skeleton-based Action Recognition with Action Feature Enhancement
Figure 2 for AFE-CNN: 3D Skeleton-based Action Recognition with Action Feature Enhancement
Figure 3 for AFE-CNN: 3D Skeleton-based Action Recognition with Action Feature Enhancement
Figure 4 for AFE-CNN: 3D Skeleton-based Action Recognition with Action Feature Enhancement
Viaarxiv icon

Fine-Grained Semantically Aligned Vision-Language Pre-Training

Add code
Aug 04, 2022
Figure 1 for Fine-Grained Semantically Aligned Vision-Language Pre-Training
Figure 2 for Fine-Grained Semantically Aligned Vision-Language Pre-Training
Figure 3 for Fine-Grained Semantically Aligned Vision-Language Pre-Training
Figure 4 for Fine-Grained Semantically Aligned Vision-Language Pre-Training
Viaarxiv icon

Dilated Context Integrated Network with Cross-Modal Consensus for Temporal Emotion Localization in Videos

Add code
Aug 03, 2022
Figure 1 for Dilated Context Integrated Network with Cross-Modal Consensus for Temporal Emotion Localization in Videos
Figure 2 for Dilated Context Integrated Network with Cross-Modal Consensus for Temporal Emotion Localization in Videos
Figure 3 for Dilated Context Integrated Network with Cross-Modal Consensus for Temporal Emotion Localization in Videos
Figure 4 for Dilated Context Integrated Network with Cross-Modal Consensus for Temporal Emotion Localization in Videos
Viaarxiv icon

PoseGU: 3D Human Pose Estimation with Novel Human Pose Generator and Unbiased Learning

Add code
Jul 07, 2022
Figure 1 for PoseGU: 3D Human Pose Estimation with Novel Human Pose Generator and Unbiased Learning
Figure 2 for PoseGU: 3D Human Pose Estimation with Novel Human Pose Generator and Unbiased Learning
Figure 3 for PoseGU: 3D Human Pose Estimation with Novel Human Pose Generator and Unbiased Learning
Figure 4 for PoseGU: 3D Human Pose Estimation with Novel Human Pose Generator and Unbiased Learning
Viaarxiv icon

CenterCLIP: Token Clustering for Efficient Text-Video Retrieval

Add code
May 02, 2022
Figure 1 for CenterCLIP: Token Clustering for Efficient Text-Video Retrieval
Figure 2 for CenterCLIP: Token Clustering for Efficient Text-Video Retrieval
Figure 3 for CenterCLIP: Token Clustering for Efficient Text-Video Retrieval
Figure 4 for CenterCLIP: Token Clustering for Efficient Text-Video Retrieval
Viaarxiv icon

Unified Transformer Tracker for Object Tracking

Add code
Mar 29, 2022
Figure 1 for Unified Transformer Tracker for Object Tracking
Figure 2 for Unified Transformer Tracker for Object Tracking
Figure 3 for Unified Transformer Tracker for Object Tracking
Figure 4 for Unified Transformer Tracker for Object Tracking
Viaarxiv icon