Picture for Houwen Peng

Houwen Peng

Stephen

MiniViT: Compressing Vision Transformers with Weight Multiplexing

Add code
Apr 14, 2022
Figure 1 for MiniViT: Compressing Vision Transformers with Weight Multiplexing
Figure 2 for MiniViT: Compressing Vision Transformers with Weight Multiplexing
Figure 3 for MiniViT: Compressing Vision Transformers with Weight Multiplexing
Figure 4 for MiniViT: Compressing Vision Transformers with Weight Multiplexing
Viaarxiv icon

Searching the Search Space of Vision Transformer

Add code
Nov 29, 2021
Figure 1 for Searching the Search Space of Vision Transformer
Figure 2 for Searching the Search Space of Vision Transformer
Figure 3 for Searching the Search Space of Vision Transformer
Figure 4 for Searching the Search Space of Vision Transformer
Viaarxiv icon

Learning to Track Objects from Unlabeled Videos

Add code
Aug 28, 2021
Figure 1 for Learning to Track Objects from Unlabeled Videos
Figure 2 for Learning to Track Objects from Unlabeled Videos
Figure 3 for Learning to Track Objects from Unlabeled Videos
Figure 4 for Learning to Track Objects from Unlabeled Videos
Viaarxiv icon

Rethinking and Improving Relative Position Encoding for Vision Transformer

Add code
Jul 29, 2021
Figure 1 for Rethinking and Improving Relative Position Encoding for Vision Transformer
Figure 2 for Rethinking and Improving Relative Position Encoding for Vision Transformer
Figure 3 for Rethinking and Improving Relative Position Encoding for Vision Transformer
Figure 4 for Rethinking and Improving Relative Position Encoding for Vision Transformer
Viaarxiv icon

AutoFormer: Searching Transformers for Visual Recognition

Add code
Jul 01, 2021
Figure 1 for AutoFormer: Searching Transformers for Visual Recognition
Figure 2 for AutoFormer: Searching Transformers for Visual Recognition
Figure 3 for AutoFormer: Searching Transformers for Visual Recognition
Figure 4 for AutoFormer: Searching Transformers for Visual Recognition
Viaarxiv icon

Probing Inter-modality: Visual Parsing with Self-Attention for Vision-Language Pre-training

Add code
Jun 28, 2021
Figure 1 for Probing Inter-modality: Visual Parsing with Self-Attention for Vision-Language Pre-training
Figure 2 for Probing Inter-modality: Visual Parsing with Self-Attention for Vision-Language Pre-training
Figure 3 for Probing Inter-modality: Visual Parsing with Self-Attention for Vision-Language Pre-training
Figure 4 for Probing Inter-modality: Visual Parsing with Self-Attention for Vision-Language Pre-training
Viaarxiv icon

LightTrack: Finding Lightweight Neural Networks for Object Tracking via One-Shot Architecture Search

Add code
Apr 29, 2021
Figure 1 for LightTrack: Finding Lightweight Neural Networks for Object Tracking via One-Shot Architecture Search
Figure 2 for LightTrack: Finding Lightweight Neural Networks for Object Tracking via One-Shot Architecture Search
Figure 3 for LightTrack: Finding Lightweight Neural Networks for Object Tracking via One-Shot Architecture Search
Figure 4 for LightTrack: Finding Lightweight Neural Networks for Object Tracking via One-Shot Architecture Search
Viaarxiv icon

One-Shot Neural Ensemble Architecture Search by Diversity-Guided Search Space Shrinking

Add code
Apr 01, 2021
Figure 1 for One-Shot Neural Ensemble Architecture Search by Diversity-Guided Search Space Shrinking
Figure 2 for One-Shot Neural Ensemble Architecture Search by Diversity-Guided Search Space Shrinking
Figure 3 for One-Shot Neural Ensemble Architecture Search by Diversity-Guided Search Space Shrinking
Figure 4 for One-Shot Neural Ensemble Architecture Search by Diversity-Guided Search Space Shrinking
Viaarxiv icon

Learning Spatio-Temporal Transformer for Visual Tracking

Add code
Mar 31, 2021
Viaarxiv icon

Multi-Scale 2D Temporal Adjacent Networks for Moment Localization with Natural Language

Add code
Dec 04, 2020
Figure 1 for Multi-Scale 2D Temporal Adjacent Networks for Moment Localization with Natural Language
Figure 2 for Multi-Scale 2D Temporal Adjacent Networks for Moment Localization with Natural Language
Figure 3 for Multi-Scale 2D Temporal Adjacent Networks for Moment Localization with Natural Language
Figure 4 for Multi-Scale 2D Temporal Adjacent Networks for Moment Localization with Natural Language
Viaarxiv icon