Yibing Song

DiffusionDet: Diffusion Model for Object Detection

Nov 17, 2022

One Model to Edit Them All: Free-Form Text-Driven Image Manipulation with Semantic Modulations

Oct 17, 2022

AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition

May 26, 2022

Self-supervised Learning of Adversarial Example: Towards Good Generalizations for Deepfake Detection

Apr 01, 2022

VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training

Mar 23, 2022

DynaMixer: A Vision MLP Architecture with Dynamic Mixing

Feb 16, 2022

Not All Patches are What You Need: Expediting Vision Transformers via Token Reorganizations

Feb 16, 2022

MetaDance: Few-shot Dancing Video Retargeting via Temporal-aware Meta-learning

Jan 13, 2022

Revitalizing CNN Attentions via Transformers in Self-Supervised Visual Representation Learning

Oct 11, 2021

PD-GAN: Probabilistic Diverse GAN for Image Inpainting

May 05, 2021