Alert button
Picture for Yibing Song

Yibing Song

Alert button

DiffusionDet: Diffusion Model for Object Detection

Add code
Bookmark button
Alert button
Nov 17, 2022
Shoufa Chen, Peize Sun, Yibing Song, Ping Luo

Figure 1 for DiffusionDet: Diffusion Model for Object Detection
Figure 2 for DiffusionDet: Diffusion Model for Object Detection
Figure 3 for DiffusionDet: Diffusion Model for Object Detection
Figure 4 for DiffusionDet: Diffusion Model for Object Detection
Viaarxiv icon

One Model to Edit Them All: Free-Form Text-Driven Image Manipulation with Semantic Modulations

Add code
Bookmark button
Alert button
Oct 17, 2022
Yiming Zhu, Hongyu Liu, Yibing Song, ziyang Yuan, Xintong Han, Chun Yuan, Qifeng Chen, Jue Wang

Figure 1 for One Model to Edit Them All: Free-Form Text-Driven Image Manipulation with Semantic Modulations
Figure 2 for One Model to Edit Them All: Free-Form Text-Driven Image Manipulation with Semantic Modulations
Figure 3 for One Model to Edit Them All: Free-Form Text-Driven Image Manipulation with Semantic Modulations
Figure 4 for One Model to Edit Them All: Free-Form Text-Driven Image Manipulation with Semantic Modulations
Viaarxiv icon

AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition

Add code
Bookmark button
Alert button
May 26, 2022
Shoufa Chen, Chongjian Ge, Zhan Tong, Jiangliu Wang, Yibing Song, Jue Wang, Ping Luo

Figure 1 for AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition
Figure 2 for AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition
Figure 3 for AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition
Figure 4 for AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition
Viaarxiv icon

Self-supervised Learning of Adversarial Example: Towards Good Generalizations for Deepfake Detection

Add code
Bookmark button
Alert button
Apr 01, 2022
Liang Chen, Yong Zhang, Yibing Song, Lingqiao Liu, Jue Wang

Figure 1 for Self-supervised Learning of Adversarial Example: Towards Good Generalizations for Deepfake Detection
Figure 2 for Self-supervised Learning of Adversarial Example: Towards Good Generalizations for Deepfake Detection
Figure 3 for Self-supervised Learning of Adversarial Example: Towards Good Generalizations for Deepfake Detection
Figure 4 for Self-supervised Learning of Adversarial Example: Towards Good Generalizations for Deepfake Detection
Viaarxiv icon

VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training

Add code
Bookmark button
Alert button
Mar 23, 2022
Zhan Tong, Yibing Song, Jue Wang, Limin Wang

Figure 1 for VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
Figure 2 for VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
Figure 3 for VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
Figure 4 for VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
Viaarxiv icon

DynaMixer: A Vision MLP Architecture with Dynamic Mixing

Add code
Bookmark button
Alert button
Feb 16, 2022
Ziyu Wang, Wenhao Jiang, Yiming Zhu, Li Yuan, Yibing Song, Wei Liu

Figure 1 for DynaMixer: A Vision MLP Architecture with Dynamic Mixing
Figure 2 for DynaMixer: A Vision MLP Architecture with Dynamic Mixing
Figure 3 for DynaMixer: A Vision MLP Architecture with Dynamic Mixing
Figure 4 for DynaMixer: A Vision MLP Architecture with Dynamic Mixing
Viaarxiv icon

Not All Patches are What You Need: Expediting Vision Transformers via Token Reorganizations

Add code
Bookmark button
Alert button
Feb 16, 2022
Youwei Liang, Chongjian Ge, Zhan Tong, Yibing Song, Jue Wang, Pengtao Xie

Figure 1 for Not All Patches are What You Need: Expediting Vision Transformers via Token Reorganizations
Figure 2 for Not All Patches are What You Need: Expediting Vision Transformers via Token Reorganizations
Figure 3 for Not All Patches are What You Need: Expediting Vision Transformers via Token Reorganizations
Figure 4 for Not All Patches are What You Need: Expediting Vision Transformers via Token Reorganizations
Viaarxiv icon

MetaDance: Few-shot Dancing Video Retargeting via Temporal-aware Meta-learning

Add code
Bookmark button
Alert button
Jan 13, 2022
Yuying Ge, Yibing Song, Ruimao Zhang, Ping Luo

Figure 1 for MetaDance: Few-shot Dancing Video Retargeting via Temporal-aware Meta-learning
Figure 2 for MetaDance: Few-shot Dancing Video Retargeting via Temporal-aware Meta-learning
Figure 3 for MetaDance: Few-shot Dancing Video Retargeting via Temporal-aware Meta-learning
Figure 4 for MetaDance: Few-shot Dancing Video Retargeting via Temporal-aware Meta-learning
Viaarxiv icon

Revitalizing CNN Attentions via Transformers in Self-Supervised Visual Representation Learning

Add code
Bookmark button
Alert button
Oct 11, 2021
Chongjian Ge, Youwei Liang, Yibing Song, Jianbo Jiao, Jue Wang, Ping Luo

Figure 1 for Revitalizing CNN Attentions via Transformers in Self-Supervised Visual Representation Learning
Figure 2 for Revitalizing CNN Attentions via Transformers in Self-Supervised Visual Representation Learning
Figure 3 for Revitalizing CNN Attentions via Transformers in Self-Supervised Visual Representation Learning
Figure 4 for Revitalizing CNN Attentions via Transformers in Self-Supervised Visual Representation Learning
Viaarxiv icon