Alert button
Picture for Han Hu

Han Hu

Alert button

ResFormer: Scaling ViTs with Multi-Resolution Training

Add code
Bookmark button
Alert button
Dec 01, 2022
Rui Tian, Zuxuan Wu, Qi Dai, Han Hu, Yu Qiao, Yu-Gang Jiang

Figure 1 for ResFormer: Scaling ViTs with Multi-Resolution Training
Figure 2 for ResFormer: Scaling ViTs with Multi-Resolution Training
Figure 3 for ResFormer: Scaling ViTs with Multi-Resolution Training
Figure 4 for ResFormer: Scaling ViTs with Multi-Resolution Training
Viaarxiv icon

SVFormer: Semi-supervised Video Transformer for Action Recognition

Add code
Bookmark button
Alert button
Nov 23, 2022
Zhen Xing, Qi Dai, Han Hu, Jingjing Chen, Zuxuan Wu, Yu-Gang Jiang

Figure 1 for SVFormer: Semi-supervised Video Transformer for Action Recognition
Figure 2 for SVFormer: Semi-supervised Video Transformer for Action Recognition
Figure 3 for SVFormer: Semi-supervised Video Transformer for Action Recognition
Figure 4 for SVFormer: Semi-supervised Video Transformer for Action Recognition
Viaarxiv icon

Exploring Discrete Diffusion Models for Image Captioning

Add code
Bookmark button
Alert button
Nov 21, 2022
Zixin Zhu, Yixuan Wei, Jianfeng Wang, Zhe Gan, Zheng Zhang, Le Wang, Gang Hua, Lijuan Wang, Zicheng Liu, Han Hu

Figure 1 for Exploring Discrete Diffusion Models for Image Captioning
Figure 2 for Exploring Discrete Diffusion Models for Image Captioning
Figure 3 for Exploring Discrete Diffusion Models for Image Captioning
Figure 4 for Exploring Discrete Diffusion Models for Image Captioning
Viaarxiv icon

ClipCrop: Conditioned Cropping Driven by Vision-Language Model

Add code
Bookmark button
Alert button
Nov 21, 2022
Zhihang Zhong, Mingxi Cheng, Zhirong Wu, Yuhui Yuan, Yinqiang Zheng, Ji Li, Han Hu, Stephen Lin, Yoichi Sato, Imari Sato

Figure 1 for ClipCrop: Conditioned Cropping Driven by Vision-Language Model
Figure 2 for ClipCrop: Conditioned Cropping Driven by Vision-Language Model
Figure 3 for ClipCrop: Conditioned Cropping Driven by Vision-Language Model
Figure 4 for ClipCrop: Conditioned Cropping Driven by Vision-Language Model
Viaarxiv icon

Could Giant Pretrained Image Models Extract Universal Representations?

Add code
Bookmark button
Alert button
Nov 03, 2022
Yutong Lin, Ze Liu, Zheng Zhang, Han Hu, Nanning Zheng, Stephen Lin, Yue Cao

Figure 1 for Could Giant Pretrained Image Models Extract Universal Representations?
Figure 2 for Could Giant Pretrained Image Models Extract Universal Representations?
Figure 3 for Could Giant Pretrained Image Models Extract Universal Representations?
Figure 4 for Could Giant Pretrained Image Models Extract Universal Representations?
Viaarxiv icon

Expediting Large-Scale Vision Transformer for Dense Prediction without Fine-tuning

Add code
Bookmark button
Alert button
Oct 03, 2022
Weicong Liang, Yuhui Yuan, Henghui Ding, Xiao Luo, Weihong Lin, Ding Jia, Zheng Zhang, Chao Zhang, Han Hu

Figure 1 for Expediting Large-Scale Vision Transformer for Dense Prediction without Fine-tuning
Figure 2 for Expediting Large-Scale Vision Transformer for Dense Prediction without Fine-tuning
Figure 3 for Expediting Large-Scale Vision Transformer for Dense Prediction without Fine-tuning
Figure 4 for Expediting Large-Scale Vision Transformer for Dense Prediction without Fine-tuning
Viaarxiv icon

One-to-Many Semantic Communication Systems: Design, Implementation, Performance Evaluation

Add code
Bookmark button
Alert button
Sep 20, 2022
Han Hu, Xingwu Zhu, Fuhui Zhou, Wei Wu, Rose Qingyang Hu, Hongbo Zhu

Figure 1 for One-to-Many Semantic Communication Systems: Design, Implementation, Performance Evaluation
Figure 2 for One-to-Many Semantic Communication Systems: Design, Implementation, Performance Evaluation
Figure 3 for One-to-Many Semantic Communication Systems: Design, Implementation, Performance Evaluation
Figure 4 for One-to-Many Semantic Communication Systems: Design, Implementation, Performance Evaluation
Viaarxiv icon

Not All Instances Contribute Equally: Instance-adaptive Class Representation Learning for Few-Shot Visual Recognition

Add code
Bookmark button
Alert button
Sep 07, 2022
Mengya Han, Yibing Zhan, Yong Luo, Bo Du, Han Hu, Yonggang Wen, Dacheng Tao

Figure 1 for Not All Instances Contribute Equally: Instance-adaptive Class Representation Learning for Few-Shot Visual Recognition
Figure 2 for Not All Instances Contribute Equally: Instance-adaptive Class Representation Learning for Few-Shot Visual Recognition
Figure 3 for Not All Instances Contribute Equally: Instance-adaptive Class Representation Learning for Few-Shot Visual Recognition
Figure 4 for Not All Instances Contribute Equally: Instance-adaptive Class Representation Learning for Few-Shot Visual Recognition
Viaarxiv icon

Leveraging GAN Priors for Few-Shot Part Segmentation

Add code
Bookmark button
Alert button
Jul 27, 2022
Mengya Han, Heliang Zheng, Chaoyue Wang, Yong Luo, Han Hu, Bo Du

Figure 1 for Leveraging GAN Priors for Few-Shot Part Segmentation
Figure 2 for Leveraging GAN Priors for Few-Shot Part Segmentation
Figure 3 for Leveraging GAN Priors for Few-Shot Part Segmentation
Figure 4 for Leveraging GAN Priors for Few-Shot Part Segmentation
Viaarxiv icon

DETRs with Hybrid Matching

Add code
Bookmark button
Alert button
Jul 26, 2022
Ding Jia, Yuhui Yuan, Haodi He, Xiaopei Wu, Haojun Yu, Weihong Lin, Lei Sun, Chao Zhang, Han Hu

Figure 1 for DETRs with Hybrid Matching
Figure 2 for DETRs with Hybrid Matching
Figure 3 for DETRs with Hybrid Matching
Figure 4 for DETRs with Hybrid Matching
Viaarxiv icon