Wenze Hu

Guiding Instruction-based Image Editing via Multimodal Large Language Models

Sep 29, 2023
Tsu-Jui Fu, Wenze Hu, Xianzhi Du, William Yang Wang, Yinfei Yang, Zhe Gan

Million-scale Object Detection with Large Vision Model

Dec 19, 2022
Feng Lin, Wenze Hu, Yaowei Wang, Yonghong Tian, Guangming Lu, Fanglin Chen, Yong Xu, Xiaoyu Wang

NAR-Former: Neural Architecture Representation Learning towards Holistic Attributes Prediction

Nov 15, 2022
Yun Yi, Haokui Zhang, Wenze Hu, Nannan Wang, Xiaoyu Wang

CabViT: Cross Attention among Blocks for Vision Transformer

Nov 14, 2022
Haokui Zhang, Wenze Hu, Xiaoyu Wang

ParCNetV2: Oversized Kernel with Enhanced Attention

Nov 14, 2022
Ruihan Xu, Haokui Zhang, Wenze Hu, Shiliang Zhang, Xiaoyu Wang

Fast-ParC: Position Aware Global Kernel for ConvNets and ViTs

Oct 08, 2022
Tao Yang, Haokui Zhang, Wenze Hu, Changwen Chen, Xiaoyu Wang

ALBench: A Framework for Evaluating Active Learning in Object Detection

Aug 10, 2022
Zhanpeng Feng, Shiliang Zhang, Rinyoichi Takezoe, Wenze Hu, Manmohan Chandraker, Li-Jia Li, Vijay K. Narayanan, Xiaoyu Wang

Implementation of an Automated Learning System for Non-experts

Mar 26, 2022
Phoenix X. Huang, Zhiwei Zhao, Chao Liu, Jingyi Liu, Wenze Hu, Xiaoyu Wang

EdgeFormer: Improving Light-weight ConvNets by Learning from Vision Transformers

Mar 15, 2022
Haokui Zhang, Wenze Hu, Xiaoyu Wang
