Alert button
Picture for Wenhao Wu

Wenhao Wu

Alert button

DetToolChain: A New Prompting Paradigm to Unleash Detection Ability of MLLM

Mar 19, 2024
Yixuan Wu, Yizhou Wang, Shixiang Tang, Wenhao Wu, Tong He, Wanli Ouyang, Jian Wu, Philip Torr

Viaarxiv icon

MetaSplit: Meta-Split Network for Limited-Stock Product Recommendation

Mar 16, 2024
Wenhao Wu, Jialiang Zhou, Ailong He, Shuguang Han, Jufeng Chen, Bo Zheng

Viaarxiv icon

GPT4Ego: Unleashing the Potential of Pre-trained Models for Zero-Shot Egocentric Action Recognition

Jan 18, 2024
Guangzhao Dai, Xiangbo Shu, Wenhao Wu

Viaarxiv icon

Deep Structure and Attention Aware Subspace Clustering

Dec 25, 2023
Wenhao Wu, Weiwei Wang, Shengjiang Kong

Viaarxiv icon

Side4Video: Spatial-Temporal Side Network for Memory-Efficient Image-to-Video Transfer Learning

Nov 27, 2023
Huanjin Yao, Wenhao Wu, Zhiheng Li

Figure 1 for Side4Video: Spatial-Temporal Side Network for Memory-Efficient Image-to-Video Transfer Learning
Figure 2 for Side4Video: Spatial-Temporal Side Network for Memory-Efficient Image-to-Video Transfer Learning
Figure 3 for Side4Video: Spatial-Temporal Side Network for Memory-Efficient Image-to-Video Transfer Learning
Figure 4 for Side4Video: Spatial-Temporal Side Network for Memory-Efficient Image-to-Video Transfer Learning
Viaarxiv icon

GPT4Vis: What Can GPT-4 Do for Zero-shot Visual Recognition?

Nov 27, 2023
Wenhao Wu, Huanjin Yao, Mengxi Zhang, Yuxin Song, Wanli Ouyang, Jingdong Wang

Figure 1 for GPT4Vis: What Can GPT-4 Do for Zero-shot Visual Recognition?
Figure 2 for GPT4Vis: What Can GPT-4 Do for Zero-shot Visual Recognition?
Figure 3 for GPT4Vis: What Can GPT-4 Do for Zero-shot Visual Recognition?
Figure 4 for GPT4Vis: What Can GPT-4 Do for Zero-shot Visual Recognition?
Viaarxiv icon

PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise Training

Sep 19, 2023
Dawei Zhu, Nan Yang, Liang Wang, Yifan Song, Wenhao Wu, Furu Wei, Sujian Li

Figure 1 for PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise Training
Figure 2 for PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise Training
Figure 3 for PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise Training
Figure 4 for PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise Training
Viaarxiv icon

What Can Simple Arithmetic Operations Do for Temporal Modeling?

Jul 18, 2023
Wenhao Wu, Yuxin Song, Zhun Sun, Jingdong Wang, Chang Xu, Wanli Ouyang

Figure 1 for What Can Simple Arithmetic Operations Do for Temporal Modeling?
Figure 2 for What Can Simple Arithmetic Operations Do for Temporal Modeling?
Figure 3 for What Can Simple Arithmetic Operations Do for Temporal Modeling?
Figure 4 for What Can Simple Arithmetic Operations Do for Temporal Modeling?
Viaarxiv icon

Cap4Video: What Can Auxiliary Captions Do for Text-Video Retrieval?

Dec 31, 2022
Wenhao Wu, Haipeng Luo, Bo Fang, Jingdong Wang, Wanli Ouyang

Figure 1 for Cap4Video: What Can Auxiliary Captions Do for Text-Video Retrieval?
Figure 2 for Cap4Video: What Can Auxiliary Captions Do for Text-Video Retrieval?
Figure 3 for Cap4Video: What Can Auxiliary Captions Do for Text-Video Retrieval?
Figure 4 for Cap4Video: What Can Auxiliary Captions Do for Text-Video Retrieval?
Viaarxiv icon