Teng Wang

UniAV: Unified Audio-Visual Perception for Multi-Task Video Localization

Apr 04, 2024
Tiantian Geng, Teng Wang, Yanfu Zhang, Jinming Duan, Weili Guan, Feng Zheng

Video Understanding with Large Language Models: A Survey

Jan 04, 2024
Yunlong Tang, Jing Bi, Siting Xu, Luchuan Song, Susan Liang, Teng Wang, Daoan Zhang, Jie An, Jingyang Lin, Rongyi Zhu, Ali Vosoughi, Chao Huang, Zeliang Zhang, Feng Zheng, Jianguo Zhang, Ping Luo, Jiebo Luo, Chenliang Xu

DGMem: Learning Visual Navigation Policy without Any Labels by Dynamic Graph Memory

Nov 30, 2023
Wenzhe Cai, Teng Wang, Guangran Cheng, Lele Xu, Changyin Sun

Knowledge-Aware Prompt Tuning for Generalizable Vision-Language Models

Aug 22, 2023
Baoshuo Kan, Teng Wang, Wenpeng Lu, Xiantong Zhen, Weili Guan, Feng Zheng

Transferable Decoding with Visual Entities for Zero-Shot Image Captioning

Jul 31, 2023
Junjie Fei, Teng Wang, Jinrui Zhang, Zhenyu He, Chengjie Wang, Feng Zheng

Set-level Guidance Attack: Boosting Adversarial Transferability of Vision-Language Pre-training Models

Jul 26, 2023
Dong Lu, Zhiqiang Wang, Teng Wang, Weili Guan, Hongchang Gao, Feng Zheng

PTVD: A Large-Scale Plot-Oriented Multimodal Dataset Based on Television Dramas

Jun 26, 2023
Chen Li, Xutan Peng, Teng Wang, Yixiao Ge, Mengyang Liu, Xuyuan Xu, Yexin Wang, Ying Shan

LLMVA-GEBC: Large Language Model with Video Adapter for Generic Event Boundary Captioning

Jun 17, 2023
Yunlong Tang, Jinrui Zhang, Xiangchen Wang, Teng Wang, Feng Zheng

Caption Anything: Interactive Image Description with Diverse Multimodal Controls

May 08, 2023
Teng Wang, Jinrui Zhang, Junjie Fei, Hao Zheng, Yunlong Tang, Zhe Li, Mingqi Gao, Shanshan Zhao
