Alert button
Picture for Gengyuan Zhang

Gengyuan Zhang

Alert button

SPOT! Revisiting Video-Language Models for Event Understanding

Add code
Bookmark button
Alert button
Dec 01, 2023
Gengyuan Zhang, Jinhe Bi, Jindong Gu, Yanyu Chen, Volker Tresp

Viaarxiv icon

Multi-event Video-Text Retrieval

Add code
Bookmark button
Alert button
Aug 22, 2023
Gengyuan Zhang, Jisen Ren, Jindong Gu, Volker Tresp

Figure 1 for Multi-event Video-Text Retrieval
Figure 2 for Multi-event Video-Text Retrieval
Figure 3 for Multi-event Video-Text Retrieval
Figure 4 for Multi-event Video-Text Retrieval
Viaarxiv icon

A Systematic Survey of Prompt Engineering on Vision-Language Foundation Models

Add code
Bookmark button
Alert button
Jul 24, 2023
Jindong Gu, Zhen Han, Shuo Chen, Ahmad Beirami, Bailan He, Gengyuan Zhang, Ruotong Liao, Yao Qin, Volker Tresp, Philip Torr

Viaarxiv icon

Can Vision-Language Models be a Good Guesser? Exploring VLMs for Times and Location Reasoning

Add code
Bookmark button
Alert button
Jul 12, 2023
Gengyuan Zhang, Yurui Zhang, Kerui Zhang, Volker Tresp

Figure 1 for Can Vision-Language Models be a Good Guesser? Exploring VLMs for Times and Location Reasoning
Figure 2 for Can Vision-Language Models be a Good Guesser? Exploring VLMs for Times and Location Reasoning
Figure 3 for Can Vision-Language Models be a Good Guesser? Exploring VLMs for Times and Location Reasoning
Figure 4 for Can Vision-Language Models be a Good Guesser? Exploring VLMs for Times and Location Reasoning
Viaarxiv icon

CL-CrossVQA: A Continual Learning Benchmark for Cross-Domain Visual Question Answering

Add code
Bookmark button
Alert button
Nov 19, 2022
Yao Zhang, Haokun Chen, Ahmed Frikha, Yezi Yang, Denis Krompass, Gengyuan Zhang, Jindong Gu, Volker Tresp

Figure 1 for CL-CrossVQA: A Continual Learning Benchmark for Cross-Domain Visual Question Answering
Figure 2 for CL-CrossVQA: A Continual Learning Benchmark for Cross-Domain Visual Question Answering
Figure 3 for CL-CrossVQA: A Continual Learning Benchmark for Cross-Domain Visual Question Answering
Figure 4 for CL-CrossVQA: A Continual Learning Benchmark for Cross-Domain Visual Question Answering
Viaarxiv icon