Tengda Han

Stale Diffusion: Hyper-realistic 5D Movie Generation Using Old-school Methods

Apr 01, 2024
Joao F. Henriques, Dylan Campbell, Tengda Han

A Strong Baseline for Temporal Video-Text Alignment

Dec 21, 2023
Zeqian Li, Qirui Chen, Tengda Han, Ya Zhang, Yanfeng Wang, Weidi Xie

AutoAD II: The Sequel -- Who, When, and What in Movie Audio Description

Oct 10, 2023
Tengda Han, Max Bain, Arsha Nagrani, Gül Varol, Weidi Xie, Andrew Zisserman

Semantic Counting from Self-Collages

Jul 17, 2023
Lukas Knobel, Tengda Han, Yuki M. Asano

Open-world Text-specified Object Counting

Jun 02, 2023
Niki Amini-Naieni, Kiana Amini-Naieni, Tengda Han, Andrew Zisserman

AutoAD: Movie Description in Context

Mar 29, 2023
Tengda Han, Max Bain, Arsha Nagrani, Gül Varol, Weidi Xie, Andrew Zisserman

WhisperX: Time-Accurate Speech Transcription of Long-Form Audio

Mar 01, 2023
Max Bain, Jaesung Huh, Tengda Han, Andrew Zisserman

Prompt Generation Networks for Efficient Adaptation of Frozen Vision Transformers

Oct 12, 2022
Jochem Loedeman, Maarten C. Stol, Tengda Han, Yuki M. Asano

Turbo Training with Token Dropout

Oct 10, 2022
Tengda Han, Weidi Xie, Andrew Zisserman

Flamingo: a Visual Language Model for Few-Shot Learning

Apr 29, 2022
Jean-Baptiste Alayrac, Jeff Donahue, Pauline Luc, Antoine Miech, Iain Barr, Yana Hasson, Karel Lenc, Arthur Mensch, Katie Millican, Malcolm Reynolds, Roman Ring, Eliza Rutherford, Serkan Cabi, Tengda Han, Zhitao Gong, Sina Samangooei, Marianne Monteiro, Jacob Menick, Sebastian Borgeaud, Andrew Brock, Aida Nematzadeh, Sahand Sharifzadeh, Mikolaj Binkowski, Ricardo Barreira, Oriol Vinyals, Andrew Zisserman, Karen Simonyan
