Alert button
Picture for Long Zhao

Long Zhao

Alert button

VideoPrism: A Foundational Visual Encoder for Video Understanding

Add code
Bookmark button
Alert button
Feb 20, 2024
Long Zhao, Nitesh B. Gundavarapu, Liangzhe Yuan, Hao Zhou, Shen Yan, Jennifer J. Sun, Luke Friedman, Rui Qian, Tobias Weyand, Yue Zhao, Rachel Hornung, Florian Schroff, Ming-Hsuan Yang, David A. Ross, Huisheng Wang, Hartwig Adam, Mikhail Sirotenko, Ting Liu, Boqing Gong

Viaarxiv icon

Distilling Vision-Language Models on Millions of Videos

Add code
Bookmark button
Alert button
Jan 11, 2024
Yue Zhao, Long Zhao, Xingyi Zhou, Jialin Wu, Chun-Te Chu, Hui Miao, Florian Schroff, Hartwig Adam, Ting Liu, Boqing Gong, Philipp Krähenbühl, Liangzhe Yuan

Viaarxiv icon

Generating Enhanced Negatives for Training Language-Based Object Detectors

Add code
Bookmark button
Alert button
Dec 29, 2023
Shiyu Zhao, Long Zhao, Vijay Kumar B. G, Yumin Suh, Dimitris N. Metaxas, Manmohan Chandraker, Samuel Schulter

Viaarxiv icon

Deep Deformable Models: Learning 3D Shape Abstractions with Part Consistency

Add code
Bookmark button
Alert button
Sep 02, 2023
Di Liu, Long Zhao, Qilong Zhangli, Yunhe Gao, Ting Liu, Dimitris N. Metaxas

Figure 1 for Deep Deformable Models: Learning 3D Shape Abstractions with Part Consistency
Figure 2 for Deep Deformable Models: Learning 3D Shape Abstractions with Part Consistency
Figure 3 for Deep Deformable Models: Learning 3D Shape Abstractions with Part Consistency
Figure 4 for Deep Deformable Models: Learning 3D Shape Abstractions with Part Consistency
Viaarxiv icon

Learning from Semantic Alignment between Unpaired Multiviews for Egocentric Video Recognition

Add code
Bookmark button
Alert button
Aug 23, 2023
Qitong Wang, Long Zhao, Liangzhe Yuan, Ting Liu, Xi Peng

Figure 1 for Learning from Semantic Alignment between Unpaired Multiviews for Egocentric Video Recognition
Figure 2 for Learning from Semantic Alignment between Unpaired Multiviews for Egocentric Video Recognition
Figure 3 for Learning from Semantic Alignment between Unpaired Multiviews for Egocentric Video Recognition
Figure 4 for Learning from Semantic Alignment between Unpaired Multiviews for Egocentric Video Recognition
Viaarxiv icon

Improving Pseudo Labels for Open-Vocabulary Object Detection

Add code
Bookmark button
Alert button
Aug 11, 2023
Shiyu Zhao, Samuel Schulter, Long Zhao, Zhixing Zhang, Vijay Kumar B. G, Yumin Suh, Manmohan Chandraker, Dimitris N. Metaxas

Figure 1 for Improving Pseudo Labels for Open-Vocabulary Object Detection
Figure 2 for Improving Pseudo Labels for Open-Vocabulary Object Detection
Figure 3 for Improving Pseudo Labels for Open-Vocabulary Object Detection
Figure 4 for Improving Pseudo Labels for Open-Vocabulary Object Detection
Viaarxiv icon

VideoGLUE: Video General Understanding Evaluation of Foundation Models

Add code
Bookmark button
Alert button
Jul 06, 2023
Liangzhe Yuan, Nitesh Bharadwaj Gundavarapu, Long Zhao, Hao Zhou, Yin Cui, Lu Jiang, Xuan Yang, Menglin Jia, Tobias Weyand, Luke Friedman, Mikhail Sirotenko, Huisheng Wang, Florian Schroff, Hartwig Adam, Ming-Hsuan Yang, Ting Liu, Boqing Gong

Figure 1 for VideoGLUE: Video General Understanding Evaluation of Foundation Models
Figure 2 for VideoGLUE: Video General Understanding Evaluation of Foundation Models
Figure 3 for VideoGLUE: Video General Understanding Evaluation of Foundation Models
Figure 4 for VideoGLUE: Video General Understanding Evaluation of Foundation Models
Viaarxiv icon

Spatiotemporally Discriminative Video-Language Pre-Training with Text Grounding

Add code
Bookmark button
Alert button
Mar 28, 2023
Yuanhao Xiong, Long Zhao, Boqing Gong, Ming-Hsuan Yang, Florian Schroff, Ting Liu, Cho-Jui Hsieh, Liangzhe Yuan

Figure 1 for Spatiotemporally Discriminative Video-Language Pre-Training with Text Grounding
Figure 2 for Spatiotemporally Discriminative Video-Language Pre-Training with Text Grounding
Figure 3 for Spatiotemporally Discriminative Video-Language Pre-Training with Text Grounding
Figure 4 for Spatiotemporally Discriminative Video-Language Pre-Training with Text Grounding
Viaarxiv icon

Steering Prototype with Prompt-tuning for Rehearsal-free Continual Learning

Add code
Bookmark button
Alert button
Mar 16, 2023
Zhuowei Li, Long Zhao, Zizhao Zhang, Han Zhang, Di Liu, Ting Liu, Dimitris N. Metaxas

Figure 1 for Steering Prototype with Prompt-tuning for Rehearsal-free Continual Learning
Figure 2 for Steering Prototype with Prompt-tuning for Rehearsal-free Continual Learning
Figure 3 for Steering Prototype with Prompt-tuning for Rehearsal-free Continual Learning
Figure 4 for Steering Prototype with Prompt-tuning for Rehearsal-free Continual Learning
Viaarxiv icon

Unified Visual Relationship Detection with Vision and Language Models

Add code
Bookmark button
Alert button
Mar 16, 2023
Long Zhao, Liangzhe Yuan, Boqing Gong, Yin Cui, Florian Schroff, Ming-Hsuan Yang, Hartwig Adam, Ting Liu

Figure 1 for Unified Visual Relationship Detection with Vision and Language Models
Figure 2 for Unified Visual Relationship Detection with Vision and Language Models
Figure 3 for Unified Visual Relationship Detection with Vision and Language Models
Figure 4 for Unified Visual Relationship Detection with Vision and Language Models
Viaarxiv icon