Alert button
Picture for Florian Schroff

Florian Schroff

Alert button

VideoPrism: A Foundational Visual Encoder for Video Understanding

Add code
Bookmark button
Alert button
Feb 20, 2024
Long Zhao, Nitesh B. Gundavarapu, Liangzhe Yuan, Hao Zhou, Shen Yan, Jennifer J. Sun, Luke Friedman, Rui Qian, Tobias Weyand, Yue Zhao, Rachel Hornung, Florian Schroff, Ming-Hsuan Yang, David A. Ross, Huisheng Wang, Hartwig Adam, Mikhail Sirotenko, Ting Liu, Boqing Gong

Viaarxiv icon

Distilling Vision-Language Models on Millions of Videos

Add code
Bookmark button
Alert button
Jan 11, 2024
Yue Zhao, Long Zhao, Xingyi Zhou, Jialin Wu, Chun-Te Chu, Hui Miao, Florian Schroff, Hartwig Adam, Ting Liu, Boqing Gong, Philipp Krähenbühl, Liangzhe Yuan

Viaarxiv icon

VideoGLUE: Video General Understanding Evaluation of Foundation Models

Add code
Bookmark button
Alert button
Jul 06, 2023
Liangzhe Yuan, Nitesh Bharadwaj Gundavarapu, Long Zhao, Hao Zhou, Yin Cui, Lu Jiang, Xuan Yang, Menglin Jia, Tobias Weyand, Luke Friedman, Mikhail Sirotenko, Huisheng Wang, Florian Schroff, Hartwig Adam, Ming-Hsuan Yang, Ting Liu, Boqing Gong

Figure 1 for VideoGLUE: Video General Understanding Evaluation of Foundation Models
Figure 2 for VideoGLUE: Video General Understanding Evaluation of Foundation Models
Figure 3 for VideoGLUE: Video General Understanding Evaluation of Foundation Models
Figure 4 for VideoGLUE: Video General Understanding Evaluation of Foundation Models
Viaarxiv icon

Spatiotemporally Discriminative Video-Language Pre-Training with Text Grounding

Add code
Bookmark button
Alert button
Mar 28, 2023
Yuanhao Xiong, Long Zhao, Boqing Gong, Ming-Hsuan Yang, Florian Schroff, Ting Liu, Cho-Jui Hsieh, Liangzhe Yuan

Figure 1 for Spatiotemporally Discriminative Video-Language Pre-Training with Text Grounding
Figure 2 for Spatiotemporally Discriminative Video-Language Pre-Training with Text Grounding
Figure 3 for Spatiotemporally Discriminative Video-Language Pre-Training with Text Grounding
Figure 4 for Spatiotemporally Discriminative Video-Language Pre-Training with Text Grounding
Viaarxiv icon

Unified Visual Relationship Detection with Vision and Language Models

Add code
Bookmark button
Alert button
Mar 16, 2023
Long Zhao, Liangzhe Yuan, Boqing Gong, Yin Cui, Florian Schroff, Ming-Hsuan Yang, Hartwig Adam, Ting Liu

Figure 1 for Unified Visual Relationship Detection with Vision and Language Models
Figure 2 for Unified Visual Relationship Detection with Vision and Language Models
Figure 3 for Unified Visual Relationship Detection with Vision and Language Models
Figure 4 for Unified Visual Relationship Detection with Vision and Language Models
Viaarxiv icon

Learning to Generate Image Embeddings with User-level Differential Privacy

Add code
Bookmark button
Alert button
Nov 20, 2022
Zheng Xu, Maxwell Collins, Yuxiao Wang, Liviu Panait, Sewoong Oh, Sean Augenstein, Ting Liu, Florian Schroff, H. Brendan McMahan

Figure 1 for Learning to Generate Image Embeddings with User-level Differential Privacy
Figure 2 for Learning to Generate Image Embeddings with User-level Differential Privacy
Figure 3 for Learning to Generate Image Embeddings with User-level Differential Privacy
Figure 4 for Learning to Generate Image Embeddings with User-level Differential Privacy
Viaarxiv icon

Contextualized Spatio-Temporal Contrastive Learning with Self-Supervision

Add code
Bookmark button
Alert button
Dec 09, 2021
Liangzhe Yuan, Rui Qian, Yin Cui, Boqing Gong, Florian Schroff, Ming-Hsuan Yang, Hartwig Adam, Ting Liu

Figure 1 for Contextualized Spatio-Temporal Contrastive Learning with Self-Supervision
Figure 2 for Contextualized Spatio-Temporal Contrastive Learning with Self-Supervision
Figure 3 for Contextualized Spatio-Temporal Contrastive Learning with Self-Supervision
Figure 4 for Contextualized Spatio-Temporal Contrastive Learning with Self-Supervision
Viaarxiv icon

DeepLab2: A TensorFlow Library for Deep Labeling

Add code
Bookmark button
Alert button
Jun 17, 2021
Mark Weber, Huiyu Wang, Siyuan Qiao, Jun Xie, Maxwell D. Collins, Yukun Zhu, Liangzhe Yuan, Dahun Kim, Qihang Yu, Daniel Cremers, Laura Leal-Taixe, Alan L. Yuille, Florian Schroff, Hartwig Adam, Liang-Chieh Chen

Figure 1 for DeepLab2: A TensorFlow Library for Deep Labeling
Figure 2 for DeepLab2: A TensorFlow Library for Deep Labeling
Figure 3 for DeepLab2: A TensorFlow Library for Deep Labeling
Viaarxiv icon

Learning View-Disentangled Human Pose Representation by Contrastive Cross-View Mutual Information Maximization

Add code
Bookmark button
Alert button
Dec 02, 2020
Long Zhao, Yuxiao Wang, Jiaping Zhao, Liangzhe Yuan, Jennifer J. Sun, Florian Schroff, Hartwig Adam, Xi Peng, Dimitris Metaxas, Ting Liu

Figure 1 for Learning View-Disentangled Human Pose Representation by Contrastive Cross-View Mutual Information Maximization
Figure 2 for Learning View-Disentangled Human Pose Representation by Contrastive Cross-View Mutual Information Maximization
Figure 3 for Learning View-Disentangled Human Pose Representation by Contrastive Cross-View Mutual Information Maximization
Figure 4 for Learning View-Disentangled Human Pose Representation by Contrastive Cross-View Mutual Information Maximization
Viaarxiv icon

View-Invariant, Occlusion-Robust Probabilistic Embedding for Human Pose

Add code
Bookmark button
Alert button
Oct 23, 2020
Ting Liu, Jennifer J. Sun, Long Zhao, Jiaping Zhao, Liangzhe Yuan, Yuxiao Wang, Liang-Chieh Chen, Florian Schroff, Hartwig Adam

Figure 1 for View-Invariant, Occlusion-Robust Probabilistic Embedding for Human Pose
Figure 2 for View-Invariant, Occlusion-Robust Probabilistic Embedding for Human Pose
Figure 3 for View-Invariant, Occlusion-Robust Probabilistic Embedding for Human Pose
Figure 4 for View-Invariant, Occlusion-Robust Probabilistic Embedding for Human Pose
Viaarxiv icon