Mikhail Sirotenko

VideoPrism: A Foundational Visual Encoder for Video Understanding

Feb 20, 2024
Long Zhao, Nitesh B. Gundavarapu, Liangzhe Yuan, Hao Zhou, Shen Yan, Jennifer J. Sun, Luke Friedman, Rui Qian, Tobias Weyand, Yue Zhao, Rachel Hornung, Florian Schroff, Ming-Hsuan Yang, David A. Ross, Huisheng Wang, Hartwig Adam, Mikhail Sirotenko, Ting Liu, Boqing Gong

VideoPoet: A Large Language Model for Zero-Shot Video Generation

Dec 21, 2023
Dan Kondratyuk, Lijun Yu, Xiuye Gu, José Lezama, Jonathan Huang, Rachel Hornung, Hartwig Adam, Hassan Akbari, Yair Alon, Vighnesh Birodkar, Yong Cheng, Ming-Chang Chiu, Josh Dillon, Irfan Essa, Agrim Gupta, Meera Hahn, Anja Hauth, David Hendon, Alonso Martinez, David Minnen, David Ross, Grant Schindler, Mikhail Sirotenko, Kihyuk Sohn, Krishna Somandepalli, Huisheng Wang, Jimmy Yan, Ming-Hsuan Yang, Xuan Yang, Bryan Seybold, Lu Jiang

PolyMaX: General Dense Prediction with Mask Transformer

Nov 09, 2023
Xuan Yang, Liangzhe Yuan, Kimberly Wilber, Astuti Sharma, Xiuye Gu, Siyuan Qiao, Stephanie Debats, Huisheng Wang, Hartwig Adam, Mikhail Sirotenko, Liang-Chieh Chen

SANPO: A Scene Understanding, Accessibility, Navigation, Pathfinding, Obstacle Avoidance Dataset

Sep 21, 2023
Sagar M. Waghmare, Kimberly Wilber, Dave Hawkey, Xuan Yang, Matthew Wilson, Stephanie Debats, Cattalyya Nuengsigkapian, Astuti Sharma, Lars Pandikow, Huisheng Wang, Hartwig Adam, Mikhail Sirotenko

VideoGLUE: Video General Understanding Evaluation of Foundation Models

Jul 06, 2023
Liangzhe Yuan, Nitesh Bharadwaj Gundavarapu, Long Zhao, Hao Zhou, Yin Cui, Lu Jiang, Xuan Yang, Menglin Jia, Tobias Weyand, Luke Friedman, Mikhail Sirotenko, Huisheng Wang, Florian Schroff, Hartwig Adam, Ming-Hsuan Yang, Ting Liu, Boqing Gong

Efficient Image Representation Learning with Federated Sampled Softmax

Mar 09, 2022
Sagar M. Waghmare, Hang Qi, Huizhong Chen, Mikhail Sirotenko, Tomer Meron

Fashionpedia: Ontology, Segmentation, and an Attribute Localization Dataset

Apr 26, 2020
Menglin Jia, Mengyun Shi, Mikhail Sirotenko, Yin Cui, Claire Cardie, Bharath Hariharan, Hartwig Adam, Serge Belongie
