Alert button
Picture for Bryan Seybold

Bryan Seybold

Alert button

VideoPoet: A Large Language Model for Zero-Shot Video Generation

Add code
Bookmark button
Alert button
Dec 21, 2023
Dan Kondratyuk, Lijun Yu, Xiuye Gu, José Lezama, Jonathan Huang, Rachel Hornung, Hartwig Adam, Hassan Akbari, Yair Alon, Vighnesh Birodkar, Yong Cheng, Ming-Chang Chiu, Josh Dillon, Irfan Essa, Agrim Gupta, Meera Hahn, Anja Hauth, David Hendon, Alonso Martinez, David Minnen, David Ross, Grant Schindler, Mikhail Sirotenko, Kihyuk Sohn, Krishna Somandepalli, Huisheng Wang, Jimmy Yan, Ming-Hsuan Yang, Xuan Yang, Bryan Seybold, Lu Jiang

Viaarxiv icon

Open-Vocabulary Temporal Action Detection with Off-the-Shelf Image-Text Features

Add code
Bookmark button
Alert button
Dec 20, 2022
Vivek Rathod, Bryan Seybold, Sudheendra Vijayanarasimhan, Austin Myers, Xiuye Gu, Vighnesh Birodkar, David A. Ross

Figure 1 for Open-Vocabulary Temporal Action Detection with Off-the-Shelf Image-Text Features
Figure 2 for Open-Vocabulary Temporal Action Detection with Off-the-Shelf Image-Text Features
Figure 3 for Open-Vocabulary Temporal Action Detection with Off-the-Shelf Image-Text Features
Figure 4 for Open-Vocabulary Temporal Action Detection with Off-the-Shelf Image-Text Features
Viaarxiv icon

What's in a Caption? Dataset-Specific Linguistic Diversity and Its Effect on Visual Description Models and Metrics

Add code
Bookmark button
Alert button
May 12, 2022
David M. Chan, Austin Myers, Sudheendra Vijayanarasimhan, David A. Ross, Bryan Seybold, John F. Canny

Figure 1 for What's in a Caption? Dataset-Specific Linguistic Diversity and Its Effect on Visual Description Models and Metrics
Figure 2 for What's in a Caption? Dataset-Specific Linguistic Diversity and Its Effect on Visual Description Models and Metrics
Figure 3 for What's in a Caption? Dataset-Specific Linguistic Diversity and Its Effect on Visual Description Models and Metrics
Figure 4 for What's in a Caption? Dataset-Specific Linguistic Diversity and Its Effect on Visual Description Models and Metrics
Viaarxiv icon

Learning Audio-Video Modalities from Image Captions

Add code
Bookmark button
Alert button
Apr 01, 2022
Arsha Nagrani, Paul Hongsuck Seo, Bryan Seybold, Anja Hauth, Santiago Manen, Chen Sun, Cordelia Schmid

Figure 1 for Learning Audio-Video Modalities from Image Captions
Figure 2 for Learning Audio-Video Modalities from Image Captions
Figure 3 for Learning Audio-Video Modalities from Image Captions
Figure 4 for Learning Audio-Video Modalities from Image Captions
Viaarxiv icon

Optical Mouse: 3D Mouse Pose From Single-View Video

Add code
Bookmark button
Alert button
Jun 17, 2021
Bo Hu, Bryan Seybold, Shan Yang, David Ross, Avneesh Sud, Graham Ruby, Yi Liu

Figure 1 for Optical Mouse: 3D Mouse Pose From Single-View Video
Figure 2 for Optical Mouse: 3D Mouse Pose From Single-View Video
Figure 3 for Optical Mouse: 3D Mouse Pose From Single-View Video
Figure 4 for Optical Mouse: 3D Mouse Pose From Single-View Video
Viaarxiv icon

Dueling Decoders: Regularizing Variational Autoencoder Latent Spaces

Add code
Bookmark button
Alert button
May 17, 2019
Bryan Seybold, Emily Fertig, Alex Alemi, Ian Fischer

Figure 1 for Dueling Decoders: Regularizing Variational Autoencoder Latent Spaces
Figure 2 for Dueling Decoders: Regularizing Variational Autoencoder Latent Spaces
Figure 3 for Dueling Decoders: Regularizing Variational Autoencoder Latent Spaces
Figure 4 for Dueling Decoders: Regularizing Variational Autoencoder Latent Spaces
Viaarxiv icon

Rethinking the Faster R-CNN Architecture for Temporal Action Localization

Add code
Bookmark button
Alert button
Apr 20, 2018
Yu-Wei Chao, Sudheendra Vijayanarasimhan, Bryan Seybold, David A. Ross, Jia Deng, Rahul Sukthankar

Figure 1 for Rethinking the Faster R-CNN Architecture for Temporal Action Localization
Figure 2 for Rethinking the Faster R-CNN Architecture for Temporal Action Localization
Figure 3 for Rethinking the Faster R-CNN Architecture for Temporal Action Localization
Figure 4 for Rethinking the Faster R-CNN Architecture for Temporal Action Localization
Viaarxiv icon

Instance Embedding Transfer to Unsupervised Video Object Segmentation

Add code
Bookmark button
Alert button
Feb 27, 2018
Siyang Li, Bryan Seybold, Alexey Vorobyov, Alireza Fathi, Qin Huang, C. -C. Jay Kuo

Figure 1 for Instance Embedding Transfer to Unsupervised Video Object Segmentation
Figure 2 for Instance Embedding Transfer to Unsupervised Video Object Segmentation
Figure 3 for Instance Embedding Transfer to Unsupervised Video Object Segmentation
Figure 4 for Instance Embedding Transfer to Unsupervised Video Object Segmentation
Viaarxiv icon

CNN Architectures for Large-Scale Audio Classification

Add code
Bookmark button
Alert button
Jan 10, 2017
Shawn Hershey, Sourish Chaudhuri, Daniel P. W. Ellis, Jort F. Gemmeke, Aren Jansen, R. Channing Moore, Manoj Plakal, Devin Platt, Rif A. Saurous, Bryan Seybold, Malcolm Slaney, Ron J. Weiss, Kevin Wilson

Figure 1 for CNN Architectures for Large-Scale Audio Classification
Figure 2 for CNN Architectures for Large-Scale Audio Classification
Figure 3 for CNN Architectures for Large-Scale Audio Classification
Figure 4 for CNN Architectures for Large-Scale Audio Classification
Viaarxiv icon