Matthew Brown

Module-wise Adaptive Distillation for Multimodality Foundation Models

Oct 06, 2023
Chen Liang, Jiahui Yu, Ming-Hsuan Yang, Matthew Brown, Yin Cui, Tuo Zhao, Boqing Gong, Tianyi Zhou

Towards a Unified Foundation Model: Jointly Pre-Training Transformers on Unpaired Images and Text

Dec 14, 2021
Qing Li, Boqing Gong, Yin Cui, Dan Kondratyuk, Xianzhi Du, Ming-Hsuan Yang, Matthew Brown

Exploring Temporal Granularity in Self-Supervised Video Representation Learning

Dec 08, 2021
Rui Qian, Yeqing Li, Liangzhe Yuan, Boqing Gong, Ting Liu, Matthew Brown, Serge Belongie, Ming-Hsuan Yang, Hartwig Adam, Yin Cui

2.5D Visual Relationship Detection

Apr 26, 2021
Yu-Chuan Su, Soravit Changpinyo, Xiangning Chen, Sathish Thoppay, Cho-Jui Hsieh, Lior Shapira, Radu Soricut, Hartwig Adam, Matthew Brown, Ming-Hsuan Yang, Boqing Gong

MoViNets: Mobile Video Networks for Efficient Video Recognition

Apr 18, 2021
Dan Kondratyuk, Liangzhe Yuan, Yandong Li, Li Zhang, Mingxing Tan, Matthew Brown, Boqing Gong

FiG-NeRF: Figure-Ground Neural Radiance Fields for 3D Object Category Modelling

Apr 17, 2021
Christopher Xie, Keunhong Park, Ricardo Martin-Brualla, Matthew Brown

GeLaTO: Generative Latent Textured Objects

Aug 11, 2020
Ricardo Martin-Brualla, Rohit Pandey, Sofien Bouaziz, Matthew Brown, Dan B Goldman

When Ensembling Smaller Models is More Efficient than Single Large Models

May 01, 2020
Dan Kondratyuk, Mingxing Tan, Matthew Brown, Boqing Gong

Rethinking Class-Balanced Methods for Long-Tailed Visual Recognition from a Domain Adaptation Perspective

Mar 24, 2020
Muhammad Abdullah Jamal, Matthew Brown, Ming-Hsuan Yang, Liqiang Wang, Boqing Gong
