Po-Yao Huang

MAViL: Masked Audio-Video Learners

Dec 15, 2022
Po-Yao Huang, Vasu Sharma, Hu Xu, Chaitanya Ryali, Haoqi Fan, Yanghao Li, Shang-Wen Li, Gargi Ghosh, Jitendra Malik, Christoph Feichtenhofer

Via arXiv

AudioTagging Done Right: 2nd comparison of deep learning methods for environmental sound classification

Apr 03, 2022
Juncheng B Li, Shuhui Qu, Po-Yao Huang, Florian Metze

Via arXiv

VideoCLIP: Contrastive Pre-training for Zero-shot Video-Text Understanding

Oct 01, 2021
Hu Xu, Gargi Ghosh, Po-Yao Huang, Dmytro Okhonko, Armen Aghajanyan, Florian Metze, Luke Zettlemoyer, Christoph Feichtenhofer

Via arXiv

VLM: Task-agnostic Video-Language Model Pre-training for Video Understanding

May 20, 2021
Hu Xu, Gargi Ghosh, Po-Yao Huang, Prahal Arora, Masoumeh Aminzadeh, Christoph Feichtenhofer, Florian Metze, Luke Zettlemoyer

Via arXiv

Multilingual Multimodal Pre-training for Zero-Shot Cross-Lingual Transfer of Vision-Language Models

Apr 15, 2021
Po-Yao Huang, Mandela Patrick, Junjie Hu, Graham Neubig, Florian Metze, Alexander Hauptmann

Via arXiv

Audio-Visual Event Recognition through the lens of Adversary

Nov 15, 2020
Juncheng B Li, Kaixin Ma, Shuhui Qu, Po-Yao Huang, Florian Metze

Via arXiv

Support-set bottlenecks for video-text representation learning

Oct 06, 2020
Mandela Patrick, Po-Yao Huang, Yuki Asano, Florian Metze, Alexander Hauptmann, João Henriques, Andrea Vedaldi

Via arXiv

A Survey of Deep Active Learning

Aug 30, 2020
Pengzhen Ren, Yun Xiao, Xiaojun Chang, Po-Yao Huang, Zhihui Li, Xiaojiang Chen, Xin Wang

Via arXiv

A Comprehensive Survey of Neural Architecture Search: Challenges and Solutions

Jun 01, 2020
Pengzhen Ren, Yun Xiao, Xiaojun Chang, Po-Yao Huang, Zhihui Li, Xiaojiang Chen, Xin Wang

Via arXiv