Alert button
Picture for Hao Tang

Hao Tang

Alert button

Peeling the Onion: Hierarchical Reduction of Data Redundancy for Efficient Vision Transformer Training

Add code
Bookmark button
Alert button
Nov 19, 2022
Zhenglun Kong, Haoyu Ma, Geng Yuan, Mengshu Sun, Yanyue Xie, Peiyan Dong, Xin Meng, Xuan Shen, Hao Tang, Minghai Qin, Tianlong Chen, Xiaolong Ma, Xiaohui Xie, Zhangyang Wang, Yanzhi Wang

Figure 1 for Peeling the Onion: Hierarchical Reduction of Data Redundancy for Efficient Vision Transformer Training
Figure 2 for Peeling the Onion: Hierarchical Reduction of Data Redundancy for Efficient Vision Transformer Training
Figure 3 for Peeling the Onion: Hierarchical Reduction of Data Redundancy for Efficient Vision Transformer Training
Figure 4 for Peeling the Onion: Hierarchical Reduction of Data Redundancy for Efficient Vision Transformer Training
Viaarxiv icon

Compressing Transformer-based self-supervised models for speech processing

Add code
Bookmark button
Alert button
Nov 17, 2022
Tzu-Quan Lin, Tsung-Huan Yang, Chun-Yao Chang, Kuang-Ming Chen, Tzu-hsun Feng, Hung-yi Lee, Hao Tang

Figure 1 for Compressing Transformer-based self-supervised models for speech processing
Figure 2 for Compressing Transformer-based self-supervised models for speech processing
Figure 3 for Compressing Transformer-based self-supervised models for speech processing
Figure 4 for Compressing Transformer-based self-supervised models for speech processing
Viaarxiv icon

MelHuBERT: A simplified HuBERT on Mel spectrogram

Add code
Bookmark button
Alert button
Nov 17, 2022
Tzu-Quan Lin, Hung-yi Lee, Hao Tang

Figure 1 for MelHuBERT: A simplified HuBERT on Mel spectrogram
Figure 2 for MelHuBERT: A simplified HuBERT on Mel spectrogram
Figure 3 for MelHuBERT: A simplified HuBERT on Mel spectrogram
Figure 4 for MelHuBERT: A simplified HuBERT on Mel spectrogram
Viaarxiv icon

Deep Unsupervised Key Frame Extraction for Efficient Video Classification

Add code
Bookmark button
Alert button
Nov 12, 2022
Hao Tang, Lei Ding, Songsong Wu, Bin Ren, Nicu Sebe, Paolo Rota

Figure 1 for Deep Unsupervised Key Frame Extraction for Efficient Video Classification
Figure 2 for Deep Unsupervised Key Frame Extraction for Efficient Video Classification
Figure 3 for Deep Unsupervised Key Frame Extraction for Efficient Video Classification
Figure 4 for Deep Unsupervised Key Frame Extraction for Efficient Video Classification
Viaarxiv icon

Bipartite Graph Reasoning GANs for Person Pose and Facial Image Synthesis

Add code
Bookmark button
Alert button
Nov 12, 2022
Hao Tang, Ling Shao, Philip H. S. Torr, Nicu Sebe

Figure 1 for Bipartite Graph Reasoning GANs for Person Pose and Facial Image Synthesis
Figure 2 for Bipartite Graph Reasoning GANs for Person Pose and Facial Image Synthesis
Figure 3 for Bipartite Graph Reasoning GANs for Person Pose and Facial Image Synthesis
Figure 4 for Bipartite Graph Reasoning GANs for Person Pose and Facial Image Synthesis
Viaarxiv icon

The Lottery Ticket Hypothesis for Vision Transformers

Add code
Bookmark button
Alert button
Nov 02, 2022
Xuan Shen, Zhenglun Kong, Minghai Qin, Peiyan Dong, Geng Yuan, Xin Meng, Hao Tang, Xiaolong Ma, Yanzhi Wang

Figure 1 for The Lottery Ticket Hypothesis for Vision Transformers
Figure 2 for The Lottery Ticket Hypothesis for Vision Transformers
Figure 3 for The Lottery Ticket Hypothesis for Vision Transformers
Figure 4 for The Lottery Ticket Hypothesis for Vision Transformers
Viaarxiv icon

Learning Dependencies of Discrete Speech Representations with Neural Hidden Markov Models

Add code
Bookmark button
Alert button
Oct 29, 2022
Sung-Lin Yeh, Hao Tang

Figure 1 for Learning Dependencies of Discrete Speech Representations with Neural Hidden Markov Models
Figure 2 for Learning Dependencies of Discrete Speech Representations with Neural Hidden Markov Models
Figure 3 for Learning Dependencies of Discrete Speech Representations with Neural Hidden Markov Models
Figure 4 for Learning Dependencies of Discrete Speech Representations with Neural Hidden Markov Models
Viaarxiv icon

Analyzing Acoustic Word Embeddings from Pre-trained Self-supervised Speech Models

Add code
Bookmark button
Alert button
Oct 28, 2022
Ramon Sanabria, Hao Tang, Sharon Goldwater

Figure 1 for Analyzing Acoustic Word Embeddings from Pre-trained Self-supervised Speech Models
Figure 2 for Analyzing Acoustic Word Embeddings from Pre-trained Self-supervised Speech Models
Figure 3 for Analyzing Acoustic Word Embeddings from Pre-trained Self-supervised Speech Models
Figure 4 for Analyzing Acoustic Word Embeddings from Pre-trained Self-supervised Speech Models
Viaarxiv icon

Conditioning and Sampling in Variational Diffusion Models for Speech Super-resolution

Add code
Bookmark button
Alert button
Oct 27, 2022
Chin-Yun Yu, Sung-Lin Yeh, György Fazekas, Hao Tang

Figure 1 for Conditioning and Sampling in Variational Diffusion Models for Speech Super-resolution
Figure 2 for Conditioning and Sampling in Variational Diffusion Models for Speech Super-resolution
Figure 3 for Conditioning and Sampling in Variational Diffusion Models for Speech Super-resolution
Figure 4 for Conditioning and Sampling in Variational Diffusion Models for Speech Super-resolution
Viaarxiv icon

On Compressing Sequences for Self-Supervised Speech Models

Add code
Bookmark button
Alert button
Oct 14, 2022
Yen Meng, Hsuan-Jui Chen, Jiatong Shi, Shinji Watanabe, Paola Garcia, Hung-yi Lee, Hao Tang

Figure 1 for On Compressing Sequences for Self-Supervised Speech Models
Figure 2 for On Compressing Sequences for Self-Supervised Speech Models
Figure 3 for On Compressing Sequences for Self-Supervised Speech Models
Figure 4 for On Compressing Sequences for Self-Supervised Speech Models
Viaarxiv icon