Heng-Jui Chang

SpeechCLIP+: Self-supervised multi-task representation learning for speech via CLIP and speech-image data

Feb 10, 2024
Hsuan-Fu Wang, Yi-Jen Shih, Heng-Jui Chang, Layne Berry, Puyuan Peng, Hung-yi Lee, Hsin-Min Wang, David Harwath

Via arXiv

R-Spin: Efficient Speaker and Noise-invariant Representation Learning with Acoustic Pieces

Nov 15, 2023
Heng-Jui Chang, James Glass

Via arXiv

CoLLD: Contrastive Layer-to-layer Distillation for Compressing Multilingual Pre-trained Speech Encoders

Sep 14, 2023
Heng-Jui Chang, Ning Dong, Ruslan Mavlyutov, Sravya Popuri, Yu-An Chung

Via arXiv

Self-supervised Fine-tuning for Improved Content Representations by Speaker-invariant Clustering

May 18, 2023
Heng-Jui Chang, Alexander H. Liu, James Glass

Via arXiv

DinoSR: Self-Distillation and Online Clustering for Self-supervised Speech Representation Learning

May 17, 2023
Alexander H. Liu, Heng-Jui Chang, Michael Auli, Wei-Ning Hsu, James R. Glass

Via arXiv

M-SpeechCLIP: Leveraging Large-Scale, Pre-Trained Models for Multilingual Speech to Image Retrieval

Nov 02, 2022
Layne Berry, Yi-Jen Shih, Hsuan-Fu Wang, Heng-Jui Chang, Hung-yi Lee, David Harwath

Via arXiv

SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model

Oct 03, 2022
Yi-Jen Shih, Hsuan-Fu Wang, Heng-Jui Chang, Layne Berry, Hung-yi Lee, David Harwath

Via arXiv

SUPERB-SG: Enhanced Speech processing Universal PERformance Benchmark for Semantic and Generative Capabilities

Mar 14, 2022
Hsiang-Sheng Tsai, Heng-Jui Chang, Wen-Chin Huang, Zili Huang, Kushal Lakhotia, Shu-wen Yang, Shuyan Dong, Andy T. Liu, Cheng-I Jeff Lai, Jiatong Shi, Xuankai Chang, Phil Hall, Hsuan-Jui Chen, Shang-Wen Li, Shinji Watanabe, Abdelrahman Mohamed, Hung-yi Lee

Via arXiv

Mandarin-English Code-switching Speech Recognition with Self-supervised Speech Representation Models

Oct 07, 2021
Liang-Hsuan Tseng, Yu-Kuan Fu, Heng-Jui Chang, Hung-yi Lee

Via arXiv

DistilHuBERT: Speech Representation Learning by Layer-wise Distillation of Hidden-unit BERT

Oct 06, 2021
Heng-Jui Chang, Shu-wen Yang, Hung-yi Lee

Via arXiv