Alexander H. Liu

Towards audio language modeling -- an overview

Feb 20, 2024
Haibin Wu, Xuanjun Chen, Yi-Cheng Lin, Kai-wei Chang, Ho-Lam Chung, Alexander H. Liu, Hung-yi Lee

Codec-SUPERB: An In-Depth Analysis of Sound Codec Models

Feb 20, 2024
Haibin Wu, Ho-Lam Chung, Yi-Cheng Lin, Yuan-Kuei Wu, Xuanjun Chen, Yu-Chi Pai, Hsiu-Hsuan Wang, Kai-Wei Chang, Alexander H. Liu, Hung-yi Lee

Revisiting Self-supervised Learning of Speech Representation from a Mutual Information Perspective

Jan 16, 2024
Alexander H. Liu, Sung-Lin Yeh, James Glass

Generative Pre-training for Speech with Flow Matching

Oct 25, 2023
Alexander H. Liu, Matt Le, Apoorv Vyas, Bowen Shi, Andros Tjandra, Wei-Ning Hsu

Joint Audio and Speech Understanding

Oct 02, 2023
Yuan Gong, Alexander H. Liu, Hongyin Luo, Leonid Karlinsky, James Glass

Self-supervised Fine-tuning for Improved Content Representations by Speaker-invariant Clustering

May 18, 2023
Heng-Jui Chang, Alexander H. Liu, James Glass

Listen, Think, and Understand

May 18, 2023
Yuan Gong, Hongyin Luo, Alexander H. Liu, Leonid Karlinsky, James Glass

DinoSR: Self-Distillation and Online Clustering for Self-supervised Speech Representation Learning

May 17, 2023
Alexander H. Liu, Heng-Jui Chang, Michael Auli, Wei-Ning Hsu, James R. Glass

UAVM: A Unified Model for Audio-Visual Learning

Jul 29, 2022
Yuan Gong, Alexander H. Liu, Andrew Rouditchenko, James Glass

Simple and Effective Unsupervised Speech Synthesis

Apr 20, 2022
Alexander H. Liu, Cheng-I Jeff Lai, Wei-Ning Hsu, Michael Auli, Alexei Baevski, James Glass
