Kwanghee Choi

Wav2Gloss: Generating Interlinear Glossed Text from Speech

Mar 19, 2024
Taiqi He, Kwanghee Choi, Lindia Tjuatja, Nathaniel R. Robinson, Jiatong Shi, Shinji Watanabe, Graham Neubig, David R. Mortensen, Lori Levin

OWSM v3.1: Better and Faster Open Whisper-Style Speech Models based on E-Branchformer

Jan 30, 2024
Yifan Peng, Jinchuan Tian, William Chen, Siddhant Arora, Brian Yan, Yui Sudo, Muhammad Shakeel, Kwanghee Choi, Jiatong Shi, Xuankai Chang, Jee-weon Jung, Shinji Watanabe

Understanding Probe Behaviors through Variational Bounds of Mutual Information

Dec 15, 2023
Kwanghee Choi, Jee-weon Jung, Shinji Watanabe

Exploring Speech Recognition, Translation, and Understanding with Discrete Speech Units: A Comparative Study

Sep 27, 2023
Xuankai Chang, Brian Yan, Kwanghee Choi, Jeeweon Jung, Yichen Lu, Soumi Maiti, Roshan Sharma, Jiatong Shi, Jinchuan Tian, Shinji Watanabe, Yuya Fujita, Takashi Maekaku, Pengcheng Guo, Yao-Fei Cheng, Pavel Denisov, Kohei Saijo, Hsiu-Hsuan Wang

Speech Intelligibility Assessment of Dysarthric Speech by using Goodness of Pronunciation with Uncertainty Quantification

May 28, 2023
Eun Jung Yeo, Kwanghee Choi, Sunhee Kim, Minhwa Chung

Unsupervised Speech Representation Pooling Using Vector Quantization

Apr 08, 2023
Jeongkyun Park, Kwanghee Choi, Hyunjun Heo, Hyung-Min Park

OLKAVS: An Open Large-Scale Korean Audio-Visual Speech Dataset

Jan 16, 2023
Jeongkyun Park, Jung-Wook Hwang, Kwanghee Choi, Seung-Hyun Lee, Jun Hwan Ahn, Rae-Hong Park, Hyung-Min Park

Data-driven Approach for Automatically Correcting Faulty Road Maps

Nov 12, 2022
Soojung Hong, Kwanghee Choi

Automatic Severity Assessment of Dysarthric speech by using Self-supervised Model with Multi-task Learning

Oct 27, 2022
Eun Jung Yeo, Kwanghee Choi, Sunhee Kim, Minhwa Chung

Opening the Black Box of wav2vec Feature Encoder

Oct 27, 2022
Kwanghee Choi, Eun Jung Yeo
