Alert button
Picture for Hyung-Min Park

Hyung-Min Park

Alert button

NeXt-TDNN: Modernizing Multi-Scale Temporal Convolution Backbone for Speaker Verification

Add code
Bookmark button
Alert button
Dec 15, 2023
Hyun-Jun Heo, Ui-Hyeop Shin, Ran Lee, YoungJu Cheon, Hyung-Min Park

Viaarxiv icon

Statistical Beamformer Exploiting Non-stationarity and Sparsity with Spatially Constrained ICA for Robust Speech Recognition

Add code
Bookmark button
Alert button
Jun 13, 2023
Ui-Hyeop Shin, Hyung-Min Park

Figure 1 for Statistical Beamformer Exploiting Non-stationarity and Sparsity with Spatially Constrained ICA for Robust Speech Recognition
Figure 2 for Statistical Beamformer Exploiting Non-stationarity and Sparsity with Spatially Constrained ICA for Robust Speech Recognition
Figure 3 for Statistical Beamformer Exploiting Non-stationarity and Sparsity with Spatially Constrained ICA for Robust Speech Recognition
Figure 4 for Statistical Beamformer Exploiting Non-stationarity and Sparsity with Spatially Constrained ICA for Robust Speech Recognition
Viaarxiv icon

Unsupervised Speech Representation Pooling Using Vector Quantization

Add code
Bookmark button
Alert button
Apr 08, 2023
Jeongkyun Park, Kwanghee Choi, Hyunjun Heo, Hyung-Min Park

Figure 1 for Unsupervised Speech Representation Pooling Using Vector Quantization
Figure 2 for Unsupervised Speech Representation Pooling Using Vector Quantization
Figure 3 for Unsupervised Speech Representation Pooling Using Vector Quantization
Figure 4 for Unsupervised Speech Representation Pooling Using Vector Quantization
Viaarxiv icon

OLKAVS: An Open Large-Scale Korean Audio-Visual Speech Dataset

Add code
Bookmark button
Alert button
Jan 16, 2023
Jeongkyun Park, Jung-Wook Hwang, Kwanghee Choi, Seung-Hyun Lee, Jun Hwan Ahn, Rae-Hong Park, Hyung-Min Park

Figure 1 for OLKAVS: An Open Large-Scale Korean Audio-Visual Speech Dataset
Figure 2 for OLKAVS: An Open Large-Scale Korean Audio-Visual Speech Dataset
Figure 3 for OLKAVS: An Open Large-Scale Korean Audio-Visual Speech Dataset
Figure 4 for OLKAVS: An Open Large-Scale Korean Audio-Visual Speech Dataset
Viaarxiv icon

Distilling a Pretrained Language Model to a Multilingual ASR Model

Add code
Bookmark button
Alert button
Jun 25, 2022
Kwanghee Choi, Hyung-Min Park

Figure 1 for Distilling a Pretrained Language Model to a Multilingual ASR Model
Figure 2 for Distilling a Pretrained Language Model to a Multilingual ASR Model
Figure 3 for Distilling a Pretrained Language Model to a Multilingual ASR Model
Figure 4 for Distilling a Pretrained Language Model to a Multilingual ASR Model
Viaarxiv icon

Unsupervised Speech Domain Adaptation Based on Disentangled Representation Learning for Robust Speech Recognition

Add code
Bookmark button
Alert button
Apr 12, 2019
Jong-Hyeon Park, Myungwoo Oh, Hyung-Min Park

Figure 1 for Unsupervised Speech Domain Adaptation Based on Disentangled Representation Learning for Robust Speech Recognition
Figure 2 for Unsupervised Speech Domain Adaptation Based on Disentangled Representation Learning for Robust Speech Recognition
Figure 3 for Unsupervised Speech Domain Adaptation Based on Disentangled Representation Learning for Robust Speech Recognition
Figure 4 for Unsupervised Speech Domain Adaptation Based on Disentangled Representation Learning for Robust Speech Recognition
Viaarxiv icon

BREN: Body Reflection Essence-Neuter Model for Separation of Reflection Components

Add code
Bookmark button
Alert button
Aug 25, 2015
Changsoo Je, Hyung-Min Park

Figure 1 for BREN: Body Reflection Essence-Neuter Model for Separation of Reflection Components
Figure 2 for BREN: Body Reflection Essence-Neuter Model for Separation of Reflection Components
Figure 3 for BREN: Body Reflection Essence-Neuter Model for Separation of Reflection Components
Figure 4 for BREN: Body Reflection Essence-Neuter Model for Separation of Reflection Components
Viaarxiv icon