Picture for Chin-Hui Lee

Chin-Hui Lee

An Experimental Study on Private Aggregation of Teacher Ensemble Learning for End-to-End Speech Recognition

Add code
Oct 13, 2022
Figure 1 for An Experimental Study on Private Aggregation of Teacher Ensemble Learning for End-to-End Speech Recognition
Figure 2 for An Experimental Study on Private Aggregation of Teacher Ensemble Learning for End-to-End Speech Recognition
Figure 3 for An Experimental Study on Private Aggregation of Teacher Ensemble Learning for End-to-End Speech Recognition
Figure 4 for An Experimental Study on Private Aggregation of Teacher Ensemble Learning for End-to-End Speech Recognition
Viaarxiv icon

An Ensemble Teacher-Student Learning Approach with Poisson Sub-sampling to Differential Privacy Preserving Speech Recognition

Add code
Oct 12, 2022
Figure 1 for An Ensemble Teacher-Student Learning Approach with Poisson Sub-sampling to Differential Privacy Preserving Speech Recognition
Figure 2 for An Ensemble Teacher-Student Learning Approach with Poisson Sub-sampling to Differential Privacy Preserving Speech Recognition
Figure 3 for An Ensemble Teacher-Student Learning Approach with Poisson Sub-sampling to Differential Privacy Preserving Speech Recognition
Figure 4 for An Ensemble Teacher-Student Learning Approach with Poisson Sub-sampling to Differential Privacy Preserving Speech Recognition
Viaarxiv icon

A study on joint modeling and data augmentation of multi-modalities for audio-visual scene classification

Add code
Mar 31, 2022
Figure 1 for A study on joint modeling and data augmentation of multi-modalities for audio-visual scene classification
Figure 2 for A study on joint modeling and data augmentation of multi-modalities for audio-visual scene classification
Figure 3 for A study on joint modeling and data augmentation of multi-modalities for audio-visual scene classification
Figure 4 for A study on joint modeling and data augmentation of multi-modalities for audio-visual scene classification
Viaarxiv icon

A Study of Designing Compact Audio-Visual Wake Word Spotting System Based on Iterative Fine-Tuning in Neural Network Pruning

Add code
Feb 17, 2022
Figure 1 for A Study of Designing Compact Audio-Visual Wake Word Spotting System Based on Iterative Fine-Tuning in Neural Network Pruning
Figure 2 for A Study of Designing Compact Audio-Visual Wake Word Spotting System Based on Iterative Fine-Tuning in Neural Network Pruning
Figure 3 for A Study of Designing Compact Audio-Visual Wake Word Spotting System Based on Iterative Fine-Tuning in Neural Network Pruning
Figure 4 for A Study of Designing Compact Audio-Visual Wake Word Spotting System Based on Iterative Fine-Tuning in Neural Network Pruning
Viaarxiv icon

The USTC-Ximalaya system for the ICASSP 2022 multi-channel multi-party meeting transcription challenge

Add code
Feb 10, 2022
Figure 1 for The USTC-Ximalaya system for the ICASSP 2022 multi-channel multi-party meeting transcription  challenge
Figure 2 for The USTC-Ximalaya system for the ICASSP 2022 multi-channel multi-party meeting transcription  challenge
Figure 3 for The USTC-Ximalaya system for the ICASSP 2022 multi-channel multi-party meeting transcription  challenge
Figure 4 for The USTC-Ximalaya system for the ICASSP 2022 multi-channel multi-party meeting transcription  challenge
Viaarxiv icon

Information Fusion in Attention Networks Using Adaptive and Multi-level Factorized Bilinear Pooling for Audio-visual Emotion Recognition

Add code
Nov 17, 2021
Figure 1 for Information Fusion in Attention Networks Using Adaptive and Multi-level Factorized Bilinear Pooling for Audio-visual Emotion Recognition
Figure 2 for Information Fusion in Attention Networks Using Adaptive and Multi-level Factorized Bilinear Pooling for Audio-visual Emotion Recognition
Figure 3 for Information Fusion in Attention Networks Using Adaptive and Multi-level Factorized Bilinear Pooling for Audio-visual Emotion Recognition
Figure 4 for Information Fusion in Attention Networks Using Adaptive and Multi-level Factorized Bilinear Pooling for Audio-visual Emotion Recognition
Viaarxiv icon

A Variational Bayesian Approach to Learning Latent Variables for Acoustic Knowledge Transfer

Add code
Oct 16, 2021
Figure 1 for A Variational Bayesian Approach to Learning Latent Variables for Acoustic Knowledge Transfer
Figure 2 for A Variational Bayesian Approach to Learning Latent Variables for Acoustic Knowledge Transfer
Figure 3 for A Variational Bayesian Approach to Learning Latent Variables for Acoustic Knowledge Transfer
Figure 4 for A Variational Bayesian Approach to Learning Latent Variables for Acoustic Knowledge Transfer
Viaarxiv icon

Separation Guided Speaker Diarization in Realistic Mismatched Conditions

Add code
Jul 06, 2021
Figure 1 for Separation Guided Speaker Diarization in Realistic Mismatched Conditions
Figure 2 for Separation Guided Speaker Diarization in Realistic Mismatched Conditions
Figure 3 for Separation Guided Speaker Diarization in Realistic Mismatched Conditions
Figure 4 for Separation Guided Speaker Diarization in Realistic Mismatched Conditions
Viaarxiv icon

A Lottery Ticket Hypothesis Framework for Low-Complexity Device-Robust Neural Acoustic Scene Classification

Add code
Jul 03, 2021
Figure 1 for A Lottery Ticket Hypothesis Framework for Low-Complexity Device-Robust Neural Acoustic Scene Classification
Figure 2 for A Lottery Ticket Hypothesis Framework for Low-Complexity Device-Robust Neural Acoustic Scene Classification
Figure 3 for A Lottery Ticket Hypothesis Framework for Low-Complexity Device-Robust Neural Acoustic Scene Classification
Figure 4 for A Lottery Ticket Hypothesis Framework for Low-Complexity Device-Robust Neural Acoustic Scene Classification
Viaarxiv icon

PATE-AAE: Incorporating Adversarial Autoencoder into Private Aggregation of Teacher Ensembles for Spoken Command Classification

Add code
Apr 02, 2021
Figure 1 for PATE-AAE: Incorporating Adversarial Autoencoder into Private Aggregation of Teacher Ensembles for Spoken Command Classification
Figure 2 for PATE-AAE: Incorporating Adversarial Autoencoder into Private Aggregation of Teacher Ensembles for Spoken Command Classification
Figure 3 for PATE-AAE: Incorporating Adversarial Autoencoder into Private Aggregation of Teacher Ensembles for Spoken Command Classification
Figure 4 for PATE-AAE: Incorporating Adversarial Autoencoder into Private Aggregation of Teacher Ensembles for Spoken Command Classification
Viaarxiv icon