Yifan Gong

Fred

Reverse Engineering of Imperceptible Adversarial Image Perturbations

Apr 01, 2022

Endpoint Detection for Streaming End-to-End Multi-talker ASR

Jan 24, 2022

Automatic Mapping of the Best-Suited DNN Pruning Schemes for Real-Time Mobile Acceleration

Nov 22, 2021

MEST: Accurate and Fast Memory-Economic Sparse Training Framework on the Edge

Oct 26, 2021

Internal Language Model Adaptation with Text-Only Data for End-to-End Speech Recognition

Oct 14, 2021

Have best of both worlds: two-pass hybrid and E2E cascading framework for speech recognition

Oct 10, 2021

Diarisation using location tracking with agglomerative clustering

Sep 24, 2021

Joint speaker diarisation and tracking in switching state-space model

Sep 23, 2021

Achieving on-Mobile Real-Time Super-Resolution with Neural Architecture and Pruning Search

Aug 18, 2021

Minimum Word Error Rate Training with Language Model Fusion for End-to-End Speech Recognition

Jun 04, 2021