Picture for Hongbin Suo

Hongbin Suo

The Database and Benchmark for Source Speaker Verification Against Voice Conversion

Add code
Jun 07, 2024
Viaarxiv icon

Outlier-aware Inlier Modeling and Multi-scale Scoring for Anomalous Sound Detection via Multitask Learning

Add code
Sep 14, 2023
Viaarxiv icon

Task-Agnostic Structured Pruning of Speech Representation Models

Add code
Jun 02, 2023
Viaarxiv icon

Multilingual Zero Resource Speech Recognition Base on Self-Supervise Pre-Trained Acoustic Models

Add code
Oct 13, 2022
Figure 1 for Multilingual Zero Resource Speech Recognition Base on Self-Supervise Pre-Trained Acoustic Models
Figure 2 for Multilingual Zero Resource Speech Recognition Base on Self-Supervise Pre-Trained Acoustic Models
Figure 3 for Multilingual Zero Resource Speech Recognition Base on Self-Supervise Pre-Trained Acoustic Models
Figure 4 for Multilingual Zero Resource Speech Recognition Base on Self-Supervise Pre-Trained Acoustic Models
Viaarxiv icon

PRISM: Pre-trained Indeterminate Speaker Representation Model for Speaker Diarization and Speaker Verification

Add code
May 16, 2022
Figure 1 for PRISM: Pre-trained Indeterminate Speaker Representation Model for Speaker Diarization and Speaker Verification
Figure 2 for PRISM: Pre-trained Indeterminate Speaker Representation Model for Speaker Diarization and Speaker Verification
Figure 3 for PRISM: Pre-trained Indeterminate Speaker Representation Model for Speaker Diarization and Speaker Verification
Figure 4 for PRISM: Pre-trained Indeterminate Speaker Representation Model for Speaker Diarization and Speaker Verification
Viaarxiv icon

Reformulating Speaker Diarization as Community Detection With Emphasis On Topological Structure

Add code
Apr 26, 2022
Figure 1 for Reformulating Speaker Diarization as Community Detection With Emphasis On Topological Structure
Figure 2 for Reformulating Speaker Diarization as Community Detection With Emphasis On Topological Structure
Figure 3 for Reformulating Speaker Diarization as Community Detection With Emphasis On Topological Structure
Figure 4 for Reformulating Speaker Diarization as Community Detection With Emphasis On Topological Structure
Viaarxiv icon

Graph Convolutional Network Based Semi-Supervised Learning on Multi-Speaker Meeting Data

Add code
Apr 25, 2022
Figure 1 for Graph Convolutional Network Based Semi-Supervised Learning on Multi-Speaker Meeting Data
Figure 2 for Graph Convolutional Network Based Semi-Supervised Learning on Multi-Speaker Meeting Data
Figure 3 for Graph Convolutional Network Based Semi-Supervised Learning on Multi-Speaker Meeting Data
Figure 4 for Graph Convolutional Network Based Semi-Supervised Learning on Multi-Speaker Meeting Data
Viaarxiv icon

BeamTransformer: Microphone Array-based Overlapping Speech Detection

Add code
Sep 09, 2021
Figure 1 for BeamTransformer: Microphone Array-based Overlapping Speech Detection
Figure 2 for BeamTransformer: Microphone Array-based Overlapping Speech Detection
Figure 3 for BeamTransformer: Microphone Array-based Overlapping Speech Detection
Figure 4 for BeamTransformer: Microphone Array-based Overlapping Speech Detection
Viaarxiv icon

A Real-time Speaker Diarization System Based on Spatial Spectrum

Add code
Jul 20, 2021
Figure 1 for A Real-time Speaker Diarization System Based on Spatial Spectrum
Figure 2 for A Real-time Speaker Diarization System Based on Spatial Spectrum
Figure 3 for A Real-time Speaker Diarization System Based on Spatial Spectrum
Figure 4 for A Real-time Speaker Diarization System Based on Spatial Spectrum
Viaarxiv icon