Guanlong Zhao

DiarizationLM: Speaker Diarization Post-Processing with Large Language Models

Jan 16, 2024
Via arXiv

Personalizing Keyword Spotting with Speaker Information

Nov 06, 2023
Via arXiv

Towards Word-Level End-to-End Neural Speaker Diarization with Auxiliary Network

Sep 15, 2023
Via arXiv

USM-SCD: Multilingual Speaker Change Detection Based on Large Pretrained Foundation Models

Sep 14, 2023
Via arXiv

Exploring Sequence-to-Sequence Transformer-Transducer Models for Keyword Spotting

Nov 11, 2022
Via arXiv

Augmenting Transformer-Transducer Based Speaker Change Detection With Token-Level Training Loss

Nov 11, 2022
Via arXiv

Highly Efficient Real-Time Streaming and Fully On-Device Speaker Diarization with Multi-Stage Clustering

Oct 25, 2022
Via arXiv

LSTM Acoustic Models Learn to Align and Pronounce with Graphemes

Aug 13, 2020
Via arXiv

Improved Techniques for Learning to Dehaze and Beyond: A Collective Study

Jul 30, 2018
Via arXiv

PAD-Net: A Perception-Aided Single Image Dehazing Network

May 08, 2018
Via arXiv