Guanlong Zhao

DiarizationLM: Speaker Diarization Post-Processing with Large Language Models

Jan 16, 2024
Quan Wang, Yiling Huang, Guanlong Zhao, Evan Clark, Wei Xia, Hank Liao

Personalizing Keyword Spotting with Speaker Information

Nov 06, 2023
Beltrán Labrador, Pai Zhu, Guanlong Zhao, Angelo Scorza Scarpati, Quan Wang, Alicia Lozano-Diez, Alex Park, Ignacio López Moreno

Towards Word-Level End-to-End Neural Speaker Diarization with Auxiliary Network

Sep 15, 2023
Yiling Huang, Weiran Wang, Guanlong Zhao, Hank Liao, Wei Xia, Quan Wang

USM-SCD: Multilingual Speaker Change Detection Based on Large Pretrained Foundation Models

Sep 14, 2023
Guanlong Zhao, Yongqiang Wang, Jason Pelecanos, Yu Zhang, Hank Liao, Yiling Huang, Han Lu, Quan Wang

Augmenting Transformer-Transducer Based Speaker Change Detection With Token-Level Training Loss

Nov 11, 2022
Guanlong Zhao, Quan Wang, Han Lu, Yiling Huang, Ignacio Lopez Moreno

Exploring Sequence-to-Sequence Transformer-Transducer Models for Keyword Spotting

Nov 11, 2022
Beltrán Labrador, Guanlong Zhao, Ignacio López Moreno, Angelo Scorza Scarpati, Liam Fowl, Quan Wang

Highly Efficient Real-Time Streaming and Fully On-Device Speaker Diarization with Multi-Stage Clustering

Oct 25, 2022
Quan Wang, Yiling Huang, Han Lu, Guanlong Zhao, Ignacio Lopez Moreno

LSTM Acoustic Models Learn to Align and Pronounce with Graphemes

Aug 13, 2020
Arindrima Datta, Guanlong Zhao, Bhuvana Ramabhadran, Eugene Weinstein

Improved Techniques for Learning to Dehaze and Beyond: A Collective Study

Jul 30, 2018
Yu Liu, Guanlong Zhao, Boyuan Gong, Yang Li, Ritu Raj, Niraj Goel, Satya Kesav, Sandeep Gottimukkala, Zhangyang Wang, Wenqi Ren, Dacheng Tao

PAD-Net: A Perception-Aided Single Image Dehazing Network

May 08, 2018
Yu Liu, Guanlong Zhao
