Picture for Yiling Huang

Yiling Huang

PNeSM: Arbitrary 3D Scene Stylization via Prompt-Based Neural Style Mapping

Add code
Mar 13, 2024
Viaarxiv icon

DiarizationLM: Speaker Diarization Post-Processing with Large Language Models

Add code
Jan 16, 2024
Viaarxiv icon

ArtBank: Artistic Style Transfer with Pre-trained Diffusion Model and Implicit Style Prompt Bank

Add code
Dec 11, 2023
Viaarxiv icon

Towards Word-Level End-to-End Neural Speaker Diarization with Auxiliary Network

Add code
Sep 15, 2023
Viaarxiv icon

USM-SCD: Multilingual Speaker Change Detection Based on Large Pretrained Foundation Models

Add code
Sep 14, 2023
Viaarxiv icon

Selective inference using randomized group lasso estimators for general models

Add code
Jun 24, 2023
Viaarxiv icon

Augmenting Transformer-Transducer Based Speaker Change Detection With Token-Level Training Loss

Add code
Nov 11, 2022
Viaarxiv icon

Highly Efficient Real-Time Streaming and Fully On-Device Speaker Diarization with Multi-Stage Clustering

Add code
Oct 25, 2022
Viaarxiv icon

Attentive Temporal Pooling for Conformer-based Streaming Language Identification in Long-form Speech

Add code
Mar 21, 2022
Figure 1 for Attentive Temporal Pooling for Conformer-based Streaming Language Identification in Long-form Speech
Figure 2 for Attentive Temporal Pooling for Conformer-based Streaming Language Identification in Long-form Speech
Figure 3 for Attentive Temporal Pooling for Conformer-based Streaming Language Identification in Long-form Speech
Figure 4 for Attentive Temporal Pooling for Conformer-based Streaming Language Identification in Long-form Speech
Viaarxiv icon

Parameter-Free Attentive Scoring for Speaker Verification

Add code
Mar 10, 2022
Figure 1 for Parameter-Free Attentive Scoring for Speaker Verification
Figure 2 for Parameter-Free Attentive Scoring for Speaker Verification
Figure 3 for Parameter-Free Attentive Scoring for Speaker Verification
Figure 4 for Parameter-Free Attentive Scoring for Speaker Verification
Viaarxiv icon