Yiling Huang

PNeSM: Arbitrary 3D Scene Stylization via Prompt-Based Neural Style Mapping

Mar 13, 2024
Via arXiv

DiarizationLM: Speaker Diarization Post-Processing with Large Language Models

Jan 16, 2024
Via arXiv

ArtBank: Artistic Style Transfer with Pre-trained Diffusion Model and Implicit Style Prompt Bank

Dec 11, 2023
Via arXiv

Towards Word-Level End-to-End Neural Speaker Diarization with Auxiliary Network

Sep 15, 2023
Via arXiv

USM-SCD: Multilingual Speaker Change Detection Based on Large Pretrained Foundation Models

Sep 14, 2023
Via arXiv

Selective inference using randomized group lasso estimators for general models

Jun 24, 2023
Via arXiv

Augmenting Transformer-Transducer Based Speaker Change Detection With Token-Level Training Loss

Nov 11, 2022
Via arXiv

Highly Efficient Real-Time Streaming and Fully On-Device Speaker Diarization with Multi-Stage Clustering

Oct 25, 2022
Via arXiv

Attentive Temporal Pooling for Conformer-based Streaming Language Identification in Long-form Speech

Mar 21, 2022
Via arXiv

Parameter-Free Attentive Scoring for Speaker Verification

Mar 10, 2022
Via arXiv