
Naohiro Tawara

Mitigating Non-Target Speaker Bias in Guided Speaker Embedding

Jun 14, 2025
Via arXiv

Dissecting the Segmentation Model of End-to-End Diarization with Vector Clustering

Jun 13, 2025
Via arXiv

Pretraining Multi-Speaker Identification for Neural Speaker Diarization

May 30, 2025
Via arXiv

Guided Speaker Embedding

Oct 16, 2024
Via arXiv

Mamba-based Segmentation Model for Speaker Diarization

Oct 10, 2024
Via arXiv

NTT Multi-Speaker ASR System for the DASR Task of CHiME-8 Challenge

Sep 09, 2024
Via arXiv

Recursive Attentive Pooling for Extracting Speaker Embeddings from Multi-Speaker Recordings

Aug 30, 2024
Via arXiv

Interaural time difference loss for binaural target sound extraction

Aug 01, 2024
Via arXiv

Applying LLMs for Rescoring N-best ASR Hypotheses of Casual Conversations: Effects of Domain Adaptation and Context Carry-over

Jun 27, 2024
Via arXiv

BLSTM-Based Confidence Estimation for End-to-End Speech Recognition

Dec 22, 2023
Via arXiv