Picture for Jan Černocký

Jan Černocký

Grounding Spoken LLMs in Multi-Speaker Audio via Diarization Conditioning

Add code
Jun 16, 2026
Viaarxiv icon

SpeakerCard-1M: An Evidence-Grounded Speaker Card Corpus for In-the-Wild Speaker Verification

Add code
Jun 03, 2026
Viaarxiv icon

SE-DiCoW: Self-Enrolled Diarization-Conditioned Whisper

Add code
Jan 27, 2026
Viaarxiv icon

DeCRED: Decoder-Centric Regularization for Encoder-Decoder Based Speech Recognition

Add code
Aug 12, 2025
Figure 1 for DeCRED: Decoder-Centric Regularization for Encoder-Decoder Based Speech Recognition
Figure 2 for DeCRED: Decoder-Centric Regularization for Encoder-Decoder Based Speech Recognition
Figure 3 for DeCRED: Decoder-Centric Regularization for Encoder-Decoder Based Speech Recognition
Figure 4 for DeCRED: Decoder-Centric Regularization for Encoder-Decoder Based Speech Recognition
Viaarxiv icon

BUT System for the MLC-SLM Challenge

Add code
Jun 16, 2025
Figure 1 for BUT System for the MLC-SLM Challenge
Figure 2 for BUT System for the MLC-SLM Challenge
Figure 3 for BUT System for the MLC-SLM Challenge
Figure 4 for BUT System for the MLC-SLM Challenge
Viaarxiv icon

Factors affecting the in-context learning abilities of LLMs for dialogue state tracking

Add code
Jun 10, 2025
Viaarxiv icon

Approaching Dialogue State Tracking via Aligning Speech Encoders and LLMs

Add code
Jun 10, 2025
Figure 1 for Approaching Dialogue State Tracking via Aligning Speech Encoders and LLMs
Figure 2 for Approaching Dialogue State Tracking via Aligning Speech Encoders and LLMs
Figure 3 for Approaching Dialogue State Tracking via Aligning Speech Encoders and LLMs
Figure 4 for Approaching Dialogue State Tracking via Aligning Speech Encoders and LLMs
Viaarxiv icon

TS-SUPERB: A Target Speech Processing Benchmark for Speech Self-Supervised Learning Models

Add code
May 10, 2025
Figure 1 for TS-SUPERB: A Target Speech Processing Benchmark for Speech Self-Supervised Learning Models
Figure 2 for TS-SUPERB: A Target Speech Processing Benchmark for Speech Self-Supervised Learning Models
Figure 3 for TS-SUPERB: A Target Speech Processing Benchmark for Speech Self-Supervised Learning Models
Figure 4 for TS-SUPERB: A Target Speech Processing Benchmark for Speech Self-Supervised Learning Models
Viaarxiv icon

DiCoW: Diarization-Conditioned Whisper for Target Speaker Automatic Speech Recognition

Add code
Dec 30, 2024
Figure 1 for DiCoW: Diarization-Conditioned Whisper for Target Speaker Automatic Speech Recognition
Figure 2 for DiCoW: Diarization-Conditioned Whisper for Target Speaker Automatic Speech Recognition
Figure 3 for DiCoW: Diarization-Conditioned Whisper for Target Speaker Automatic Speech Recognition
Figure 4 for DiCoW: Diarization-Conditioned Whisper for Target Speaker Automatic Speech Recognition
Viaarxiv icon

Aligning Pre-trained Models for Spoken Language Translation

Add code
Nov 27, 2024
Viaarxiv icon