Cyril Allauzen

Benchmarking LLMs on the Massive Sound Embedding Benchmark (MSEB)

May 06, 2026
Via arXiv

Massive Sound Embedding Benchmark (MSEB)

Feb 06, 2026
Via arXiv

Multilingual and Fully Non-Autoregressive ASR with Large Language Model Fusion: A Comprehensive Study

Jan 23, 2024
Via arXiv

Large-scale Language Model Rescoring on Long-form Data

Jun 13, 2023
Via arXiv

Alignment Entropy Regularization

Dec 22, 2022
Via arXiv

E2E Segmentation in a Two-Pass Cascaded Encoder ASR Model

Nov 28, 2022
Via arXiv

Global Normalization for Streaming Speech Recognition in a Modular Framework

May 26, 2022
Via arXiv

E2E Segmenter: Joint Segmenting and Decoding for Long-Form ASR

Apr 22, 2022
Via arXiv

A* shortest string decoding for non-idempotent semirings

Apr 14, 2022
Via arXiv

Hybrid Autoregressive Transducer (hat)

Mar 12, 2020
Via arXiv