Title: Context-Driven Dynamic Pruning for Large Speech Foundation Models

May 24, 2025

Masao Someki, Shikhar Bharadwaj, Atharva Anand Joshi, Chyi-Jiunn Lin, Jinchuan Tian, Jee-weon Jung, Markus Müller, Nathan Susanj, Jing Liu, Shinji Watanabe

Figures 1-4 for Context-Driven Dynamic Pruning for Large Speech Foundation Models

Abstract: Speech foundation models generalize well across languages and acoustic conditions, but they require significant computational resources for inference. For these models, prior work has studied pruning techniques that dynamically optimize the model structure for the target audio by leveraging external context. In this work, we extend this line of research and propose context-driven dynamic pruning, a technique that adapts the model's computation during inference to both the context between different input frames and additional external context. We employ the Open Whisper-style Speech Model (OWSM) and incorporate speaker embeddings, acoustic event embeddings, and language information as additional context. By incorporating the speaker embedding, our method reduces computation by 56.7 GFLOPs while improving BLEU scores by a relative 25.7% compared to the fully fine-tuned OWSM model.

* Accepted at Interspeech 2025
