Picture for Katrin Kirchhoff

Katrin Kirchhoff

SpeechVerse: A Large-scale Generalizable Audio Language Model

Add code
May 14, 2024
Figure 1 for SpeechVerse: A Large-scale Generalizable Audio Language Model
Figure 2 for SpeechVerse: A Large-scale Generalizable Audio Language Model
Figure 3 for SpeechVerse: A Large-scale Generalizable Audio Language Model
Figure 4 for SpeechVerse: A Large-scale Generalizable Audio Language Model
Viaarxiv icon

SpeechGuard: Exploring the Adversarial Robustness of Multimodal Large Language Models

Add code
May 14, 2024
Viaarxiv icon

AutoGluon-Multimodal (AutoMM): Supercharging Multimodal AutoML with Foundation Models

Add code
Apr 30, 2024
Figure 1 for AutoGluon-Multimodal (AutoMM): Supercharging Multimodal AutoML with Foundation Models
Figure 2 for AutoGluon-Multimodal (AutoMM): Supercharging Multimodal AutoML with Foundation Models
Figure 3 for AutoGluon-Multimodal (AutoMM): Supercharging Multimodal AutoML with Foundation Models
Figure 4 for AutoGluon-Multimodal (AutoMM): Supercharging Multimodal AutoML with Foundation Models
Viaarxiv icon

Don't Stop Self-Supervision: Accent Adaptation of Speech Representations via Residual Adapters

Add code
Jul 02, 2023
Figure 1 for Don't Stop Self-Supervision: Accent Adaptation of Speech Representations via Residual Adapters
Figure 2 for Don't Stop Self-Supervision: Accent Adaptation of Speech Representations via Residual Adapters
Figure 3 for Don't Stop Self-Supervision: Accent Adaptation of Speech Representations via Residual Adapters
Figure 4 for Don't Stop Self-Supervision: Accent Adaptation of Speech Representations via Residual Adapters
Viaarxiv icon

DCTX-Conformer: Dynamic context carry-over for low latency unified streaming and non-streaming Conformer

Add code
Jun 13, 2023
Figure 1 for DCTX-Conformer: Dynamic context carry-over for low latency unified streaming and non-streaming Conformer
Figure 2 for DCTX-Conformer: Dynamic context carry-over for low latency unified streaming and non-streaming Conformer
Figure 3 for DCTX-Conformer: Dynamic context carry-over for low latency unified streaming and non-streaming Conformer
Figure 4 for DCTX-Conformer: Dynamic context carry-over for low latency unified streaming and non-streaming Conformer
Viaarxiv icon

Mask The Bias: Improving Domain-Adaptive Generalization of CTC-based ASR with Internal Language Model Estimation

Add code
May 05, 2023
Figure 1 for Mask The Bias: Improving Domain-Adaptive Generalization of CTC-based ASR with Internal Language Model Estimation
Figure 2 for Mask The Bias: Improving Domain-Adaptive Generalization of CTC-based ASR with Internal Language Model Estimation
Figure 3 for Mask The Bias: Improving Domain-Adaptive Generalization of CTC-based ASR with Internal Language Model Estimation
Figure 4 for Mask The Bias: Improving Domain-Adaptive Generalization of CTC-based ASR with Internal Language Model Estimation
Viaarxiv icon

Rethinking the Role of Scale for In-Context Learning: An Interpretability-based Case Study at 66 Billion Scale

Add code
Dec 18, 2022
Figure 1 for Rethinking the Role of Scale for In-Context Learning: An Interpretability-based Case Study at 66 Billion Scale
Figure 2 for Rethinking the Role of Scale for In-Context Learning: An Interpretability-based Case Study at 66 Billion Scale
Figure 3 for Rethinking the Role of Scale for In-Context Learning: An Interpretability-based Case Study at 66 Billion Scale
Figure 4 for Rethinking the Role of Scale for In-Context Learning: An Interpretability-based Case Study at 66 Billion Scale
Viaarxiv icon

Device Directedness with Contextual Cues for Spoken Dialog Systems

Add code
Nov 23, 2022
Figure 1 for Device Directedness with Contextual Cues for Spoken Dialog Systems
Figure 2 for Device Directedness with Contextual Cues for Spoken Dialog Systems
Figure 3 for Device Directedness with Contextual Cues for Spoken Dialog Systems
Figure 4 for Device Directedness with Contextual Cues for Spoken Dialog Systems
Viaarxiv icon

Personalization of CTC Speech Recognition Models

Add code
Oct 18, 2022
Figure 1 for Personalization of CTC Speech Recognition Models
Figure 2 for Personalization of CTC Speech Recognition Models
Figure 3 for Personalization of CTC Speech Recognition Models
Figure 4 for Personalization of CTC Speech Recognition Models
Viaarxiv icon

Self-Supervised Speech Representation Learning: A Review

Add code
May 21, 2022
Figure 1 for Self-Supervised Speech Representation Learning: A Review
Figure 2 for Self-Supervised Speech Representation Learning: A Review
Figure 3 for Self-Supervised Speech Representation Learning: A Review
Figure 4 for Self-Supervised Speech Representation Learning: A Review
Viaarxiv icon