Picture for Barry-John Theobald

Barry-John Theobald

Learning Spatially-Aware Language and Audio Embedding

Add code
Sep 17, 2024
Figure 1 for Learning Spatially-Aware Language and Audio Embedding
Figure 2 for Learning Spatially-Aware Language and Audio Embedding
Figure 3 for Learning Spatially-Aware Language and Audio Embedding
Figure 4 for Learning Spatially-Aware Language and Audio Embedding
Viaarxiv icon

Towards Automatic Assessment of Self-Supervised Speech Models using Rank

Add code
Sep 16, 2024
Figure 1 for Towards Automatic Assessment of Self-Supervised Speech Models using Rank
Figure 2 for Towards Automatic Assessment of Self-Supervised Speech Models using Rank
Figure 3 for Towards Automatic Assessment of Self-Supervised Speech Models using Rank
Figure 4 for Towards Automatic Assessment of Self-Supervised Speech Models using Rank
Viaarxiv icon

Exploring Prediction Targets in Masked Pre-Training for Speech Foundation Models

Add code
Sep 16, 2024
Figure 1 for Exploring Prediction Targets in Masked Pre-Training for Speech Foundation Models
Figure 2 for Exploring Prediction Targets in Masked Pre-Training for Speech Foundation Models
Figure 3 for Exploring Prediction Targets in Masked Pre-Training for Speech Foundation Models
Figure 4 for Exploring Prediction Targets in Masked Pre-Training for Speech Foundation Models
Viaarxiv icon

Speaker-IPL: Unsupervised Learning of Speaker Characteristics with i-Vector based Pseudo-Labels

Add code
Sep 16, 2024
Figure 1 for Speaker-IPL: Unsupervised Learning of Speaker Characteristics with i-Vector based Pseudo-Labels
Figure 2 for Speaker-IPL: Unsupervised Learning of Speaker Characteristics with i-Vector based Pseudo-Labels
Figure 3 for Speaker-IPL: Unsupervised Learning of Speaker Characteristics with i-Vector based Pseudo-Labels
Figure 4 for Speaker-IPL: Unsupervised Learning of Speaker Characteristics with i-Vector based Pseudo-Labels
Viaarxiv icon

On the Limited Generalization Capability of the Implicit Reward Model Induced by Direct Preference Optimization

Add code
Sep 05, 2024
Figure 1 for On the Limited Generalization Capability of the Implicit Reward Model Induced by Direct Preference Optimization
Figure 2 for On the Limited Generalization Capability of the Implicit Reward Model Induced by Direct Preference Optimization
Figure 3 for On the Limited Generalization Capability of the Implicit Reward Model Induced by Direct Preference Optimization
Figure 4 for On the Limited Generalization Capability of the Implicit Reward Model Induced by Direct Preference Optimization
Viaarxiv icon

Sample-Efficient Preference-based Reinforcement Learning with Dynamics Aware Rewards

Add code
Feb 28, 2024
Viaarxiv icon

Can you Remove the Downstream Model for Speaker Recognition with Self-Supervised Speech Features?

Add code
Feb 01, 2024
Figure 1 for Can you Remove the Downstream Model for Speaker Recognition with Self-Supervised Speech Features?
Figure 2 for Can you Remove the Downstream Model for Speaker Recognition with Self-Supervised Speech Features?
Figure 3 for Can you Remove the Downstream Model for Speaker Recognition with Self-Supervised Speech Features?
Figure 4 for Can you Remove the Downstream Model for Speaker Recognition with Self-Supervised Speech Features?
Viaarxiv icon

ESPnet-SPK: full pipeline speaker embedding toolkit with reproducible recipes, self-supervised front-ends, and off-the-shelf models

Add code
Jan 30, 2024
Figure 1 for ESPnet-SPK: full pipeline speaker embedding toolkit with reproducible recipes, self-supervised front-ends, and off-the-shelf models
Figure 2 for ESPnet-SPK: full pipeline speaker embedding toolkit with reproducible recipes, self-supervised front-ends, and off-the-shelf models
Figure 3 for ESPnet-SPK: full pipeline speaker embedding toolkit with reproducible recipes, self-supervised front-ends, and off-the-shelf models
Figure 4 for ESPnet-SPK: full pipeline speaker embedding toolkit with reproducible recipes, self-supervised front-ends, and off-the-shelf models
Viaarxiv icon

REALM: Robust Entropy Adaptive Loss Minimization for Improved Single-Sample Test-Time Adaptation

Add code
Sep 07, 2023
Figure 1 for REALM: Robust Entropy Adaptive Loss Minimization for Improved Single-Sample Test-Time Adaptation
Figure 2 for REALM: Robust Entropy Adaptive Loss Minimization for Improved Single-Sample Test-Time Adaptation
Figure 3 for REALM: Robust Entropy Adaptive Loss Minimization for Improved Single-Sample Test-Time Adaptation
Figure 4 for REALM: Robust Entropy Adaptive Loss Minimization for Improved Single-Sample Test-Time Adaptation
Viaarxiv icon

Spatial LibriSpeech: An Augmented Dataset for Spatial Audio Learning

Add code
Aug 18, 2023
Figure 1 for Spatial LibriSpeech: An Augmented Dataset for Spatial Audio Learning
Figure 2 for Spatial LibriSpeech: An Augmented Dataset for Spatial Audio Learning
Figure 3 for Spatial LibriSpeech: An Augmented Dataset for Spatial Audio Learning
Figure 4 for Spatial LibriSpeech: An Augmented Dataset for Spatial Audio Learning
Viaarxiv icon