Peter Bell

Fusing ASR Outputs in Joint Training for Speech Emotion Recognition

Oct 29, 2021

It's not what you said, it's how you said it: discriminative perception of speech as a multichannel communication system

May 01, 2021

Segmenting Subtitles for Correcting ASR Segmentation Errors

Apr 16, 2021

Train your classifier first: Cascade Neural Networks Training from upper layers to lower layers

Feb 09, 2021

Enhancing Human Pose Estimation in Ancient Vase Paintings via Perceptually-grounded Style Transfer Learning

Dec 10, 2020

On the Usefulness of Self-Attention for Automatic Speech Recognition with Transformers

Nov 08, 2020

Stochastic Attention Head Removal: A Simple and Effective Method for Improving Automatic Speech Recognition with Transformers

Nov 08, 2020

Leveraging speaker attribute information using multi task learning for speaker verification and diarization

Oct 27, 2020

Subtitles to Segmentation: Improving Low-Resource Speech-to-Text Translation Pipelines

Oct 19, 2020

Understanding Compositional Structures in Art Historical Images using Pose and Gaze Priors

Sep 08, 2020