Zakaria Aldeneh

Can you Remove the Downstream Model for Speaker Recognition with Self-Supervised Speech Features?

Feb 01, 2024
Zakaria Aldeneh, Takuya Higuchi, Jee-weon Jung, Skyler Seto, Tatiana Likhomanenko, Stephen Shum, Ahmed Hussen Abdelaziz, Shinji Watanabe, Barry-John Theobald

ESPnet-SPK: full pipeline speaker embedding toolkit with reproducible recipes, self-supervised front-ends, and off-the-shelf models

Jan 30, 2024
Jee-weon Jung, Wangyou Zhang, Jiatong Shi, Zakaria Aldeneh, Takuya Higuchi, Barry-John Theobald, Ahmed Hussen Abdelaziz, Shinji Watanabe

Spatial LibriSpeech: An Augmented Dataset for Spatial Audio Learning

Aug 18, 2023
Miguel Sarabia, Elena Menyaylenko, Alessandro Toso, Skyler Seto, Zakaria Aldeneh, Shadi Pirhosseinloo, Luca Zappella, Barry-John Theobald, Nicholas Apostoloff, Jonathan Sheaffer

Naturalistic Head Motion Generation from Speech

Oct 26, 2022
Trisha Mittal, Zakaria Aldeneh, Masha Fedzechkina, Anurag Ranjan, Barry-John Theobald

Towards a Perceptual Model for Estimating the Quality of Visual Speech

Mar 24, 2022
Zakaria Aldeneh, Masha Fedzechkina, Skyler Seto, Katherine Metcalf, Miguel Sarabia, Nicholas Apostoloff, Barry-John Theobald

Self-supervised Learning of Visual Speech Features with Audiovisual Speech Enhancement

May 06, 2020
Zakaria Aldeneh, Anushree Prasanna Kumar, Barry-John Theobald, Erik Marchi, Sachin Kajarekar, Devang Naik, Ahmed Hussen Abdelaziz

Controlling for Confounders in Multimodal Emotion Classification via Adversarial Learning

Aug 23, 2019
Mimansa Jaiswal, Zakaria Aldeneh, Emily Mower Provost

MuSE-ing on the Impact of Utterance Ordering On Crowdsourced Emotion Annotations

Mar 27, 2019
Mimansa Jaiswal, Zakaria Aldeneh, Cristian-Paul Bara, Yuanhang Luo, Mihai Burzo, Rada Mihalcea, Emily Mower Provost

Improving End-of-turn Detection in Spoken Dialogues by Detecting Speaker Intentions as a Secondary Task

May 09, 2018
Zakaria Aldeneh, Dimitrios Dimitriadis, Emily Mower Provost
