Alert button
Picture for Ahmed Hussen Abdelaziz

Ahmed Hussen Abdelaziz

Alert button

Can you Remove the Downstream Model for Speaker Recognition with Self-Supervised Speech Features?

Add code
Bookmark button
Alert button
Feb 01, 2024
Zakaria Aldeneh, Takuya Higuchi, Jee-weon Jung, Skyler Seto, Tatiana Likhomanenko, Stephen Shum, Ahmed Hussen Abdelaziz, Shinji Watanabe, Barry-John Theobald

Viaarxiv icon

ESPnet-SPK: full pipeline speaker embedding toolkit with reproducible recipes, self-supervised front-ends, and off-the-shelf models

Add code
Bookmark button
Alert button
Jan 30, 2024
Jee-weon Jung, Wangyou Zhang, Jiatong Shi, Zakaria Aldeneh, Takuya Higuchi, Barry-John Theobald, Ahmed Hussen Abdelaziz, Shinji Watanabe

Viaarxiv icon

Modality Dropout for Multimodal Device Directed Speech Detection using Verbal and Non-Verbal Features

Add code
Bookmark button
Alert button
Oct 23, 2023
Gautam Krishna, Sameer Dharur, Oggi Rudovic, Pranay Dighe, Saurabh Adya, Ahmed Hussen Abdelaziz, Ahmed H Tewfik

Viaarxiv icon

Audiovisual Speech Synthesis using Tacotron2

Add code
Bookmark button
Alert button
Aug 03, 2020
Ahmed Hussen Abdelaziz, Anushree Prasanna Kumar, Chloe Seivwright, Gabriele Fanelli, Justin Binder, Yannis Stylianou, Sachin Kajarekar

Figure 1 for Audiovisual Speech Synthesis using Tacotron2
Figure 2 for Audiovisual Speech Synthesis using Tacotron2
Figure 3 for Audiovisual Speech Synthesis using Tacotron2
Figure 4 for Audiovisual Speech Synthesis using Tacotron2
Viaarxiv icon

Modality Dropout for Improved Performance-driven Talking Faces

Add code
Bookmark button
Alert button
May 27, 2020
Ahmed Hussen Abdelaziz, Barry-John Theobald, Paul Dixon, Reinhard Knothe, Nicholas Apostoloff, Sachin Kajareker

Figure 1 for Modality Dropout for Improved Performance-driven Talking Faces
Figure 2 for Modality Dropout for Improved Performance-driven Talking Faces
Figure 3 for Modality Dropout for Improved Performance-driven Talking Faces
Figure 4 for Modality Dropout for Improved Performance-driven Talking Faces
Viaarxiv icon

Self-supervised Learning of Visual Speech Features with Audiovisual Speech Enhancement

Add code
Bookmark button
Alert button
May 06, 2020
Zakaria Aldeneh, Anushree Prasanna Kumar, Barry-John Theobald, Erik Marchi, Sachin Kajarekar, Devang Naik, Ahmed Hussen Abdelaziz

Figure 1 for Self-supervised Learning of Visual Speech Features with Audiovisual Speech Enhancement
Figure 2 for Self-supervised Learning of Visual Speech Features with Audiovisual Speech Enhancement
Figure 3 for Self-supervised Learning of Visual Speech Features with Audiovisual Speech Enhancement
Figure 4 for Self-supervised Learning of Visual Speech Features with Audiovisual Speech Enhancement
Viaarxiv icon

On Neural Phone Recognition of Mixed-Source ECoG Signals

Add code
Bookmark button
Alert button
Dec 12, 2019
Ahmed Hussen Abdelaziz, Shuo-Yiin Chang, Nelson Morgan, Erik Edwards, Dorothea Kolossa, Dan Ellis, David A. Moses, Edward F. Chang

Figure 1 for On Neural Phone Recognition of Mixed-Source ECoG Signals
Figure 2 for On Neural Phone Recognition of Mixed-Source ECoG Signals
Figure 3 for On Neural Phone Recognition of Mixed-Source ECoG Signals
Figure 4 for On Neural Phone Recognition of Mixed-Source ECoG Signals
Viaarxiv icon

Speaker-Independent Speech-Driven Visual Speech Synthesis using Domain-Adapted Acoustic Models

Add code
Bookmark button
Alert button
May 15, 2019
Ahmed Hussen Abdelaziz, Barry-John Theobald, Justin Binder, Gabriele Fanelli, Paul Dixon, Nicholas Apostoloff, Thibaut Weise, Sachin Kajareker

Figure 1 for Speaker-Independent Speech-Driven Visual Speech Synthesis using Domain-Adapted Acoustic Models
Figure 2 for Speaker-Independent Speech-Driven Visual Speech Synthesis using Domain-Adapted Acoustic Models
Figure 3 for Speaker-Independent Speech-Driven Visual Speech Synthesis using Domain-Adapted Acoustic Models
Viaarxiv icon