Abdelrahman Mohamed

VoiceCraft: Zero-Shot Speech Editing and Text-to-Speech in the Wild

Mar 25, 2024
Puyuan Peng, Po-Yao Huang, Daniel Li, Abdelrahman Mohamed, David Harwath

Via arXiv

Peacock: A Family of Arabic Multimodal Large Language Models and Benchmarks

Mar 01, 2024
Fakhraddin Alwajih, El Moatez Billah Nagoudi, Gagan Bhatia, Abdelrahman Mohamed, Muhammad Abdul-Mageed

Via arXiv

SpeechDPR: End-to-End Spoken Passage Retrieval for Open-Domain Spoken Question Answering

Jan 24, 2024
Chyi-Jiunn Lin, Guan-Ting Lin, Yung-Sung Chuang, Wei-Lun Wu, Shang-Wen Li, Abdelrahman Mohamed, Hung-yi Lee, Lin-shan Lee

Via arXiv

Violet: A Vision-Language Model for Arabic Image Captioning with Gemini Decoder

Nov 15, 2023
Abdelrahman Mohamed, Fakhraddin Alwajih, El Moatez Billah Nagoudi, Alcides Alcoba Inciarte, Muhammad Abdul-Mageed

Via arXiv

SD-HuBERT: Self-Distillation Induces Syllabic Organization in HuBERT

Oct 16, 2023
Cheol Jun Cho, Abdelrahman Mohamed, Shang-Wen Li, Alan W Black, Gopala K. Anumanchipalli

Via arXiv

Self-Supervised Models of Speech Infer Universal Articulatory Kinematics

Oct 16, 2023
Cheol Jun Cho, Abdelrahman Mohamed, Alan W Black, Gopala K. Anumanchipalli

Via arXiv

Findings of the 2023 ML-SUPERB Challenge: Pre-Training and Evaluation over More Languages and Beyond

Oct 09, 2023
Jiatong Shi, William Chen, Dan Berrebbi, Hsiu-Hsuan Wang, Wei-Ping Huang, En-Pei Hu, Ho-Lam Chuang, Xuankai Chang, Yuxun Tang, Shang-Wen Li, Abdelrahman Mohamed, Hung-yi Lee, Shinji Watanabe

Via arXiv

Low-Resource Self-Supervised Learning with SSL-Enhanced TTS

Sep 29, 2023
Po-chun Hsu, Ali Elkahky, Wei-Ning Hsu, Yossi Adi, Tu Anh Nguyen, Jade Copet, Emmanuel Dupoux, Hung-yi Lee, Abdelrahman Mohamed

Via arXiv

AV-SUPERB: A Multi-Task Evaluation Benchmark for Audio-Visual Representation Models

Sep 19, 2023
Yuan Tseng, Layne Berry, Yi-Ting Chen, I-Hsiang Chiu, Hsuan-Hao Lin, Max Liu, Puyuan Peng, Yi-Jen Shih, Hung-Yu Wang, Haibin Wu, Po-Yao Huang, Chun-Mao Lai, Shang-Wen Li, David Harwath, Yu Tsao, Shinji Watanabe, Abdelrahman Mohamed, Chi-Luen Feng, Hung-yi Lee

Via arXiv

Syllable Discovery and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model

May 19, 2023
Puyuan Peng, Shang-Wen Li, Okko Räsänen, Abdelrahman Mohamed, David Harwath

Via arXiv