Picture for Abdelrahman Mohamed

Abdelrahman Mohamed

SpeechDPR: End-to-End Spoken Passage Retrieval for Open-Domain Spoken Question Answering

Add code
Jan 24, 2024
Figure 1 for SpeechDPR: End-to-End Spoken Passage Retrieval for Open-Domain Spoken Question Answering
Figure 2 for SpeechDPR: End-to-End Spoken Passage Retrieval for Open-Domain Spoken Question Answering
Figure 3 for SpeechDPR: End-to-End Spoken Passage Retrieval for Open-Domain Spoken Question Answering
Viaarxiv icon

Violet: A Vision-Language Model for Arabic Image Captioning with Gemini Decoder

Add code
Nov 15, 2023
Figure 1 for Violet: A Vision-Language Model for Arabic Image Captioning with Gemini Decoder
Figure 2 for Violet: A Vision-Language Model for Arabic Image Captioning with Gemini Decoder
Figure 3 for Violet: A Vision-Language Model for Arabic Image Captioning with Gemini Decoder
Figure 4 for Violet: A Vision-Language Model for Arabic Image Captioning with Gemini Decoder
Viaarxiv icon

SD-HuBERT: Self-Distillation Induces Syllabic Organization in HuBERT

Add code
Oct 16, 2023
Figure 1 for SD-HuBERT: Self-Distillation Induces Syllabic Organization in HuBERT
Figure 2 for SD-HuBERT: Self-Distillation Induces Syllabic Organization in HuBERT
Figure 3 for SD-HuBERT: Self-Distillation Induces Syllabic Organization in HuBERT
Figure 4 for SD-HuBERT: Self-Distillation Induces Syllabic Organization in HuBERT
Viaarxiv icon

Self-Supervised Models of Speech Infer Universal Articulatory Kinematics

Add code
Oct 16, 2023
Figure 1 for Self-Supervised Models of Speech Infer Universal Articulatory Kinematics
Figure 2 for Self-Supervised Models of Speech Infer Universal Articulatory Kinematics
Figure 3 for Self-Supervised Models of Speech Infer Universal Articulatory Kinematics
Figure 4 for Self-Supervised Models of Speech Infer Universal Articulatory Kinematics
Viaarxiv icon

Findings of the 2023 ML-SUPERB Challenge: Pre-Training and Evaluation over More Languages and Beyond

Add code
Oct 09, 2023
Figure 1 for Findings of the 2023 ML-SUPERB Challenge: Pre-Training and Evaluation over More Languages and Beyond
Figure 2 for Findings of the 2023 ML-SUPERB Challenge: Pre-Training and Evaluation over More Languages and Beyond
Figure 3 for Findings of the 2023 ML-SUPERB Challenge: Pre-Training and Evaluation over More Languages and Beyond
Figure 4 for Findings of the 2023 ML-SUPERB Challenge: Pre-Training and Evaluation over More Languages and Beyond
Viaarxiv icon

Low-Resource Self-Supervised Learning with SSL-Enhanced TTS

Add code
Sep 29, 2023
Figure 1 for Low-Resource Self-Supervised Learning with SSL-Enhanced TTS
Figure 2 for Low-Resource Self-Supervised Learning with SSL-Enhanced TTS
Figure 3 for Low-Resource Self-Supervised Learning with SSL-Enhanced TTS
Figure 4 for Low-Resource Self-Supervised Learning with SSL-Enhanced TTS
Viaarxiv icon

AV-SUPERB: A Multi-Task Evaluation Benchmark for Audio-Visual Representation Models

Add code
Sep 19, 2023
Figure 1 for AV-SUPERB: A Multi-Task Evaluation Benchmark for Audio-Visual Representation Models
Figure 2 for AV-SUPERB: A Multi-Task Evaluation Benchmark for Audio-Visual Representation Models
Figure 3 for AV-SUPERB: A Multi-Task Evaluation Benchmark for Audio-Visual Representation Models
Viaarxiv icon

Syllable Discovery and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Mode

Add code
May 19, 2023
Figure 1 for Syllable Discovery and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Mode
Figure 2 for Syllable Discovery and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Mode
Figure 3 for Syllable Discovery and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Mode
Figure 4 for Syllable Discovery and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Mode
Viaarxiv icon

ML-SUPERB: Multilingual Speech Universal PERformance Benchmark

Add code
May 18, 2023
Figure 1 for ML-SUPERB: Multilingual Speech Universal PERformance Benchmark
Figure 2 for ML-SUPERB: Multilingual Speech Universal PERformance Benchmark
Viaarxiv icon

Efficient Speech Representation Learning with Low-Bit Quantization

Add code
Dec 14, 2022
Figure 1 for Efficient Speech Representation Learning with Low-Bit Quantization
Figure 2 for Efficient Speech Representation Learning with Low-Bit Quantization
Figure 3 for Efficient Speech Representation Learning with Low-Bit Quantization
Viaarxiv icon