Picture for Hanan Aldarmaki

Hanan Aldarmaki

ArVoice: A Multi-Speaker Dataset for Arabic Speech Synthesis

Add code
May 26, 2025
Viaarxiv icon

Voice of a Continent: Mapping Africa's Speech Technology Frontier

Add code
May 24, 2025
Viaarxiv icon

JEEM: Vision-Language Understanding in Four Arabic Dialects

Add code
Mar 27, 2025
Viaarxiv icon

Infant Cry Detection Using Causal Temporal Representation

Add code
Mar 08, 2025
Viaarxiv icon

SparQLe: Speech Queries to Text Translation Through LLMs

Add code
Feb 13, 2025
Viaarxiv icon

Dialectal Coverage And Generalization in Arabic Speech Recognition

Add code
Nov 07, 2024
Viaarxiv icon

STTATTS: Unified Speech-To-Text And Text-To-Speech Model

Add code
Oct 24, 2024
Viaarxiv icon

RelUNet: Relative Channel Fusion U-Net for Multichannel Speech Enhancement

Add code
Oct 07, 2024
Figure 1 for RelUNet: Relative Channel Fusion U-Net for Multichannel Speech Enhancement
Figure 2 for RelUNet: Relative Channel Fusion U-Net for Multichannel Speech Enhancement
Figure 3 for RelUNet: Relative Channel Fusion U-Net for Multichannel Speech Enhancement
Figure 4 for RelUNet: Relative Channel Fusion U-Net for Multichannel Speech Enhancement
Viaarxiv icon

PALM: Few-Shot Prompt Learning for Audio Language Models

Add code
Sep 29, 2024
Viaarxiv icon

Mixat: A Data Set of Bilingual Emirati-English Speech

Add code
May 04, 2024
Viaarxiv icon