Kazi Tamanna Alam

WhisQ: Cross-Modal Representation Learning for Text-to-Music MOS Prediction

Jun 06, 2025
Via arXiv

Whisper Speaker Identification: Leveraging Pre-Trained Multilingual Transformers for Robust Speaker Embeddings

Mar 13, 2025
Via arXiv