Picture for Amirbek Djanibekov

Amirbek Djanibekov

SparQLe: Speech Queries to Text Translation Through LLMs

Add code
Feb 13, 2025
Figure 1 for SparQLe: Speech Queries to Text Translation Through LLMs
Figure 2 for SparQLe: Speech Queries to Text Translation Through LLMs
Figure 3 for SparQLe: Speech Queries to Text Translation Through LLMs
Figure 4 for SparQLe: Speech Queries to Text Translation Through LLMs
Viaarxiv icon

Music for All: Exploring Multicultural Representations in Music Generation Models

Add code
Feb 12, 2025
Figure 1 for Music for All: Exploring Multicultural Representations in Music Generation Models
Figure 2 for Music for All: Exploring Multicultural Representations in Music Generation Models
Figure 3 for Music for All: Exploring Multicultural Representations in Music Generation Models
Figure 4 for Music for All: Exploring Multicultural Representations in Music Generation Models
Viaarxiv icon

All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages

Add code
Nov 25, 2024
Figure 1 for All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages
Figure 2 for All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages
Figure 3 for All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages
Figure 4 for All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages
Viaarxiv icon

Dialectal Coverage And Generalization in Arabic Speech Recognition

Add code
Nov 07, 2024
Figure 1 for Dialectal Coverage And Generalization in Arabic Speech Recognition
Figure 2 for Dialectal Coverage And Generalization in Arabic Speech Recognition
Figure 3 for Dialectal Coverage And Generalization in Arabic Speech Recognition
Figure 4 for Dialectal Coverage And Generalization in Arabic Speech Recognition
Viaarxiv icon

SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages

Add code
Jun 14, 2024
Figure 1 for SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages
Figure 2 for SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages
Figure 3 for SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages
Figure 4 for SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages
Viaarxiv icon

ArTST: Arabic Text and Speech Transformer

Add code
Oct 25, 2023
Viaarxiv icon