Hardik B. Sailor

Interpolating Speaker Identities in Embedding Space for Data Expansion

Aug 26, 2025

Incorporating Contextual Paralinguistic Understanding in Large Speech-Language Models

Aug 10, 2025

Beyond Classification: Towards Speech Emotion Reasoning with Multitask AudioLLMs

Jun 07, 2025

Contextual Paralinguistic Data Creation for Multi-Modal Speech-LLM: Data Condensation and Spoken QA Generation

May 19, 2025

MERaLiON-SpeechEncoder: Towards a Speech Foundation Model for Singapore and Beyond

Dec 20, 2024

Towards a Speech Foundation Model for Singapore and Beyond

Dec 16, 2024

Towards Quantifying and Reducing Language Mismatch Effects in Cross-Lingual Speech Anti-Spoofing

Sep 12, 2024

Speech Foundation Model Ensembles for the Controlled Singing Voice Deepfake Detection (CtrSVDD) Challenge 2024

Sep 03, 2024

Attentive Merging of Hidden Embeddings from Pre-trained Speech Model for Anti-spoofing Detection

Jun 12, 2024