Alert button
Picture for Soham Deshmukh

Soham Deshmukh

Alert button

Microsoft

Domain Adaptation for Contrastive Audio-Language Models

Add code
Bookmark button
Alert button
Feb 14, 2024
Soham Deshmukh, Rita Singh, Bhiksha Raj

Viaarxiv icon

PAM: Prompting Audio-Language Models for Audio Quality Assessment

Add code
Bookmark button
Alert button
Feb 01, 2024
Soham Deshmukh, Dareen Alharthi, Benjamin Elizalde, Hannes Gamper, Mahmoud Al Ismail, Rita Singh, Bhiksha Raj, Huaming Wang

Viaarxiv icon

Prompting Audios Using Acoustic Properties For Emotion Representation

Add code
Bookmark button
Alert button
Oct 05, 2023
Hira Dhamyal, Benjamin Elizalde, Soham Deshmukh, Huaming Wang, Bhiksha Raj, Rita Singh

Figure 1 for Prompting Audios Using Acoustic Properties For Emotion Representation
Figure 2 for Prompting Audios Using Acoustic Properties For Emotion Representation
Figure 3 for Prompting Audios Using Acoustic Properties For Emotion Representation
Figure 4 for Prompting Audios Using Acoustic Properties For Emotion Representation
Viaarxiv icon

LoFT: Local Proxy Fine-tuning For Improving Transferability Of Adversarial Attacks Against Large Language Model

Add code
Bookmark button
Alert button
Oct 02, 2023
Muhammad Ahmed Shah, Roshan Sharma, Hira Dhamyal, Raphael Olivier, Ankit Shah, Dareen Alharthi, Hazim T Bukhari, Massa Baali, Soham Deshmukh, Michael Kuhlmann, Bhiksha Raj, Rita Singh

Figure 1 for LoFT: Local Proxy Fine-tuning For Improving Transferability Of Adversarial Attacks Against Large Language Model
Figure 2 for LoFT: Local Proxy Fine-tuning For Improving Transferability Of Adversarial Attacks Against Large Language Model
Figure 3 for LoFT: Local Proxy Fine-tuning For Improving Transferability Of Adversarial Attacks Against Large Language Model
Figure 4 for LoFT: Local Proxy Fine-tuning For Improving Transferability Of Adversarial Attacks Against Large Language Model
Viaarxiv icon

Training Audio Captioning Models without Audio

Add code
Bookmark button
Alert button
Sep 14, 2023
Soham Deshmukh, Benjamin Elizalde, Dimitra Emmanouilidou, Bhiksha Raj, Rita Singh, Huaming Wang

Figure 1 for Training Audio Captioning Models without Audio
Figure 2 for Training Audio Captioning Models without Audio
Figure 3 for Training Audio Captioning Models without Audio
Figure 4 for Training Audio Captioning Models without Audio
Viaarxiv icon

Natural Language Supervision for General-Purpose Audio Representations

Add code
Bookmark button
Alert button
Sep 11, 2023
Benjamin Elizalde, Soham Deshmukh, Huaming Wang

Viaarxiv icon

Pengi: An Audio Language Model for Audio Tasks

Add code
Bookmark button
Alert button
May 19, 2023
Soham Deshmukh, Benjamin Elizalde, Rita Singh, Huaming Wang

Figure 1 for Pengi: An Audio Language Model for Audio Tasks
Figure 2 for Pengi: An Audio Language Model for Audio Tasks
Figure 3 for Pengi: An Audio Language Model for Audio Tasks
Figure 4 for Pengi: An Audio Language Model for Audio Tasks
Viaarxiv icon

Synergy between human and machine approaches to sound/scene recognition and processing: An overview of ICASSP special session

Add code
Bookmark button
Alert button
Feb 24, 2023
Laurie M. Heller, Benjamin Elizalde, Bhiksha Raj, Soham Deshmukh

Viaarxiv icon

Describing emotions with acoustic property prompts for speech emotion recognition

Add code
Bookmark button
Alert button
Nov 14, 2022
Hira Dhamyal, Benjamin Elizalde, Soham Deshmukh, Huaming Wang, Bhiksha Raj, Rita Singh

Figure 1 for Describing emotions with acoustic property prompts for speech emotion recognition
Figure 2 for Describing emotions with acoustic property prompts for speech emotion recognition
Figure 3 for Describing emotions with acoustic property prompts for speech emotion recognition
Figure 4 for Describing emotions with acoustic property prompts for speech emotion recognition
Viaarxiv icon

Audio Retrieval with WavText5K and CLAP Training

Add code
Bookmark button
Alert button
Sep 28, 2022
Soham Deshmukh, Benjamin Elizalde, Huaming Wang

Figure 1 for Audio Retrieval with WavText5K and CLAP Training
Figure 2 for Audio Retrieval with WavText5K and CLAP Training
Figure 3 for Audio Retrieval with WavText5K and CLAP Training
Figure 4 for Audio Retrieval with WavText5K and CLAP Training
Viaarxiv icon