Alert button
Picture for Swapnil Bhosale

Swapnil Bhosale

Alert button

Sarcasm in Sight and Sound: Benchmarking and Expansion to Improve Multimodal Sarcasm Detection

Sep 29, 2023
Swapnil Bhosale, Abhra Chaudhuri, Alex Lee Robert Williams, Divyank Tiwari, Anjan Dutta, Xiatian Zhu, Pushpak Bhattacharyya, Diptesh Kanojia

Figure 1 for Sarcasm in Sight and Sound: Benchmarking and Expansion to Improve Multimodal Sarcasm Detection
Figure 2 for Sarcasm in Sight and Sound: Benchmarking and Expansion to Improve Multimodal Sarcasm Detection
Figure 3 for Sarcasm in Sight and Sound: Benchmarking and Expansion to Improve Multimodal Sarcasm Detection
Figure 4 for Sarcasm in Sight and Sound: Benchmarking and Expansion to Improve Multimodal Sarcasm Detection
Viaarxiv icon

Leveraging Foundation models for Unsupervised Audio-Visual Segmentation

Sep 13, 2023
Swapnil Bhosale, Haosen Yang, Diptesh Kanojia, Xiatian Zhu

Figure 1 for Leveraging Foundation models for Unsupervised Audio-Visual Segmentation
Figure 2 for Leveraging Foundation models for Unsupervised Audio-Visual Segmentation
Figure 3 for Leveraging Foundation models for Unsupervised Audio-Visual Segmentation
Figure 4 for Leveraging Foundation models for Unsupervised Audio-Visual Segmentation
Viaarxiv icon

DiffSED: Sound Event Detection with Denoising Diffusion

Aug 16, 2023
Swapnil Bhosale, Sauradip Nag, Diptesh Kanojia, Jiankang Deng, Xiatian Zhu

Figure 1 for DiffSED: Sound Event Detection with Denoising Diffusion
Figure 2 for DiffSED: Sound Event Detection with Denoising Diffusion
Figure 3 for DiffSED: Sound Event Detection with Denoising Diffusion
Figure 4 for DiffSED: Sound Event Detection with Denoising Diffusion
Viaarxiv icon

Text-to-Audio Grounding Based Novel Metric for Evaluating Audio Caption Similarity

Oct 03, 2022
Swapnil Bhosale, Rupayan Chakraborty, Sunil Kumar Kopparapu

Figure 1 for Text-to-Audio Grounding Based Novel Metric for Evaluating Audio Caption Similarity
Figure 2 for Text-to-Audio Grounding Based Novel Metric for Evaluating Audio Caption Similarity
Figure 3 for Text-to-Audio Grounding Based Novel Metric for Evaluating Audio Caption Similarity
Figure 4 for Text-to-Audio Grounding Based Novel Metric for Evaluating Audio Caption Similarity
Viaarxiv icon

Automatic Audio Captioning using Attention weighted Event based Embeddings

Jan 28, 2022
Swapnil Bhosale, Rupayan Chakraborty, Sunil Kumar Kopparapu

Figure 1 for Automatic Audio Captioning using Attention weighted Event based Embeddings
Figure 2 for Automatic Audio Captioning using Attention weighted Event based Embeddings
Figure 3 for Automatic Audio Captioning using Attention weighted Event based Embeddings
Figure 4 for Automatic Audio Captioning using Attention weighted Event based Embeddings
Viaarxiv icon

Automatic Speaker Independent Dysarthric Speech Intelligibility Assessment System

Mar 10, 2021
Ayush Tripathi, Swapnil Bhosale, Sunil Kumar Kopparapu

Figure 1 for Automatic Speaker Independent Dysarthric Speech Intelligibility Assessment System
Figure 2 for Automatic Speaker Independent Dysarthric Speech Intelligibility Assessment System
Figure 3 for Automatic Speaker Independent Dysarthric Speech Intelligibility Assessment System
Figure 4 for Automatic Speaker Independent Dysarthric Speech Intelligibility Assessment System
Viaarxiv icon

Semi Supervised Learning For Few-shot Audio Classification By Episodic Triplet Mining

Feb 16, 2021
Swapnil Bhosale, Rupayan Chakraborty, Sunil Kumar Kopparapu

Figure 1 for Semi Supervised Learning For Few-shot Audio Classification By Episodic Triplet Mining
Figure 2 for Semi Supervised Learning For Few-shot Audio Classification By Episodic Triplet Mining
Figure 3 for Semi Supervised Learning For Few-shot Audio Classification By Episodic Triplet Mining
Figure 4 for Semi Supervised Learning For Few-shot Audio Classification By Episodic Triplet Mining
Viaarxiv icon