Swapnil Bhosale

AV-GS: Learning Material and Geometry Aware Priors for Novel View Acoustic Synthesis

Jun 14, 2024
Via arXiv

Unsupervised Audio-Visual Segmentation with Modality Alignment

Mar 21, 2024
Via arXiv

Sarcasm in Sight and Sound: Benchmarking and Expansion to Improve Multimodal Sarcasm Detection

Sep 29, 2023
Via arXiv

Leveraging Foundation models for Unsupervised Audio-Visual Segmentation

Sep 13, 2023
Via arXiv

DiffSED: Sound Event Detection with Denoising Diffusion

Aug 16, 2023
Via arXiv

Text-to-Audio Grounding Based Novel Metric for Evaluating Audio Caption Similarity

Oct 03, 2022
Via arXiv

Automatic Audio Captioning using Attention weighted Event based Embeddings

Jan 28, 2022
Via arXiv

Automatic Speaker Independent Dysarthric Speech Intelligibility Assessment System

Mar 10, 2021
Via arXiv

Semi Supervised Learning For Few-shot Audio Classification By Episodic Triplet Mining

Feb 16, 2021
Via arXiv