Picture for Yin Cao

Yin Cao

SALM: Spatial Audio Language Model with Structured Embeddings for Understanding and Editing

Add code
Jul 22, 2025
Viaarxiv icon

PSELDNets: Pre-trained Neural Networks on Large-scale Synthetic Datasets for Sound Event Localization and Detection

Add code
Nov 10, 2024
Figure 1 for PSELDNets: Pre-trained Neural Networks on Large-scale Synthetic Datasets for Sound Event Localization and Detection
Figure 2 for PSELDNets: Pre-trained Neural Networks on Large-scale Synthetic Datasets for Sound Event Localization and Detection
Figure 3 for PSELDNets: Pre-trained Neural Networks on Large-scale Synthetic Datasets for Sound Event Localization and Detection
Figure 4 for PSELDNets: Pre-trained Neural Networks on Large-scale Synthetic Datasets for Sound Event Localization and Detection
Viaarxiv icon

Text-Queried Target Sound Event Localization

Add code
Jun 23, 2024
Figure 1 for Text-Queried Target Sound Event Localization
Figure 2 for Text-Queried Target Sound Event Localization
Figure 3 for Text-Queried Target Sound Event Localization
Figure 4 for Text-Queried Target Sound Event Localization
Viaarxiv icon

Towards Out-of-Distribution Detection in Vocoder Recognition via Latent Feature Reconstruction

Add code
Jun 04, 2024
Viaarxiv icon

WavCraft: Audio Editing and Generation with Natural Language Prompts

Add code
Mar 15, 2024
Figure 1 for WavCraft: Audio Editing and Generation with Natural Language Prompts
Figure 2 for WavCraft: Audio Editing and Generation with Natural Language Prompts
Figure 3 for WavCraft: Audio Editing and Generation with Natural Language Prompts
Figure 4 for WavCraft: Audio Editing and Generation with Natural Language Prompts
Viaarxiv icon

EDTC: enhance depth of text comprehension in automated audio captioning

Add code
Feb 27, 2024
Viaarxiv icon

Selective-Memory Meta-Learning with Environment Representations for Sound Event Localization and Detection

Add code
Dec 27, 2023
Viaarxiv icon

Balanced SNR-Aware Distillation for Guided Text-to-Audio Generation

Add code
Dec 25, 2023
Viaarxiv icon

META-SELD: Meta-Learning for Fast Adaptation to the new environment in Sound Event Localization and Detection

Add code
Aug 17, 2023
Figure 1 for META-SELD: Meta-Learning for Fast Adaptation to the new environment in Sound Event Localization and Detection
Figure 2 for META-SELD: Meta-Learning for Fast Adaptation to the new environment in Sound Event Localization and Detection
Figure 3 for META-SELD: Meta-Learning for Fast Adaptation to the new environment in Sound Event Localization and Detection
Viaarxiv icon

WavJourney: Compositional Audio Creation with Large Language Models

Add code
Jul 26, 2023
Viaarxiv icon