Wenwu Wang

Regime Learning for Differentiable Particle Filters

May 08, 2024
Via arXiv

ComposerX: Multi-Agent Symbolic Music Composition with LLMs

Apr 30, 2024
Via arXiv

SemantiCodec: An Ultra Low Bitrate Semantic Audio Codec for General Sound

Apr 30, 2024
Via arXiv

T-CLAP: Temporal-Enhanced Contrastive Language-Audio Pretraining

Apr 27, 2024
Via arXiv

WavCraft: Audio Editing and Generation with Natural Language Prompts

Mar 15, 2024
Via arXiv

Multi-level graph learning for audio event classification and human-perceived annoyance rating prediction

Dec 15, 2023
Via arXiv

Fusion of Audio and Visual Embeddings for Sound Event Localization and Detection

Dec 14, 2023
Via arXiv

Acoustic Prompt Tuning: Empowering Large Language Models with Audition Capabilities

Nov 30, 2023
Via arXiv

Audio-Visual Speaker Tracking: Progress, Challenges, and Future Directions

Oct 23, 2023
Via arXiv

First-Shot Unsupervised Anomalous Sound Detection With Unknown Anomalies Estimated by Metadata-Assisted Audio Generation

Oct 22, 2023
Via arXiv