Bryan Pardo

Do Joint Language-Audio Embeddings Encode Perceptual Timbre Semantics?

Oct 16, 2025
Via arXiv

The Rhythm In Anything: Audio-Prompted Drums Generation with Masked Language Modeling

Sep 19, 2025
Via arXiv

Deep Audio Watermarks are Shallow: Limitations of Post-Hoc Watermarking Techniques for Speech

Apr 15, 2025
Via arXiv

HARP 2.0: Expanding Hosted, Asynchronous, Remote Processing for Deep Learning in the DAW

Mar 04, 2025
Via arXiv

Sketch2Sound: Controllable Audio Generation via Time-Varying Signals and Sonic Imitations

Dec 11, 2024
Via arXiv

Code Drift: Towards Idempotent Neural Audio Codecs

Oct 14, 2024
Via arXiv

Text2FX: Harnessing CLAP Embeddings for Text-Guided Audio Effects

Sep 27, 2024
Via arXiv

Fine-Grained and Interpretable Neural Speech Editing

Jul 07, 2024
Via arXiv

High-Fidelity Neural Phonetic Posteriorgrams

Feb 27, 2024
Via arXiv

Exploring Musical Roots: Applying Audio Embeddings to Empower Influence Attribution for a Generative Music Model

Jan 25, 2024
Via arXiv