Alert button
Picture for Paris Smaragdis

Paris Smaragdis

Alert button

Scaling Up Adaptive Filter Optimizers

Mar 01, 2024
Jonah Casebeer, Nicholas J. Bryan, Paris Smaragdis

Figure 1 for Scaling Up Adaptive Filter Optimizers
Figure 2 for Scaling Up Adaptive Filter Optimizers
Figure 3 for Scaling Up Adaptive Filter Optimizers
Viaarxiv icon

Sound Source Separation Using Latent Variational Block-Wise Disentanglement

Feb 08, 2024
Karim Helwani, Masahito Togami, Paris Smaragdis, Michael M. Goodwin

Viaarxiv icon

Meta-AF Echo Cancellation for Improved Keyword Spotting

Dec 17, 2023
Jonah Casebeer, Junkai Wu, Paris Smaragdis

Viaarxiv icon

Audio Editing with Non-Rigid Text Prompts

Oct 19, 2023
Francesco Paissan, Zhepei Wang, Mirco Ravanelli, Paris Smaragdis, Cem Subakan

Viaarxiv icon

Mechatronic Generation of Datasets for Acoustics Research

Oct 01, 2023
Austin Lu, Ethaniel Moore, Arya Nallanthighall, Kanad Sarkar, Manan Mittal, Ryan M. Corey, Paris Smaragdis, Andrew Singer

Viaarxiv icon

Noise-Robust DSP-Assisted Neural Pitch Estimation with Very Low Complexity

Sep 25, 2023
Krishna Subramani, Jean-Marc Valin, Jan Buethe, Paris Smaragdis, Mike Goodwin

Viaarxiv icon

Complete and separate: Conditional separation with missing target source attribute completion

Jul 27, 2023
Dimitrios Bralios, Efthymios Tzinis, Paris Smaragdis

Figure 1 for Complete and separate: Conditional separation with missing target source attribute completion
Figure 2 for Complete and separate: Conditional separation with missing target source attribute completion
Figure 3 for Complete and separate: Conditional separation with missing target source attribute completion
Figure 4 for Complete and separate: Conditional separation with missing target source attribute completion
Viaarxiv icon

Unsupervised Improvement of Audio-Text Cross-Modal Representations

May 05, 2023
Zhepei Wang, Cem Subakan, Krishna Subramani, Junkai Wu, Tiago Tavares, Fabio Ayres, Paris Smaragdis

Figure 1 for Unsupervised Improvement of Audio-Text Cross-Modal Representations
Figure 2 for Unsupervised Improvement of Audio-Text Cross-Modal Representations
Figure 3 for Unsupervised Improvement of Audio-Text Cross-Modal Representations
Figure 4 for Unsupervised Improvement of Audio-Text Cross-Modal Representations
Viaarxiv icon

A Framework for Unified Real-time Personalized and Non-Personalized Speech Enhancement

Feb 23, 2023
Zhepei Wang, Ritwik Giri, Devansh Shah, Jean-Marc Valin, Michael M. Goodwin, Paris Smaragdis

Figure 1 for A Framework for Unified Real-time Personalized and Non-Personalized Speech Enhancement
Figure 2 for A Framework for Unified Real-time Personalized and Non-Personalized Speech Enhancement
Figure 3 for A Framework for Unified Real-time Personalized and Non-Personalized Speech Enhancement
Figure 4 for A Framework for Unified Real-time Personalized and Non-Personalized Speech Enhancement
Viaarxiv icon

Framewise WaveGAN: High Speed Adversarial Vocoder in Time Domain with Very Low Computational Complexity

Dec 08, 2022
Ahmed Mustafa, Jean-Marc Valin, Jan Büthe, Paris Smaragdis, Mike Goodwin

Figure 1 for Framewise WaveGAN: High Speed Adversarial Vocoder in Time Domain with Very Low Computational Complexity
Figure 2 for Framewise WaveGAN: High Speed Adversarial Vocoder in Time Domain with Very Low Computational Complexity
Figure 3 for Framewise WaveGAN: High Speed Adversarial Vocoder in Time Domain with Very Low Computational Complexity
Viaarxiv icon