Picture for Yang Ai

Yang Ai

Refining Self-Supervised Learnt Speech Representation using Brain Activations

Add code
Jun 12, 2024
Viaarxiv icon

BiVocoder: A Bidirectional Neural Vocoder Integrating Feature Extraction and Waveform Generation

Add code
Jun 04, 2024
Figure 1 for BiVocoder: A Bidirectional Neural Vocoder Integrating Feature Extraction and Waveform Generation
Figure 2 for BiVocoder: A Bidirectional Neural Vocoder Integrating Feature Extraction and Waveform Generation
Figure 3 for BiVocoder: A Bidirectional Neural Vocoder Integrating Feature Extraction and Waveform Generation
Figure 4 for BiVocoder: A Bidirectional Neural Vocoder Integrating Feature Extraction and Waveform Generation
Viaarxiv icon

Multi-Stage Speech Bandwidth Extension with Flexible Sampling Rate Control

Add code
Jun 04, 2024
Figure 1 for Multi-Stage Speech Bandwidth Extension with Flexible Sampling Rate Control
Figure 2 for Multi-Stage Speech Bandwidth Extension with Flexible Sampling Rate Control
Figure 3 for Multi-Stage Speech Bandwidth Extension with Flexible Sampling Rate Control
Figure 4 for Multi-Stage Speech Bandwidth Extension with Flexible Sampling Rate Control
Viaarxiv icon

Voice Attribute Editing with Text Prompt

Add code
Apr 13, 2024
Figure 1 for Voice Attribute Editing with Text Prompt
Figure 2 for Voice Attribute Editing with Text Prompt
Figure 3 for Voice Attribute Editing with Text Prompt
Figure 4 for Voice Attribute Editing with Text Prompt
Viaarxiv icon

Low-Latency Neural Speech Phase Prediction based on Parallel Estimation Architecture and Anti-Wrapping Losses for Speech Generation Tasks

Add code
Mar 26, 2024
Figure 1 for Low-Latency Neural Speech Phase Prediction based on Parallel Estimation Architecture and Anti-Wrapping Losses for Speech Generation Tasks
Figure 2 for Low-Latency Neural Speech Phase Prediction based on Parallel Estimation Architecture and Anti-Wrapping Losses for Speech Generation Tasks
Figure 3 for Low-Latency Neural Speech Phase Prediction based on Parallel Estimation Architecture and Anti-Wrapping Losses for Speech Generation Tasks
Figure 4 for Low-Latency Neural Speech Phase Prediction based on Parallel Estimation Architecture and Anti-Wrapping Losses for Speech Generation Tasks
Viaarxiv icon

APCodec: A Neural Audio Codec with Parallel Amplitude and Phase Spectrum Encoding and Decoding

Add code
Feb 16, 2024
Figure 1 for APCodec: A Neural Audio Codec with Parallel Amplitude and Phase Spectrum Encoding and Decoding
Figure 2 for APCodec: A Neural Audio Codec with Parallel Amplitude and Phase Spectrum Encoding and Decoding
Figure 3 for APCodec: A Neural Audio Codec with Parallel Amplitude and Phase Spectrum Encoding and Decoding
Figure 4 for APCodec: A Neural Audio Codec with Parallel Amplitude and Phase Spectrum Encoding and Decoding
Viaarxiv icon

Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction

Add code
Jan 12, 2024
Viaarxiv icon

A Dynamic Network for Efficient Point Cloud Registration

Add code
Dec 05, 2023
Figure 1 for A Dynamic Network for Efficient Point Cloud Registration
Figure 2 for A Dynamic Network for Efficient Point Cloud Registration
Figure 3 for A Dynamic Network for Efficient Point Cloud Registration
Figure 4 for A Dynamic Network for Efficient Point Cloud Registration
Viaarxiv icon

APNet2: High-quality and High-efficiency Neural Vocoder with Direct Prediction of Amplitude and Phase Spectra

Add code
Nov 20, 2023
Figure 1 for APNet2: High-quality and High-efficiency Neural Vocoder with Direct Prediction of Amplitude and Phase Spectra
Figure 2 for APNet2: High-quality and High-efficiency Neural Vocoder with Direct Prediction of Amplitude and Phase Spectra
Figure 3 for APNet2: High-quality and High-efficiency Neural Vocoder with Direct Prediction of Amplitude and Phase Spectra
Figure 4 for APNet2: High-quality and High-efficiency Neural Vocoder with Direct Prediction of Amplitude and Phase Spectra
Viaarxiv icon

Incorporating Ultrasound Tongue Images for Audio-Visual Speech Enhancement

Add code
Sep 19, 2023
Figure 1 for Incorporating Ultrasound Tongue Images for Audio-Visual Speech Enhancement
Figure 2 for Incorporating Ultrasound Tongue Images for Audio-Visual Speech Enhancement
Figure 3 for Incorporating Ultrasound Tongue Images for Audio-Visual Speech Enhancement
Figure 4 for Incorporating Ultrasound Tongue Images for Audio-Visual Speech Enhancement
Viaarxiv icon