
Yang Ai


Voice Attribute Editing with Text Prompt

Apr 13, 2024
Zhengyan Sheng, Yang Ai, Li-Juan Liu, Jia Pan, Zhen-Hua Ling


Low-Latency Neural Speech Phase Prediction based on Parallel Estimation Architecture and Anti-Wrapping Losses for Speech Generation Tasks

Mar 26, 2024
Yang Ai, Zhen-Hua Ling


APCodec: A Neural Audio Codec with Parallel Amplitude and Phase Spectrum Encoding and Decoding

Feb 16, 2024
Yang Ai, Xiao-Hang Jiang, Ye-Xin Lu, Hui-Peng Du, Zhen-Hua Ling


Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction

Jan 12, 2024
Ye-Xin Lu, Yang Ai, Hui-Peng Du, Zhen-Hua Ling


A Dynamic Network for Efficient Point Cloud Registration

Dec 05, 2023
Yang Ai, Xi Yang


APNet2: High-quality and High-efficiency Neural Vocoder with Direct Prediction of Amplitude and Phase Spectra

Nov 20, 2023
Hui-Peng Du, Ye-Xin Lu, Yang Ai, Zhen-Hua Ling


Incorporating Ultrasound Tongue Images for Audio-Visual Speech Enhancement

Sep 19, 2023
Rui-Chen Zheng, Yang Ai, Zhen-Hua Ling


Face-Driven Zero-Shot Voice Conversion with Memory-based Face-Voice Alignment

Sep 18, 2023
Zheng-Yan Sheng, Yang Ai, Yan-Nian Chen, Zhen-Hua Ling


Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement

Aug 17, 2023
Ye-Xin Lu, Yang Ai, Zhen-Hua Ling


Long-frame-shift Neural Speech Phase Prediction with Spectral Continuity Enhancement and Interpolation Error Compensation

Aug 17, 2023
Yang Ai, Ye-Xin Lu, Zhen-Hua Ling
