"speech": models, code, and papers

Deep Photonic Reservoir Computer for Speech Recognition

Dec 11, 2023
Enrico Picco, Alessandro Lupo, Serge Massar

Via arXiv

Investigating salient representations and label Variance in Dimensional Speech Emotion Analysis

Dec 17, 2023
Vikramjit Mitra, Jingping Nie, Erdrin Azemi

Via arXiv

Co-speech gestures for human-robot collaboration

Nov 30, 2023
A. Ekrekli, A. Angleraud, G. Sharma, R. Pieters

Via arXiv

Amphion: An Open-Source Audio, Music and Speech Generation Toolkit

Dec 15, 2023
Xueyao Zhang, Liumeng Xue, Yuancheng Wang, Yicheng Gu, Xi Chen, Zihao Fang, Haopeng Chen, Lexiao Zou, Chaoren Wang, Jun Han, Kai Chen, Haizhou Li, Zhizheng Wu

Via arXiv

Unconstrained Dysfluency Modeling for Dysfluent Speech Transcription and Detection

Dec 20, 2023
Jiachen Lian, Carly Feng, Naasir Farooqi, Steve Li, Anshul Kashyap, Cheol Jun Cho, Peter Wu, Robbie Netzorg, Tingle Li, Gopala Krishna Anumanchipalli

Via arXiv

Head Orientation Estimation with Distributed Microphones Using Speech Radiation Patterns

Dec 04, 2023
Kaspar Müller, Bilgesu Çakmak, Paul Didier, Simon Doclo, Jan Østergaard, Tobias Wolff

Via arXiv

Zero Shot Audio to Audio Emotion Transfer With Speaker Disentanglement

Jan 09, 2024
Soumya Dutta, Sriram Ganapathy

Via arXiv

Transfer the linguistic representations from TTS to accent conversion with non-parallel data

Jan 07, 2024
Xi Chen, Jiakun Pei, Liumeng Xue, Mingyang Zhang

Via arXiv

LitE-SNN: Designing Lightweight and Efficient Spiking Neural Network through Spatial-Temporal Compressive Network Search and Joint Optimization

Jan 26, 2024
Qianhui Liu, Jiaqi Yan, Malu Zhang, Gang Pan, Haizhou Li

Via arXiv

The GUA-Speech System Description for CNVSRC Challenge 2023

Dec 12, 2023
Shengqiang Li, Chao Lei, Baozhong Ma, Binbin Zhang, Fuping Pan

Via arXiv