Alert button

"speech": models, code, and papers
Alert button

UnifySpeech: A Unified Framework for Zero-shot Text-to-Speech and Voice Conversion

Add code
Bookmark button
Alert button
Jan 10, 2023
Haogeng Liu, Tao Wang, Ruibo Fu, Jiangyan Yi, Zhengqi Wen, Jianhua Tao

Figure 1 for UnifySpeech: A Unified Framework for Zero-shot Text-to-Speech and Voice Conversion
Figure 2 for UnifySpeech: A Unified Framework for Zero-shot Text-to-Speech and Voice Conversion
Figure 3 for UnifySpeech: A Unified Framework for Zero-shot Text-to-Speech and Voice Conversion
Figure 4 for UnifySpeech: A Unified Framework for Zero-shot Text-to-Speech and Voice Conversion
Viaarxiv icon

Language Control in Robotics

May 04, 2023
Ravi Tejwani, Chengyuan Ma, Paolo Bonato, H. Harry Asada

Figure 1 for Language Control in Robotics
Figure 2 for Language Control in Robotics
Figure 3 for Language Control in Robotics
Viaarxiv icon

IMaSC -- ICFOSS Malayalam Speech Corpus

Add code
Bookmark button
Alert button
Nov 23, 2022
Deepa P Gopinath, Thennal D K, Vrinda V Nair, Swaraj K S, Sachin G

Figure 1 for IMaSC -- ICFOSS Malayalam Speech Corpus
Figure 2 for IMaSC -- ICFOSS Malayalam Speech Corpus
Figure 3 for IMaSC -- ICFOSS Malayalam Speech Corpus
Figure 4 for IMaSC -- ICFOSS Malayalam Speech Corpus
Viaarxiv icon

PAMP: A unified framework boosting low resource automatic speech recognition

Add code
Bookmark button
Alert button
Feb 05, 2023
Zeping Min, Qian Ge, Zhong Li, Weinan E

Figure 1 for PAMP: A unified framework boosting low resource automatic speech recognition
Figure 2 for PAMP: A unified framework boosting low resource automatic speech recognition
Figure 3 for PAMP: A unified framework boosting low resource automatic speech recognition
Figure 4 for PAMP: A unified framework boosting low resource automatic speech recognition
Viaarxiv icon

DasFormer: Deep Alternating Spectrogram Transformer for Multi/Single-Channel Speech Separation

Feb 21, 2023
Shuo Wang, Xiangyu Kong, Xiulian Peng, Hesam Movassagh, Vinod Prakash, Yan Lu

Figure 1 for DasFormer: Deep Alternating Spectrogram Transformer for Multi/Single-Channel Speech Separation
Figure 2 for DasFormer: Deep Alternating Spectrogram Transformer for Multi/Single-Channel Speech Separation
Figure 3 for DasFormer: Deep Alternating Spectrogram Transformer for Multi/Single-Channel Speech Separation
Figure 4 for DasFormer: Deep Alternating Spectrogram Transformer for Multi/Single-Channel Speech Separation
Viaarxiv icon

Speech-to-Speech Translation For A Real-world Unwritten Language

Add code
Bookmark button
Alert button
Nov 11, 2022
Peng-Jen Chen, Kevin Tran, Yilin Yang, Jingfei Du, Justine Kao, Yu-An Chung, Paden Tomasello, Paul-Ambroise Duquenne, Holger Schwenk, Hongyu Gong, Hirofumi Inaguma, Sravya Popuri, Changhan Wang, Juan Pino, Wei-Ning Hsu, Ann Lee

Figure 1 for Speech-to-Speech Translation For A Real-world Unwritten Language
Figure 2 for Speech-to-Speech Translation For A Real-world Unwritten Language
Figure 3 for Speech-to-Speech Translation For A Real-world Unwritten Language
Figure 4 for Speech-to-Speech Translation For A Real-world Unwritten Language
Viaarxiv icon

A Comprehensive Review of Data-Driven Co-Speech Gesture Generation

Add code
Bookmark button
Alert button
Jan 13, 2023
Simbarashe Nyatsanga, Taras Kucherenko, Chaitanya Ahuja, Gustav Eje Henter, Michael Neff

Figure 1 for A Comprehensive Review of Data-Driven Co-Speech Gesture Generation
Figure 2 for A Comprehensive Review of Data-Driven Co-Speech Gesture Generation
Figure 3 for A Comprehensive Review of Data-Driven Co-Speech Gesture Generation
Figure 4 for A Comprehensive Review of Data-Driven Co-Speech Gesture Generation
Viaarxiv icon

An automated method for the ontological representation of security directives

Jun 30, 2023
Giampaolo Bella, Gianpietro Castiglione, Daniele Francesco Santamaria

Figure 1 for An automated method for the ontological representation of security directives
Figure 2 for An automated method for the ontological representation of security directives
Figure 3 for An automated method for the ontological representation of security directives
Figure 4 for An automated method for the ontological representation of security directives
Viaarxiv icon

CASEIN: Cascading Explicit and Implicit Control for Fine-grained Emotion Intensity Regulation

Jun 27, 2023
Yuhao Cui, Xiongwei Wang, Zhongzhou Zhao, Wei Zhou, Haiqing Chen

Figure 1 for CASEIN: Cascading Explicit and Implicit Control for Fine-grained Emotion Intensity Regulation
Figure 2 for CASEIN: Cascading Explicit and Implicit Control for Fine-grained Emotion Intensity Regulation
Figure 3 for CASEIN: Cascading Explicit and Implicit Control for Fine-grained Emotion Intensity Regulation
Figure 4 for CASEIN: Cascading Explicit and Implicit Control for Fine-grained Emotion Intensity Regulation
Viaarxiv icon

Mingling or Misalignment? Temporal Shift for Speech Emotion Recognition with Pre-trained Representations

Add code
Bookmark button
Alert button
Mar 01, 2023
Siyuan Shen, Feng Liu, Aimin Zhou

Figure 1 for Mingling or Misalignment? Temporal Shift for Speech Emotion Recognition with Pre-trained Representations
Figure 2 for Mingling or Misalignment? Temporal Shift for Speech Emotion Recognition with Pre-trained Representations
Figure 3 for Mingling or Misalignment? Temporal Shift for Speech Emotion Recognition with Pre-trained Representations
Figure 4 for Mingling or Misalignment? Temporal Shift for Speech Emotion Recognition with Pre-trained Representations
Viaarxiv icon