"Text": models, code, and papers

Speaker Adaption with Intuitive Prosodic Features for Statistical Parametric Speech Synthesis

Mar 02, 2022
Pengyu Cheng, Zhenhua Ling

Via arXiv

r-G2P: Evaluating and Enhancing Robustness of Grapheme to Phoneme Conversion by Controlled noise introducing and Contextual information incorporation

Feb 21, 2022
Chendong Zhao, Jianzong Wang, Xiaoyang Qu, Haoqian Wang, Jing Xiao

Via arXiv

Exploring and Adapting Chinese GPT to Pinyin Input Method

Mar 02, 2022
Minghuan Tan, Yong Dai, Duyu Tang, Zhangyin Feng, Guoping Huang, Jing Jiang, Jiwei Li, Shuming Shi

Via arXiv

Spoken Style Learning with Multi-modal Hierarchical Context Encoding for Conversational Text-to-Speech Synthesis

Jun 11, 2021
Jingbei Li, Yi Meng, Chenyi Li, Zhiyong Wu, Helen Meng, Chao Weng, Dan Su

Via arXiv

Neural text normalization leveraging similarities of strings and sounds

Nov 04, 2020
Riku Kawamura, Tatsuya Aoki, Hidetaka Kamigaito, Hiroya Takamura, Manabu Okumura

Via arXiv

Text Modeling with Syntax-Aware Variational Autoencoders

Aug 27, 2019
Yijun Xiao, William Yang Wang

Via arXiv

Symmetry-constrained Rectification Network for Scene Text Recognition

Aug 06, 2019
MingKun Yang, Yushuo Guan, Minghui Liao, Xin He, Kaigui Bian, Song Bai, Cong Yao, Xiang Bai

Via arXiv

DUAL: Discrete Spoken Unit Adaptive Learning for Textless Spoken Question Answering

Mar 26, 2022
Guan-Ting Lin, Yung-Sung Chuang, Ho-Lam Chung, Shu-wen Yang, Hsuan-Jui Chen, Shuyan Dong, Shang-Wen Li, Abdelrahman Mohamed, Hung-yi Lee, Lin-shan Lee

Via arXiv

Emotional Prosody Control for Speech Generation

Nov 07, 2021
Sarath Sivaprasad, Saiteja Kosgi, Vineet Gandhi

Via arXiv

Extracting COVID-19 Diagnoses and Symptoms From Clinical Text: A New Annotated Corpus and Neural Event Extraction Framework

Dec 02, 2020
Kevin Lybarger, Mari Ostendorf, Matthew Thompson, Meliha Yetisgen

Via arXiv