Alert button

"speech": models, code, and papers
Alert button

CPPF: A contextual and post-processing-free model for automatic speech recognition

Sep 14, 2023
Lei Zhang, Zhengkun Tian, Xiang Chen, Jiaming Sun, Hongyu Xiang, Ke Ding, Guanglu Wan

Figure 1 for CPPF: A contextual and post-processing-free model for automatic speech recognition
Figure 2 for CPPF: A contextual and post-processing-free model for automatic speech recognition
Figure 3 for CPPF: A contextual and post-processing-free model for automatic speech recognition
Viaarxiv icon

Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement

Aug 17, 2023
Ye-Xin Lu, Yang Ai, Zhen-Hua Ling

Figure 1 for Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement
Figure 2 for Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement
Figure 3 for Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement
Figure 4 for Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement
Viaarxiv icon

Thesis Distillation: Investigating The Impact of Bias in NLP Models on Hate Speech Detection

Aug 31, 2023
Fatma Elsafoury

Figure 1 for Thesis Distillation: Investigating The Impact of Bias in NLP Models on Hate Speech Detection
Viaarxiv icon

USA: Universal Sentiment Analysis Model & Construction of Japanese Sentiment Text Classification and Part of Speech Dataset

Sep 07, 2023
Chengguang Gan, Qinghao Zhang, Tatsunori Mori

Figure 1 for USA: Universal Sentiment Analysis Model & Construction of Japanese Sentiment Text Classification and Part of Speech Dataset
Figure 2 for USA: Universal Sentiment Analysis Model & Construction of Japanese Sentiment Text Classification and Part of Speech Dataset
Figure 3 for USA: Universal Sentiment Analysis Model & Construction of Japanese Sentiment Text Classification and Part of Speech Dataset
Figure 4 for USA: Universal Sentiment Analysis Model & Construction of Japanese Sentiment Text Classification and Part of Speech Dataset
Viaarxiv icon

Minimal Effective Theory for Phonotactic Memory: Capturing Local Correlations due to Errors in Speech

Sep 04, 2023
Paul Myles Eugenio

Viaarxiv icon

Spatial Reconstructed Local Attention Res2Net with F0 Subband for Fake Speech Detection

Aug 19, 2023
Cunhang Fan, Jun Xue, Jianhua Tao, Jiangyan Yi, Chenglong Wang, Chengshi Zheng, Zhao Lv

Figure 1 for Spatial Reconstructed Local Attention Res2Net with F0 Subband for Fake Speech Detection
Figure 2 for Spatial Reconstructed Local Attention Res2Net with F0 Subband for Fake Speech Detection
Figure 3 for Spatial Reconstructed Local Attention Res2Net with F0 Subband for Fake Speech Detection
Figure 4 for Spatial Reconstructed Local Attention Res2Net with F0 Subband for Fake Speech Detection
Viaarxiv icon

Acoustic-to-articulatory inversion for dysarthric speech: Are pre-trained self-supervised representations favorable?

Sep 07, 2023
Sarthak Kumar Maharana, Krishna Kamal Adidam, Shoumik Nandi, Ajitesh Srivastava

Viaarxiv icon

Where's the Liability in Harmful AI Speech?

Aug 09, 2023
Peter Henderson, Tatsunori Hashimoto, Mark Lemley

Figure 1 for Where's the Liability in Harmful AI Speech?
Figure 2 for Where's the Liability in Harmful AI Speech?
Figure 3 for Where's the Liability in Harmful AI Speech?
Figure 4 for Where's the Liability in Harmful AI Speech?
Viaarxiv icon

On the Opportunities of Green Computing: A Survey

Nov 01, 2023
You Zhou, Xiujing Lin, Xiang Zhang, Maolin Wang, Gangwei Jiang, Huakang Lu, Yupeng Wu, Kai Zhang, Zhe Yang, Kehang Wang, Yongduo Sui, Fengwei Jia, Zuoli Tang, Yao Zhao, Hongxuan Zhang, Tiannuo Yang, Weibo Chen, Yunong Mao, Yi Li, De Bao, Yu Li, Hongrui Liao, Ting Liu, Jingwen Liu, Jinchi Guo, Jin Zhao, Xiangyu Zhao, Ying WEI, Hong Qian, Qi Liu, Xiang Wang, Wai Kin, Chan, Chenliang Li, Yusen Li, Shiyu Yang, Jining Yan, Chao Mou, Shuai Han, Wuxia Jin, Guannan Zhang, Xiaodong Zeng

Figure 1 for On the Opportunities of Green Computing: A Survey
Figure 2 for On the Opportunities of Green Computing: A Survey
Figure 3 for On the Opportunities of Green Computing: A Survey
Figure 4 for On the Opportunities of Green Computing: A Survey
Viaarxiv icon

Multilingual Speech-to-Speech Translation into Multiple Target Languages

Jul 17, 2023
Hongyu Gong, Ning Dong, Sravya Popuri, Vedanuj Goswami, Ann Lee, Juan Pino

Viaarxiv icon