Alert button

"speech": models, code, and papers
Alert button

Weigh Your Own Words: Improving Hate Speech Counter Narrative Generation via Attention Regularization

Add code
Bookmark button
Alert button
Sep 05, 2023
Helena Bonaldi, Giuseppe Attanasio, Debora Nozza, Marco Guerini

Viaarxiv icon

Server-side Rescoring of Spoken Entity-centric Knowledge Queries for Virtual Assistants

Nov 02, 2023
Youyuan Zhang, Sashank Gondala, Thiago Fraga-Silva, Christophe Van Gysel

Viaarxiv icon

Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement

Add code
Bookmark button
Alert button
Aug 17, 2023
Ye-Xin Lu, Yang Ai, Zhen-Hua Ling

Figure 1 for Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement
Figure 2 for Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement
Figure 3 for Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement
Figure 4 for Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement
Viaarxiv icon

Time-Frequency Transformer: A Novel Time Frequency Joint Learning Method for Speech Emotion Recognition

Aug 28, 2023
Yong Wang, Cheng Lu, Yuan Zong, Hailun Lian, Yan Zhao, Sunan Li

Figure 1 for Time-Frequency Transformer: A Novel Time Frequency Joint Learning Method for Speech Emotion Recognition
Figure 2 for Time-Frequency Transformer: A Novel Time Frequency Joint Learning Method for Speech Emotion Recognition
Figure 3 for Time-Frequency Transformer: A Novel Time Frequency Joint Learning Method for Speech Emotion Recognition
Figure 4 for Time-Frequency Transformer: A Novel Time Frequency Joint Learning Method for Speech Emotion Recognition
Viaarxiv icon

Where's the Liability in Harmful AI Speech?

Add code
Bookmark button
Alert button
Aug 09, 2023
Peter Henderson, Tatsunori Hashimoto, Mark Lemley

Figure 1 for Where's the Liability in Harmful AI Speech?
Figure 2 for Where's the Liability in Harmful AI Speech?
Figure 3 for Where's the Liability in Harmful AI Speech?
Figure 4 for Where's the Liability in Harmful AI Speech?
Viaarxiv icon

LocSelect: Target Speaker Localization with an Auditory Selective Hearing Mechanism

Oct 17, 2023
Yu Chen, Xinyuan Qian, Zexu Pan, Kainan Chen, Haizhou Li

Viaarxiv icon

Study of speaker localization with binaural microphone array incorporating auditory filters and lateral angle estimation

Oct 31, 2023
Yanir Maymon, Israel Nelken, Boaz Rafaely

Figure 1 for Study of speaker localization with binaural microphone array incorporating auditory filters and lateral angle estimation
Figure 2 for Study of speaker localization with binaural microphone array incorporating auditory filters and lateral angle estimation
Figure 3 for Study of speaker localization with binaural microphone array incorporating auditory filters and lateral angle estimation
Figure 4 for Study of speaker localization with binaural microphone array incorporating auditory filters and lateral angle estimation
Viaarxiv icon

Spatial Reconstructed Local Attention Res2Net with F0 Subband for Fake Speech Detection

Aug 19, 2023
Cunhang Fan, Jun Xue, Jianhua Tao, Jiangyan Yi, Chenglong Wang, Chengshi Zheng, Zhao Lv

Figure 1 for Spatial Reconstructed Local Attention Res2Net with F0 Subband for Fake Speech Detection
Figure 2 for Spatial Reconstructed Local Attention Res2Net with F0 Subband for Fake Speech Detection
Figure 3 for Spatial Reconstructed Local Attention Res2Net with F0 Subband for Fake Speech Detection
Figure 4 for Spatial Reconstructed Local Attention Res2Net with F0 Subband for Fake Speech Detection
Viaarxiv icon

Joint speech and overlap detection: a benchmark over multiple audio setup and speech domains

Jul 24, 2023
Martin Lebourdais, Théo Mariotte, Marie Tahon, Anthony Larcher, Antoine Laurent, Silvio Montresor, Sylvain Meignier, Jean-Hugh Thomas

Figure 1 for Joint speech and overlap detection: a benchmark over multiple audio setup and speech domains
Figure 2 for Joint speech and overlap detection: a benchmark over multiple audio setup and speech domains
Figure 3 for Joint speech and overlap detection: a benchmark over multiple audio setup and speech domains
Figure 4 for Joint speech and overlap detection: a benchmark over multiple audio setup and speech domains
Viaarxiv icon

FinBTech: Blockchain-Based Video and Voice Authentication System for Enhanced Security in Financial Transactions Utilizing FaceNet512 and Gaussian Mixture Models

Oct 28, 2023
Prof N. Jeenath Laila, Dr G. Tamilpavai

Viaarxiv icon