"speech": models, code, and papers

VAST: Vivify Your Talking Avatar via Zero-Shot Expressive Facial Style Transfer

Aug 09, 2023
Liyang Chen, Zhiyong Wu, Runnan Li, Weihong Bao, Jun Ling, Xu Tan, Sheng Zhao

Via arXiv

Boosting Local Spectro-Temporal Features for Speech Analysis

May 17, 2023
Michael Guerzhoy

Via arXiv

Guided Speech Enhancement Network

Mar 13, 2023
Yang Yang, Shao-Fu Shih, Hakan Erdogan, Jamie Menjay Lin, Chehung Lee, Yunpeng Li, George Sung, Matthias Grundmann

Via arXiv

Relating EEG recordings to speech using envelope tracking and the speech-FFR

Mar 11, 2023
Mike Thornton, Danilo Mandic, Tobias Reichenbach

Via arXiv

Contextual Biasing of Named-Entities with Large Language Models

Sep 01, 2023
Chuanneng Sun, Zeeshan Ahmed, Yingyi Ma, Zhe Liu, Yutong Pang, Ozlem Kalinli

Via arXiv

Amortizing Pragmatic Program Synthesis with Rankings

Sep 01, 2023
Yewen Pu, Saujas Vaduguru, Priyan Vaithilingam, Elena Glassman, Daniel Fried

Via arXiv

The Relationship Between Speech Features Changes When You Get Depressed: Feature Correlations for Improving Speed and Performance of Depression Detection

Jul 07, 2023
Fuxiang Tao, Wei Ma, Xuri Ge, Anna Esposito, Alessandro Vinciarelli

Via arXiv

Task-Agnostic Structured Pruning of Speech Representation Models

Jun 02, 2023
Haoyu Wang, Siyuan Wang, Wei-Qiang Zhang, Hongbin Suo, Yulong Wan

Via arXiv

Evil Operation: Breaking Speaker Recognition with PaddingBack

Aug 08, 2023
Zhe Ye, Diqun Yan, Li Dong, Kailai Shen

Via arXiv

Efficient Acoustic Echo Suppression with Condition-Aware Training

Jul 28, 2023
Ernst Seidel, Pejman Mowlaee, Tim Fingscheidt

Via arXiv