Alert button
Picture for Haizhou Li

Haizhou Li

Alert button

MDD-Eval: Self-Training on Augmented Data for Multi-Domain Dialogue Evaluation

Add code
Bookmark button
Alert button
Dec 14, 2021
Chen Zhang, Luis Fernando D'Haro, Thomas Friedrichs, Haizhou Li

Figure 1 for MDD-Eval: Self-Training on Augmented Data for Multi-Domain Dialogue Evaluation
Figure 2 for MDD-Eval: Self-Training on Augmented Data for Multi-Domain Dialogue Evaluation
Figure 3 for MDD-Eval: Self-Training on Augmented Data for Multi-Domain Dialogue Evaluation
Figure 4 for MDD-Eval: Self-Training on Augmented Data for Multi-Domain Dialogue Evaluation
Viaarxiv icon

Time-Frequency Attention for Monaural Speech Enhancement

Add code
Bookmark button
Alert button
Nov 17, 2021
Qiquan Zhang, Qi Song, Zhaoheng Ni, Aaron Nicolson, Haizhou Li

Figure 1 for Time-Frequency Attention for Monaural Speech Enhancement
Figure 2 for Time-Frequency Attention for Monaural Speech Enhancement
Figure 3 for Time-Frequency Attention for Monaural Speech Enhancement
Figure 4 for Time-Frequency Attention for Monaural Speech Enhancement
Viaarxiv icon

HLT-NUS SUBMISSION FOR 2020 NIST Conversational Telephone Speech SRE

Add code
Bookmark button
Alert button
Nov 12, 2021
Rohan Kumar Das, Ruijie Tao, Haizhou Li

Figure 1 for HLT-NUS SUBMISSION FOR 2020 NIST Conversational Telephone Speech SRE
Figure 2 for HLT-NUS SUBMISSION FOR 2020 NIST Conversational Telephone Speech SRE
Viaarxiv icon

MEmoBERT: Pre-training Model with Prompt-based Learning for Multimodal Emotion Recognition

Add code
Bookmark button
Alert button
Oct 27, 2021
Jinming Zhao, Ruichen Li, Qin Jin, Xinchao Wang, Haizhou Li

Figure 1 for MEmoBERT: Pre-training Model with Prompt-based Learning for Multimodal Emotion Recognition
Figure 2 for MEmoBERT: Pre-training Model with Prompt-based Learning for Multimodal Emotion Recognition
Figure 3 for MEmoBERT: Pre-training Model with Prompt-based Learning for Multimodal Emotion Recognition
Figure 4 for MEmoBERT: Pre-training Model with Prompt-based Learning for Multimodal Emotion Recognition
Viaarxiv icon

Identity Conversion for Emotional Speakers: A Study for Disentanglement of Emotion Style and Speaker Identity

Add code
Bookmark button
Alert button
Oct 20, 2021
Zongyang Du, Berrak Sisman, Kun Zhou, Haizhou Li

Figure 1 for Identity Conversion for Emotional Speakers: A Study for Disentanglement of Emotion Style and Speaker Identity
Figure 2 for Identity Conversion for Emotional Speakers: A Study for Disentanglement of Emotion Style and Speaker Identity
Figure 3 for Identity Conversion for Emotional Speakers: A Study for Disentanglement of Emotion Style and Speaker Identity
Figure 4 for Identity Conversion for Emotional Speakers: A Study for Disentanglement of Emotion Style and Speaker Identity
Viaarxiv icon

Ego4D: Around the World in 3,000 Hours of Egocentric Video

Add code
Bookmark button
Alert button
Oct 13, 2021
Kristen Grauman, Andrew Westbury, Eugene Byrne, Zachary Chavis, Antonino Furnari, Rohit Girdhar, Jackson Hamburger, Hao Jiang, Miao Liu, Xingyu Liu, Miguel Martin, Tushar Nagarajan, Ilija Radosavovic, Santhosh Kumar Ramakrishnan, Fiona Ryan, Jayant Sharma, Michael Wray, Mengmeng Xu, Eric Zhongcong Xu, Chen Zhao, Siddhant Bansal, Dhruv Batra, Vincent Cartillier, Sean Crane, Tien Do, Morrie Doulaty, Akshay Erapalli, Christoph Feichtenhofer, Adriano Fragomeni, Qichen Fu, Christian Fuegen, Abrham Gebreselasie, Cristina Gonzalez, James Hillis, Xuhua Huang, Yifei Huang, Wenqi Jia, Weslie Khoo, Jachym Kolar, Satwik Kottur, Anurag Kumar, Federico Landini, Chao Li, Yanghao Li, Zhenqiang Li, Karttikeya Mangalam, Raghava Modhugu, Jonathan Munro, Tullie Murrell, Takumi Nishiyasu, Will Price, Paola Ruiz Puentes, Merey Ramazanova, Leda Sari, Kiran Somasundaram, Audrey Southerland, Yusuke Sugano, Ruijie Tao, Minh Vo, Yuchen Wang, Xindi Wu, Takuma Yagi, Yunyi Zhu, Pablo Arbelaez, David Crandall, Dima Damen, Giovanni Maria Farinella, Bernard Ghanem, Vamsi Krishna Ithapu, C. V. Jawahar, Hanbyul Joo, Kris Kitani, Haizhou Li, Richard Newcombe, Aude Oliva, Hyun Soo Park, James M. Rehg, Yoichi Sato, Jianbo Shi, Mike Zheng Shou, Antonio Torralba, Lorenzo Torresani, Mingfei Yan, Jitendra Malik

Figure 1 for Ego4D: Around the World in 3,000 Hours of Egocentric Video
Figure 2 for Ego4D: Around the World in 3,000 Hours of Egocentric Video
Figure 3 for Ego4D: Around the World in 3,000 Hours of Egocentric Video
Figure 4 for Ego4D: Around the World in 3,000 Hours of Egocentric Video
Viaarxiv icon

DeepA: A Deep Neural Analyzer For Speech And Singing Vocoding

Add code
Bookmark button
Alert button
Oct 13, 2021
Sergey Nikonorov, Berrak Sisman, Mingyang Zhang, Haizhou Li

Figure 1 for DeepA: A Deep Neural Analyzer For Speech And Singing Vocoding
Figure 2 for DeepA: A Deep Neural Analyzer For Speech And Singing Vocoding
Figure 3 for DeepA: A Deep Neural Analyzer For Speech And Singing Vocoding
Figure 4 for DeepA: A Deep Neural Analyzer For Speech And Singing Vocoding
Viaarxiv icon

VisualTTS: TTS with Accurate Lip-Speech Synchronization for Automatic Voice Over

Add code
Bookmark button
Alert button
Oct 09, 2021
Junchen Lu, Berrak Sisman, Rui Liu, Mingyang Zhang, Haizhou Li

Figure 1 for VisualTTS: TTS with Accurate Lip-Speech Synchronization for Automatic Voice Over
Figure 2 for VisualTTS: TTS with Accurate Lip-Speech Synchronization for Automatic Voice Over
Figure 3 for VisualTTS: TTS with Accurate Lip-Speech Synchronization for Automatic Voice Over
Viaarxiv icon

StrengthNet: Deep Learning-based Emotion Strength Assessment for Emotional Speech Synthesis

Add code
Bookmark button
Alert button
Oct 08, 2021
Rui Liu, Berrak Sisman, Haizhou Li

Figure 1 for StrengthNet: Deep Learning-based Emotion Strength Assessment for Emotional Speech Synthesis
Figure 2 for StrengthNet: Deep Learning-based Emotion Strength Assessment for Emotional Speech Synthesis
Figure 3 for StrengthNet: Deep Learning-based Emotion Strength Assessment for Emotional Speech Synthesis
Figure 4 for StrengthNet: Deep Learning-based Emotion Strength Assessment for Emotional Speech Synthesis
Viaarxiv icon

Self-supervised Speaker Recognition with Loss-gated Learning

Add code
Bookmark button
Alert button
Oct 08, 2021
Ruijie Tao, Kong Aik Lee, Rohan Kumar Das, Ville Hautamäki, Haizhou Li

Figure 1 for Self-supervised Speaker Recognition with Loss-gated Learning
Figure 2 for Self-supervised Speaker Recognition with Loss-gated Learning
Figure 3 for Self-supervised Speaker Recognition with Loss-gated Learning
Figure 4 for Self-supervised Speaker Recognition with Loss-gated Learning
Viaarxiv icon