Haizhou Li

Computation and Parameter Efficient Multi-Modal Fusion Transformer for Cued Speech Recognition

Feb 08, 2024
Lei Liu, Li Liu, Haizhou Li

LitE-SNN: Designing Lightweight and Efficient Spiking Neural Network through Spatial-Temporal Compressive Network Search and Joint Optimization

Jan 26, 2024
Qianhui Liu, Jiaqi Yan, Malu Zhang, Gang Pan, Haizhou Li

CoAVT: A Cognition-Inspired Unified Audio-Visual-Text Pre-Training Model for Multimodal Processing

Jan 22, 2024
Xianghu Yue, Xiaohai Tian, Malu Zhang, Zhizheng Wu, Haizhou Li

An Empirical Study on the Impact of Positional Encoding in Transformer-based Monaural Speech Enhancement

Jan 18, 2024
Qiquan Zhang, Meng Ge, Hongxu Zhu, Eliathamby Ambikairajah, Qi Song, Zhaoheng Ni, Haizhou Li

Bridging Research and Readers: A Multi-Modal Automated Academic Papers Interpretation System

Jan 17, 2024
Feng Jiang, Kuang Wang, Haizhou Li

Gradient weighting for speaker verification in extremely low Signal-to-Noise Ratio

Jan 05, 2024
Yi Ma, Kong Aik Lee, Ville Hautamäki, Meng Ge, Haizhou Li

The NUS-HLT System for ICASSP2024 ICMC-ASR Grand Challenge

Dec 26, 2023
Meng Ge, Yizhou Peng, Yidi Jiang, Jingru Lin, Junyi Ao, Mehmet Sinan Yildirim, Shuai Wang, Haizhou Li, Mengling Feng

A Comprehensive Analysis of the Effectiveness of Large Language Models as Automatic Dialogue Evaluators

Dec 24, 2023
Chen Zhang, Luis Fernando D'Haro, Yiming Chen, Malu Zhang, Haizhou Li

Emotion Rendering for Conversational Speech Synthesis with Heterogeneous Graph-Based Context Modeling

Dec 19, 2023
Rui Liu, Yifan Hu, Yi Ren, Xiang Yin, Haizhou Li
