Picture for Kai Li

Kai Li

Department of Computer Science and Technology, Tsinghua University, Beijing, China

A Fast and Lightweight Model for Causal Audio-Visual Speech Separation

Add code
Jun 07, 2025
Viaarxiv icon

Zero-Trust Foundation Models: A New Paradigm for Secure and Collaborative Artificial Intelligence for Internet of Things

Add code
May 26, 2025
Viaarxiv icon

SoloSpeech: Enhancing Intelligibility and Quality in Target Speech Extraction through a Cascaded Generative Pipeline

Add code
May 25, 2025
Viaarxiv icon

AudioTrust: Benchmarking the Multifaceted Trustworthiness of Audio Large Language Models

Add code
May 22, 2025
Viaarxiv icon

MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their Mix

Add code
May 19, 2025
Viaarxiv icon

Time-Frequency-Based Attention Cache Memory Model for Real-Time Speech Separation

Add code
May 19, 2025
Viaarxiv icon

DIMM: Decoupled Multi-hierarchy Kalman Filter for 3D Object Tracking

Add code
May 18, 2025
Viaarxiv icon

SepPrune: Structured Pruning for Efficient Deep Speech Separation

Add code
May 17, 2025
Viaarxiv icon

Undermining Federated Learning Accuracy in EdgeIoT via Variational Graph Auto-Encoders

Add code
Apr 14, 2025
Viaarxiv icon

Using machine learning method for variable star classification using the TESS Sectors 1-57 data

Add code
Apr 01, 2025
Viaarxiv icon