Kai Yu

Sherman

Text-aware Speech Separation for Multi-talker Keyword Spotting

Jun 18, 2024

GigaSpeech 2: An Evolving, Large-Scale and Multi-domain ASR Corpus for Low-Resource Languages with Automated Crawling, Transcription and Refinement

Jun 17, 2024

Common and Rare Fundus Diseases Identification Using Vision-Language Foundation Model with Knowledge of Over 400 Diseases

Jun 13, 2024

FakeSound: Deepfake General Audio Detection

Jun 12, 2024

Evolving Subnetwork Training for Large Language Models

Jun 11, 2024

Sparsity-Accelerated Training for Large Language Models

Jun 03, 2024

Disentangling Foreground and Background Motion for Enhanced Realism in Human Video Generation

May 28, 2024

Performance Analysis of Uplink/Downlink Decoupled Access in Cellular-V2X Networks

May 10, 2024

AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion Encoding

May 06, 2024

CoE-SQL: In-Context Learning for Multi-Turn Text-to-SQL with Chain-of-Editions

May 04, 2024