Picture for Qingzheng Wang

Qingzheng Wang

Fish Audio S2 Technical Report

Add code
Mar 11, 2026
Viaarxiv icon

Bagpiper: Solving Open-Ended Audio Tasks via Rich Captions

Add code
Feb 05, 2026
Viaarxiv icon

Geolocation-Aware Robust Spoken Language Identification

Add code
Aug 23, 2025
Figure 1 for Geolocation-Aware Robust Spoken Language Identification
Figure 2 for Geolocation-Aware Robust Spoken Language Identification
Figure 3 for Geolocation-Aware Robust Spoken Language Identification
Figure 4 for Geolocation-Aware Robust Spoken Language Identification
Viaarxiv icon

Improving Multilingual Speech Models on ML-SUPERB 2.0: Fine-tuning with Data Augmentation and LID-Aware CTC

Add code
May 30, 2025
Figure 1 for Improving Multilingual Speech Models on ML-SUPERB 2.0: Fine-tuning with Data Augmentation and LID-Aware CTC
Figure 2 for Improving Multilingual Speech Models on ML-SUPERB 2.0: Fine-tuning with Data Augmentation and LID-Aware CTC
Figure 3 for Improving Multilingual Speech Models on ML-SUPERB 2.0: Fine-tuning with Data Augmentation and LID-Aware CTC
Viaarxiv icon