Tingwei Guo

Do We Need Distinct Representations for Every Speech Token? Unveiling and Exploiting Redundancy in Large Speech Language Models

Apr 08, 2026

SARI: Structured Audio Reasoning via Curriculum-Guided Reinforcement Learning

Apr 22, 2025

Advancing Speech Language Models by Scaling Supervised Fine-Tuning with Over 60,000 Hours of Synthetic Speech Dialogue Data

Dec 03, 2024

VISinger 2: High-Fidelity End-to-End Singing Voice Synthesis Enhanced by Digital Signal Processing Synthesizer

Nov 05, 2022

Audio-Visual Wake Word Spotting System For MISP Challenge 2021

Apr 20, 2022

Time Domain Adversarial Voice Conversion for ADD 2022

Apr 20, 2022

Audio Deep Fake Detection System with Neural Stitching for ADD 2022

Apr 20, 2022

DELTA: A DEep learning based Language Technology plAtform

Aug 02, 2019