Hang Lv

TELEVAL: A Dynamic Benchmark Designed for Spoken Language Models in Chinese Interactive Scenarios

Jul 24, 2025
Via arXiv

BoSS: Beyond-Semantic Speech

Jul 23, 2025
Via arXiv

Interpretable Clustering Ensemble

Jun 06, 2025
Via arXiv

Adaptive Schema-aware Event Extraction with Retrieval-Augmented Generation

May 13, 2025
Via arXiv

SQ-Whisper: Speaker-Querying based Whisper Model for Target-Speaker ASR

Dec 07, 2024
Via arXiv

MuseGraph: Graph-oriented Instruction Tuning of Large Language Models for Generic Graph Mining

Mar 13, 2024
Via arXiv

Conversational Speech Recognition by Learning Audio-textual Cross-modal Contextual Representation

Oct 22, 2023
Via arXiv

Minimizing Sequential Confusion Error in Speech Command Recognition

Jul 04, 2022
Via arXiv

WeNet 2.0: More Productive End-to-End Speech Recognition Toolkit

Mar 29, 2022
Via arXiv

WenetSpeech: A 10000+ Hours Multi-domain Mandarin Corpus for Speech Recognition

Oct 18, 2021
Via arXiv