Picture for Guanglu Wan

Guanglu Wan

MSR-86K: An Evolving, Multilingual Corpus with 86,300 Hours of Transcribed Audio for Speech Recognition Research

Add code
Jun 26, 2024
Viaarxiv icon

CLAQ: Pushing the Limits of Low-Bit Post-Training Quantization for LLMs

Add code
May 27, 2024
Viaarxiv icon

Learning or Self-aligning? Rethinking Instruction Fine-tuning

Add code
Mar 02, 2024
Viaarxiv icon

A Task-oriented Dialog Model with Task-progressive and Policy-aware Pre-training

Add code
Oct 01, 2023
Viaarxiv icon

CPPF: A contextual and post-processing-free model for automatic speech recognition

Add code
Sep 21, 2023
Figure 1 for CPPF: A contextual and post-processing-free model for automatic speech recognition
Figure 2 for CPPF: A contextual and post-processing-free model for automatic speech recognition
Figure 3 for CPPF: A contextual and post-processing-free model for automatic speech recognition
Viaarxiv icon

Enhancing Multilingual Speech Recognition through Language Prompt Tuning and Frame-Level Language Adapter

Add code
Sep 19, 2023
Figure 1 for Enhancing Multilingual Speech Recognition through Language Prompt Tuning and Frame-Level Language Adapter
Figure 2 for Enhancing Multilingual Speech Recognition through Language Prompt Tuning and Frame-Level Language Adapter
Figure 3 for Enhancing Multilingual Speech Recognition through Language Prompt Tuning and Frame-Level Language Adapter
Figure 4 for Enhancing Multilingual Speech Recognition through Language Prompt Tuning and Frame-Level Language Adapter
Viaarxiv icon

Exploiting Pseudo Future Contexts for Emotion Recognition in Conversations

Add code
Jun 27, 2023
Figure 1 for Exploiting Pseudo Future Contexts for Emotion Recognition in Conversations
Figure 2 for Exploiting Pseudo Future Contexts for Emotion Recognition in Conversations
Figure 3 for Exploiting Pseudo Future Contexts for Emotion Recognition in Conversations
Figure 4 for Exploiting Pseudo Future Contexts for Emotion Recognition in Conversations
Viaarxiv icon

Dialog-to-Actions: Building Task-Oriented Dialogue System via Action-Level Generation

Add code
Apr 03, 2023
Figure 1 for Dialog-to-Actions: Building Task-Oriented Dialogue System via Action-Level Generation
Figure 2 for Dialog-to-Actions: Building Task-Oriented Dialogue System via Action-Level Generation
Figure 3 for Dialog-to-Actions: Building Task-Oriented Dialogue System via Action-Level Generation
Figure 4 for Dialog-to-Actions: Building Task-Oriented Dialogue System via Action-Level Generation
Viaarxiv icon

Label-free Knowledge Distillation with Contrastive Loss for Light-weight Speaker Recognition

Add code
Dec 06, 2022
Figure 1 for Label-free Knowledge Distillation with Contrastive Loss for Light-weight Speaker Recognition
Figure 2 for Label-free Knowledge Distillation with Contrastive Loss for Light-weight Speaker Recognition
Figure 3 for Label-free Knowledge Distillation with Contrastive Loss for Light-weight Speaker Recognition
Figure 4 for Label-free Knowledge Distillation with Contrastive Loss for Light-weight Speaker Recognition
Viaarxiv icon

Covariance Regularization for Probabilistic Linear Discriminant Analysis

Add code
Dec 06, 2022
Figure 1 for Covariance Regularization for Probabilistic Linear Discriminant Analysis
Figure 2 for Covariance Regularization for Probabilistic Linear Discriminant Analysis
Figure 3 for Covariance Regularization for Probabilistic Linear Discriminant Analysis
Viaarxiv icon