Picture for Shujie Liu

Shujie Liu

COSMIC: Data Efficient Instruction-tuning For Speech In-Context Learning

Add code
Nov 03, 2023
Figure 1 for COSMIC: Data Efficient Instruction-tuning For Speech In-Context Learning
Figure 2 for COSMIC: Data Efficient Instruction-tuning For Speech In-Context Learning
Figure 3 for COSMIC: Data Efficient Instruction-tuning For Speech In-Context Learning
Figure 4 for COSMIC: Data Efficient Instruction-tuning For Speech In-Context Learning
Viaarxiv icon

Diffusion Conditional Expectation Model for Efficient and Robust Target Speech Extraction

Add code
Sep 25, 2023
Viaarxiv icon

WavMark: Watermarking for Audio Generation

Add code
Aug 24, 2023
Viaarxiv icon

SpeechX: Neural Codec Language Model as a Versatile Speech Transformer

Add code
Aug 14, 2023
Viaarxiv icon

On decoder-only architecture for speech-to-text and large language model integration

Add code
Jul 14, 2023
Viaarxiv icon

Prompting Large Language Models for Zero-Shot Domain Adaptation in Speech Recognition

Add code
Jun 28, 2023
Viaarxiv icon

Accelerating Transducers through Adjacent Token Merging

Add code
Jun 28, 2023
Viaarxiv icon

VioLA: Unified Codec Language Models for Speech Recognition, Synthesis, and Translation

Add code
May 25, 2023
Viaarxiv icon

ComSL: A Composite Speech-Language Model for End-to-End Speech-to-Text Translation

Add code
May 24, 2023
Figure 1 for ComSL: A Composite Speech-Language Model for End-to-End Speech-to-Text Translation
Figure 2 for ComSL: A Composite Speech-Language Model for End-to-End Speech-to-Text Translation
Figure 3 for ComSL: A Composite Speech-Language Model for End-to-End Speech-to-Text Translation
Figure 4 for ComSL: A Composite Speech-Language Model for End-to-End Speech-to-Text Translation
Viaarxiv icon

Code-Switching Text Generation and Injection in Mandarin-English ASR

Add code
Mar 20, 2023
Viaarxiv icon