Yangyang Shi

High Fidelity Text-Guided Music Generation and Editing via Single-Stage Flow Matching

Jul 04, 2024

Speech ReaLLM -- Real-time Streaming Speech Recognition with Multimodal LLMs by Teaching the Flow of Time

Jun 13, 2024

Basis Selection: Low-Rank Decomposition of Pretrained Large Language Models for Target Applications

May 24, 2024

MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases

Feb 22, 2024

Not All Weights Are Created Equal: Enhancing Energy Efficiency in On-Device Streaming Speech Recognition

Feb 20, 2024

FADI-AEC: Fast Score Based Diffusion Model Guided by Far-end Signal for Acoustic Echo Cancellation

Jan 08, 2024

On The Open Prompt Challenge In Conditional Audio Generation

Nov 01, 2023

In-Context Prompt Editing For Conditional Audio Generation

Nov 01, 2023

TorchAudio 2.1: Advancing speech recognition, self-supervised learning, and audio processing components for PyTorch

Oct 27, 2023

Folding Attention: Memory and Power Optimization for On-Device Transformer-based Streaming Speech Recognition

Sep 21, 2023