Picture for Yukun Ma

Yukun Ma

Skip-Layer Attention: Bridging Abstract and Detailed Dependencies in Transformers

Add code
Jun 17, 2024
Figure 1 for Skip-Layer Attention: Bridging Abstract and Detailed Dependencies in Transformers
Figure 2 for Skip-Layer Attention: Bridging Abstract and Detailed Dependencies in Transformers
Figure 3 for Skip-Layer Attention: Bridging Abstract and Detailed Dependencies in Transformers
Figure 4 for Skip-Layer Attention: Bridging Abstract and Detailed Dependencies in Transformers
Viaarxiv icon

Phonetic Enhanced Language Modeling for Text-to-Speech Synthesis

Add code
Jun 04, 2024
Viaarxiv icon

LiqD: A Dynamic Liquid Level Detection Model under Tricky Small Containers

Add code
Mar 13, 2024
Figure 1 for LiqD: A Dynamic Liquid Level Detection Model under Tricky Small Containers
Figure 2 for LiqD: A Dynamic Liquid Level Detection Model under Tricky Small Containers
Figure 3 for LiqD: A Dynamic Liquid Level Detection Model under Tricky Small Containers
Figure 4 for LiqD: A Dynamic Liquid Level Detection Model under Tricky Small Containers
Viaarxiv icon

ICE-GRT: Instruction Context Enhancement by Generative Reinforcement based Transformers

Add code
Jan 04, 2024
Viaarxiv icon

MossFormer2: Combining Transformer and RNN-Free Recurrent Network for Enhanced Time-Domain Monaural Speech Separation

Add code
Dec 19, 2023
Viaarxiv icon

Loss Masking Is Not Needed in Decoder-only Transformer for Discrete-token Based ASR

Add code
Nov 08, 2023
Figure 1 for Loss Masking Is Not Needed in Decoder-only Transformer for Discrete-token Based ASR
Figure 2 for Loss Masking Is Not Needed in Decoder-only Transformer for Discrete-token Based ASR
Figure 3 for Loss Masking Is Not Needed in Decoder-only Transformer for Discrete-token Based ASR
Figure 4 for Loss Masking Is Not Needed in Decoder-only Transformer for Discrete-token Based ASR
Viaarxiv icon

Balancing Specialized and General Skills in LLMs: The Impact of Modern Tuning and Data Strategy

Add code
Oct 07, 2023
Figure 1 for Balancing Specialized and General Skills in LLMs: The Impact of Modern Tuning and Data Strategy
Figure 2 for Balancing Specialized and General Skills in LLMs: The Impact of Modern Tuning and Data Strategy
Figure 3 for Balancing Specialized and General Skills in LLMs: The Impact of Modern Tuning and Data Strategy
Figure 4 for Balancing Specialized and General Skills in LLMs: The Impact of Modern Tuning and Data Strategy
Viaarxiv icon

SPGM: Prioritizing Local Features for enhanced speech separation performance

Add code
Sep 22, 2023
Figure 1 for SPGM: Prioritizing Local Features for enhanced speech separation performance
Figure 2 for SPGM: Prioritizing Local Features for enhanced speech separation performance
Figure 3 for SPGM: Prioritizing Local Features for enhanced speech separation performance
Figure 4 for SPGM: Prioritizing Local Features for enhanced speech separation performance
Viaarxiv icon

Are Soft Prompts Good Zero-shot Learners for Speech Recognition?

Add code
Sep 18, 2023
Figure 1 for Are Soft Prompts Good Zero-shot Learners for Speech Recognition?
Figure 2 for Are Soft Prompts Good Zero-shot Learners for Speech Recognition?
Figure 3 for Are Soft Prompts Good Zero-shot Learners for Speech Recognition?
Figure 4 for Are Soft Prompts Good Zero-shot Learners for Speech Recognition?
Viaarxiv icon

ACA-Net: Towards Lightweight Speaker Verification using Asymmetric Cross Attention

Add code
May 20, 2023
Figure 1 for ACA-Net: Towards Lightweight Speaker Verification using Asymmetric Cross Attention
Figure 2 for ACA-Net: Towards Lightweight Speaker Verification using Asymmetric Cross Attention
Figure 3 for ACA-Net: Towards Lightweight Speaker Verification using Asymmetric Cross Attention
Figure 4 for ACA-Net: Towards Lightweight Speaker Verification using Asymmetric Cross Attention
Viaarxiv icon