Picture for Tzu-Quan Lin

Tzu-Quan Lin

AdaLTM: Adaptive Layer-wise Task Vector Merging for Categorical Speech Emotion Recognition with ASR Knowledge Integration

Add code
Mar 26, 2026
Viaarxiv icon

How Contrastive Decoding Enhances Large Audio Language Models?

Add code
Mar 10, 2026
Viaarxiv icon

Pseudo2Real: Task Arithmetic for Pseudo-Label Correction in Automatic Speech Recognition

Add code
Oct 09, 2025
Viaarxiv icon

DeSTA2.5-Audio: Toward General-Purpose Large Audio Language Model with Self-Generated Cross-Modal Alignment

Add code
Jul 03, 2025
Figure 1 for DeSTA2.5-Audio: Toward General-Purpose Large Audio Language Model with Self-Generated Cross-Modal Alignment
Figure 2 for DeSTA2.5-Audio: Toward General-Purpose Large Audio Language Model with Self-Generated Cross-Modal Alignment
Figure 3 for DeSTA2.5-Audio: Toward General-Purpose Large Audio Language Model with Self-Generated Cross-Modal Alignment
Figure 4 for DeSTA2.5-Audio: Toward General-Purpose Large Audio Language Model with Self-Generated Cross-Modal Alignment
Viaarxiv icon

An Exploration of Mamba for Speech Self-Supervised Models

Add code
Jun 14, 2025
Figure 1 for An Exploration of Mamba for Speech Self-Supervised Models
Figure 2 for An Exploration of Mamba for Speech Self-Supervised Models
Figure 3 for An Exploration of Mamba for Speech Self-Supervised Models
Figure 4 for An Exploration of Mamba for Speech Self-Supervised Models
Viaarxiv icon

Speech-FT: A Fine-tuning Strategy for Enhancing Speech Representation Models Without Compromising Generalization Ability

Add code
Feb 18, 2025
Figure 1 for Speech-FT: A Fine-tuning Strategy for Enhancing Speech Representation Models Without Compromising Generalization Ability
Figure 2 for Speech-FT: A Fine-tuning Strategy for Enhancing Speech Representation Models Without Compromising Generalization Ability
Figure 3 for Speech-FT: A Fine-tuning Strategy for Enhancing Speech Representation Models Without Compromising Generalization Ability
Figure 4 for Speech-FT: A Fine-tuning Strategy for Enhancing Speech Representation Models Without Compromising Generalization Ability
Viaarxiv icon

Building a Taiwanese Mandarin Spoken Language Model: A First Attempt

Add code
Nov 11, 2024
Figure 1 for Building a Taiwanese Mandarin Spoken Language Model: A First Attempt
Figure 2 for Building a Taiwanese Mandarin Spoken Language Model: A First Attempt
Figure 3 for Building a Taiwanese Mandarin Spoken Language Model: A First Attempt
Figure 4 for Building a Taiwanese Mandarin Spoken Language Model: A First Attempt
Viaarxiv icon

Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks

Add code
Nov 08, 2024
Figure 1 for Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks
Figure 2 for Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks
Figure 3 for Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks
Figure 4 for Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks
Viaarxiv icon

Property Neurons in Self-Supervised Speech Transformers

Add code
Sep 07, 2024
Figure 1 for Property Neurons in Self-Supervised Speech Transformers
Figure 2 for Property Neurons in Self-Supervised Speech Transformers
Figure 3 for Property Neurons in Self-Supervised Speech Transformers
Figure 4 for Property Neurons in Self-Supervised Speech Transformers
Viaarxiv icon

Listen and Speak Fairly: A Study on Semantic Gender Bias in Speech Integrated Large Language Models

Add code
Jul 09, 2024
Viaarxiv icon