Cross Environment Asr


TG-ASR: Translation-Guided Learning with Parallel Gated Cross Attention for Low-Resource Automatic Speech Recognition

Add code
Feb 25, 2026
Viaarxiv icon

DBMIF: a deep balanced multimodal iterative fusion framework for air- and bone-conduction speech enhancement

Add code
Mar 03, 2026
Viaarxiv icon

SE-DiCoW: Self-Enrolled Diarization-Conditioned Whisper

Add code
Jan 27, 2026
Viaarxiv icon

Unifying Speech Recognition, Synthesis and Conversion with Autoregressive Transformers

Add code
Jan 15, 2026
Viaarxiv icon

AdvEvo-MARL: Shaping Internalized Safety through Adversarial Co-Evolution in Multi-Agent Reinforcement Learning

Add code
Oct 02, 2025
Figure 1 for AdvEvo-MARL: Shaping Internalized Safety through Adversarial Co-Evolution in Multi-Agent Reinforcement Learning
Figure 2 for AdvEvo-MARL: Shaping Internalized Safety through Adversarial Co-Evolution in Multi-Agent Reinforcement Learning
Figure 3 for AdvEvo-MARL: Shaping Internalized Safety through Adversarial Co-Evolution in Multi-Agent Reinforcement Learning
Figure 4 for AdvEvo-MARL: Shaping Internalized Safety through Adversarial Co-Evolution in Multi-Agent Reinforcement Learning
Viaarxiv icon

The Tower of Babel Revisited: Multilingual Jailbreak Prompts on Closed-Source Large Language Models

Add code
May 18, 2025
Figure 1 for The Tower of Babel Revisited: Multilingual Jailbreak Prompts on Closed-Source Large Language Models
Figure 2 for The Tower of Babel Revisited: Multilingual Jailbreak Prompts on Closed-Source Large Language Models
Figure 3 for The Tower of Babel Revisited: Multilingual Jailbreak Prompts on Closed-Source Large Language Models
Figure 4 for The Tower of Babel Revisited: Multilingual Jailbreak Prompts on Closed-Source Large Language Models
Viaarxiv icon

Tiny-Align: Bridging Automatic Speech Recognition and Large Language Model on the Edge

Add code
Nov 21, 2024
Figure 1 for Tiny-Align: Bridging Automatic Speech Recognition and Large Language Model on the Edge
Figure 2 for Tiny-Align: Bridging Automatic Speech Recognition and Large Language Model on the Edge
Figure 3 for Tiny-Align: Bridging Automatic Speech Recognition and Large Language Model on the Edge
Figure 4 for Tiny-Align: Bridging Automatic Speech Recognition and Large Language Model on the Edge
Viaarxiv icon

IndicVoices-R: Unlocking a Massive Multilingual Multi-speaker Speech Corpus for Scaling Indian TTS

Add code
Sep 09, 2024
Figure 1 for IndicVoices-R: Unlocking a Massive Multilingual Multi-speaker Speech Corpus for Scaling Indian TTS
Figure 2 for IndicVoices-R: Unlocking a Massive Multilingual Multi-speaker Speech Corpus for Scaling Indian TTS
Figure 3 for IndicVoices-R: Unlocking a Massive Multilingual Multi-speaker Speech Corpus for Scaling Indian TTS
Figure 4 for IndicVoices-R: Unlocking a Massive Multilingual Multi-speaker Speech Corpus for Scaling Indian TTS
Viaarxiv icon

LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition

Add code
Jun 06, 2024
Figure 1 for LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition
Figure 2 for LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition
Figure 3 for LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition
Figure 4 for LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition
Viaarxiv icon

Dynamical Embedding of Single Channel Electroencephalogram for Artifact Subspace Reconstruction

Add code
Jun 28, 2024
Figure 1 for Dynamical Embedding of Single Channel Electroencephalogram for Artifact Subspace Reconstruction
Figure 2 for Dynamical Embedding of Single Channel Electroencephalogram for Artifact Subspace Reconstruction
Figure 3 for Dynamical Embedding of Single Channel Electroencephalogram for Artifact Subspace Reconstruction
Figure 4 for Dynamical Embedding of Single Channel Electroencephalogram for Artifact Subspace Reconstruction
Viaarxiv icon