Picture for Zhifeng Han

Zhifeng Han

OmniVoice: Towards Omnilingual Zero-Shot Text-to-Speech with Diffusion Language Models

Add code
Apr 02, 2026
Viaarxiv icon

Flow2GAN: Hybrid Flow Matching and GAN with Multi-Resolution Network for Few-step High-Fidelity Audio Generation

Add code
Dec 29, 2025
Viaarxiv icon

Speech-Audio Compositional Attacks on Multimodal LLMs and Their Mitigation with SALMONN-Guard

Add code
Nov 14, 2025
Figure 1 for Speech-Audio Compositional Attacks on Multimodal LLMs and Their Mitigation with SALMONN-Guard
Figure 2 for Speech-Audio Compositional Attacks on Multimodal LLMs and Their Mitigation with SALMONN-Guard
Figure 3 for Speech-Audio Compositional Attacks on Multimodal LLMs and Their Mitigation with SALMONN-Guard
Figure 4 for Speech-Audio Compositional Attacks on Multimodal LLMs and Their Mitigation with SALMONN-Guard
Viaarxiv icon

Silent Speech Sentence Recognition with Six-Axis Accelerometers using Conformer and CTC Algorithm

Add code
Feb 25, 2025
Figure 1 for Silent Speech Sentence Recognition with Six-Axis Accelerometers using Conformer and CTC Algorithm
Figure 2 for Silent Speech Sentence Recognition with Six-Axis Accelerometers using Conformer and CTC Algorithm
Figure 3 for Silent Speech Sentence Recognition with Six-Axis Accelerometers using Conformer and CTC Algorithm
Figure 4 for Silent Speech Sentence Recognition with Six-Axis Accelerometers using Conformer and CTC Algorithm
Viaarxiv icon