Jianjun Zhao

Efficient Neural Network Verification via Order Leading Exploration of Branch-and-Bound Trees

Jul 23, 2025
Via arXiv

AR2: Attention-Guided Repair for the Robustness of CNNs Against Common Corruptions

Jul 08, 2025
Via arXiv

S2ST-Omni: An Efficient and Scalable Multilingual Speech-to-Speech Translation Framework via Seamlessly Speech-Text Alignment and Streaming Speech Decoder

Jun 16, 2025
Via arXiv

ClapFM-EVC: High-Fidelity and Flexible Emotional Voice Conversion with Dual Control from Natural Language and Speech

May 20, 2025
Via arXiv

Adaptive Branch-and-Bound Tree Exploration for Neural Network Verification

May 02, 2025
Via arXiv

CTEFM-VC: Zero-Shot Voice Conversion Based on Content-Aware Timbre Ensemble Modeling and Flow Matching

Nov 04, 2024
Via arXiv

A Coverage-Guided Testing Framework for Quantum Neural Networks

Nov 03, 2024
Via arXiv

Takin-VC: Zero-shot Voice Conversion via Jointly Hybrid Content and Memory-Augmented Context-Aware Timbre Modeling

Oct 02, 2024
Via arXiv

PolyGlotFake: A Novel Multilingual and Multimodal DeepFake Dataset

May 14, 2024
Via arXiv

GMP-ATL: Gender-augmented Multi-scale Pseudo-label Enhanced Adaptive Transfer Learning for Speech Emotion Recognition via HuBERT

May 03, 2024
Via arXiv