Wenhuan Lu

HalluAudio: A Comprehensive Benchmark for Hallucination Detection in Large Audio-Language Models

Apr 21, 2026

TIGFlow-GRPO: Trajectory Forecasting via Interaction-Aware Flow Matching and Reward-Driven Optimization

Mar 26, 2026

Whisper-MLA: Reducing GPU Memory Consumption of ASR Models based on MHA2MLA Conversion

Feb 28, 2026

Listening for "You": Enhancing Speech Image Retrieval via Target Speaker Extraction

Sep 11, 2025

You Only Speak Once to See

Sep 27, 2024

Channel Adaptation for Speaker Verification Using Optimal Transport with Pseudo Label

Sep 14, 2024

Integrated Multi-Level Knowledge Distillation for Enhanced Speaker Verification

Sep 14, 2024

Robust Channel Learning for Large-Scale Radio Speaker Verification

Jun 16, 2024

Weakly-Supervised Video Anomaly Detection with Snippet Anomalous Attention

Sep 28, 2023

TMS: A Temporal Multi-scale Backbone Design for Speaker Embedding

Mar 17, 2022