Picture for Han Yin

Han Yin

Dynamic Fusion Multimodal Network for SpeechWellness Detection

Add code
Aug 25, 2025
Viaarxiv icon

Noise-Robust Sound Event Detection and Counting via Language-Queried Sound Separation

Add code
Aug 10, 2025
Viaarxiv icon

SpeakerLM: End-to-End Versatile Speaker Diarization and Recognition with Multimodal Large Language Models

Add code
Aug 08, 2025
Viaarxiv icon

EnvSDD: Benchmarking Environmental Sound Deepfake Detection

Add code
May 25, 2025
Viaarxiv icon

Pushing the Frontiers of Self-Distillation Prototypes Network with Dimension Regularization and Score Normalization

Add code
May 20, 2025
Figure 1 for Pushing the Frontiers of Self-Distillation Prototypes Network with Dimension Regularization and Score Normalization
Figure 2 for Pushing the Frontiers of Self-Distillation Prototypes Network with Dimension Regularization and Score Normalization
Figure 3 for Pushing the Frontiers of Self-Distillation Prototypes Network with Dimension Regularization and Score Normalization
Figure 4 for Pushing the Frontiers of Self-Distillation Prototypes Network with Dimension Regularization and Score Normalization
Viaarxiv icon

Leveraging LLM and Text-Queried Separation for Noise-Robust Sound Event Detection

Add code
Nov 02, 2024
Viaarxiv icon

Multimodal Clickbait Detection by De-confounding Biases Using Causal Representation Inference

Add code
Oct 10, 2024
Figure 1 for Multimodal Clickbait Detection by De-confounding Biases Using Causal Representation Inference
Figure 2 for Multimodal Clickbait Detection by De-confounding Biases Using Causal Representation Inference
Figure 3 for Multimodal Clickbait Detection by De-confounding Biases Using Causal Representation Inference
Figure 4 for Multimodal Clickbait Detection by De-confounding Biases Using Causal Representation Inference
Viaarxiv icon

Mixstyle based Domain Generalization for Sound Event Detection with Heterogeneous Training Data

Add code
Jul 04, 2024
Viaarxiv icon

FMSG-JLESS Submission for DCASE 2024 Task4 on Sound Event Detection with Heterogeneous Training Dataset and Potentially Missing Labels

Add code
Jun 29, 2024
Figure 1 for FMSG-JLESS Submission for DCASE 2024 Task4 on Sound Event Detection with Heterogeneous Training Dataset and Potentially Missing Labels
Figure 2 for FMSG-JLESS Submission for DCASE 2024 Task4 on Sound Event Detection with Heterogeneous Training Dataset and Potentially Missing Labels
Viaarxiv icon

Description on IEEE ICME 2024 Grand Challenge: Semi-supervised Acoustic Scene Classification under Domain Shift

Add code
Feb 05, 2024
Figure 1 for Description on IEEE ICME 2024 Grand Challenge: Semi-supervised Acoustic Scene Classification under Domain Shift
Figure 2 for Description on IEEE ICME 2024 Grand Challenge: Semi-supervised Acoustic Scene Classification under Domain Shift
Figure 3 for Description on IEEE ICME 2024 Grand Challenge: Semi-supervised Acoustic Scene Classification under Domain Shift
Figure 4 for Description on IEEE ICME 2024 Grand Challenge: Semi-supervised Acoustic Scene Classification under Domain Shift
Viaarxiv icon