Picture for Han Yin

Han Yin

Noise-Robust Sound Event Detection and Counting via Language-Queried Sound Separation

Add code
Aug 10, 2025
Viaarxiv icon

SpeakerLM: End-to-End Versatile Speaker Diarization and Recognition with Multimodal Large Language Models

Add code
Aug 08, 2025
Viaarxiv icon

EnvSDD: Benchmarking Environmental Sound Deepfake Detection

Add code
May 25, 2025
Viaarxiv icon

Pushing the Frontiers of Self-Distillation Prototypes Network with Dimension Regularization and Score Normalization

Add code
May 20, 2025
Viaarxiv icon

Leveraging LLM and Text-Queried Separation for Noise-Robust Sound Event Detection

Add code
Nov 02, 2024
Viaarxiv icon

Multimodal Clickbait Detection by De-confounding Biases Using Causal Representation Inference

Add code
Oct 10, 2024
Figure 1 for Multimodal Clickbait Detection by De-confounding Biases Using Causal Representation Inference
Figure 2 for Multimodal Clickbait Detection by De-confounding Biases Using Causal Representation Inference
Figure 3 for Multimodal Clickbait Detection by De-confounding Biases Using Causal Representation Inference
Figure 4 for Multimodal Clickbait Detection by De-confounding Biases Using Causal Representation Inference
Viaarxiv icon

Mixstyle based Domain Generalization for Sound Event Detection with Heterogeneous Training Data

Add code
Jul 04, 2024
Viaarxiv icon

FMSG-JLESS Submission for DCASE 2024 Task4 on Sound Event Detection with Heterogeneous Training Dataset and Potentially Missing Labels

Add code
Jun 29, 2024
Viaarxiv icon

Description on IEEE ICME 2024 Grand Challenge: Semi-supervised Acoustic Scene Classification under Domain Shift

Add code
Feb 05, 2024
Figure 1 for Description on IEEE ICME 2024 Grand Challenge: Semi-supervised Acoustic Scene Classification under Domain Shift
Figure 2 for Description on IEEE ICME 2024 Grand Challenge: Semi-supervised Acoustic Scene Classification under Domain Shift
Figure 3 for Description on IEEE ICME 2024 Grand Challenge: Semi-supervised Acoustic Scene Classification under Domain Shift
Figure 4 for Description on IEEE ICME 2024 Grand Challenge: Semi-supervised Acoustic Scene Classification under Domain Shift
Viaarxiv icon

Sub-band and Full-band Interactive U-Net with DPRNN for Demixing Cross-talk Stereo Music

Add code
Jan 11, 2024
Figure 1 for Sub-band and Full-band Interactive U-Net with DPRNN for Demixing Cross-talk Stereo Music
Figure 2 for Sub-band and Full-band Interactive U-Net with DPRNN for Demixing Cross-talk Stereo Music
Figure 3 for Sub-band and Full-band Interactive U-Net with DPRNN for Demixing Cross-talk Stereo Music
Figure 4 for Sub-band and Full-band Interactive U-Net with DPRNN for Demixing Cross-talk Stereo Music
Viaarxiv icon