Picture for Jinzheng Zhao

Jinzheng Zhao

Universal Sound Separation with Self-Supervised Audio Masked Autoencoder

Add code
Jul 16, 2024
Viaarxiv icon

Text-Queried Target Sound Event Localization

Add code
Jun 23, 2024
Viaarxiv icon

Fish Tracking, Counting, and Behaviour Analysis in Digital Aquaculture: A Comprehensive Review

Add code
Jun 20, 2024
Viaarxiv icon

Fusion of Audio and Visual Embeddings for Sound Event Localization and Detection

Add code
Dec 14, 2023
Figure 1 for Fusion of Audio and Visual Embeddings for Sound Event Localization and Detection
Figure 2 for Fusion of Audio and Visual Embeddings for Sound Event Localization and Detection
Figure 3 for Fusion of Audio and Visual Embeddings for Sound Event Localization and Detection
Viaarxiv icon

Audio-Visual Speaker Tracking: Progress, Challenges, and Future Directions

Add code
Oct 23, 2023
Figure 1 for Audio-Visual Speaker Tracking: Progress, Challenges, and Future Directions
Figure 2 for Audio-Visual Speaker Tracking: Progress, Challenges, and Future Directions
Figure 3 for Audio-Visual Speaker Tracking: Progress, Challenges, and Future Directions
Figure 4 for Audio-Visual Speaker Tracking: Progress, Challenges, and Future Directions
Viaarxiv icon

Towards Robust and Generalizable Training: An Empirical Study of Noisy Slot Filling for Input Perturbations

Add code
Oct 05, 2023
Figure 1 for Towards Robust and Generalizable Training: An Empirical Study of Noisy Slot Filling for Input Perturbations
Figure 2 for Towards Robust and Generalizable Training: An Empirical Study of Noisy Slot Filling for Input Perturbations
Figure 3 for Towards Robust and Generalizable Training: An Empirical Study of Noisy Slot Filling for Input Perturbations
Figure 4 for Towards Robust and Generalizable Training: An Empirical Study of Noisy Slot Filling for Input Perturbations
Viaarxiv icon

Audio Visual Speaker Localization from EgoCentric Views

Add code
Sep 28, 2023
Figure 1 for Audio Visual Speaker Localization from EgoCentric Views
Figure 2 for Audio Visual Speaker Localization from EgoCentric Views
Figure 3 for Audio Visual Speaker Localization from EgoCentric Views
Figure 4 for Audio Visual Speaker Localization from EgoCentric Views
Viaarxiv icon

Generative Zero-Shot Prompt Learning for Cross-Domain Slot Filling with Inverse Prompting

Add code
Jul 06, 2023
Viaarxiv icon

PSSAT: A Perturbed Semantic Structure Awareness Transferring Method for Perturbation-Robust Slot Filling

Add code
Aug 31, 2022
Figure 1 for PSSAT: A Perturbed Semantic Structure Awareness Transferring Method for Perturbation-Robust Slot Filling
Figure 2 for PSSAT: A Perturbed Semantic Structure Awareness Transferring Method for Perturbation-Robust Slot Filling
Figure 3 for PSSAT: A Perturbed Semantic Structure Awareness Transferring Method for Perturbation-Robust Slot Filling
Figure 4 for PSSAT: A Perturbed Semantic Structure Awareness Transferring Method for Perturbation-Robust Slot Filling
Viaarxiv icon

A Robust Contrastive Alignment Method For Multi-Domain Text Classification

Add code
Apr 26, 2022
Figure 1 for A Robust Contrastive Alignment Method For Multi-Domain Text Classification
Figure 2 for A Robust Contrastive Alignment Method For Multi-Domain Text Classification
Figure 3 for A Robust Contrastive Alignment Method For Multi-Domain Text Classification
Figure 4 for A Robust Contrastive Alignment Method For Multi-Domain Text Classification
Viaarxiv icon