Picture for Jinzheng Zhao

Jinzheng Zhao

Fusion of Audio and Visual Embeddings for Sound Event Localization and Detection

Add code
Dec 14, 2023
Figure 1 for Fusion of Audio and Visual Embeddings for Sound Event Localization and Detection
Figure 2 for Fusion of Audio and Visual Embeddings for Sound Event Localization and Detection
Figure 3 for Fusion of Audio and Visual Embeddings for Sound Event Localization and Detection
Viaarxiv icon

Audio-Visual Speaker Tracking: Progress, Challenges, and Future Directions

Oct 23, 2023
Figure 1 for Audio-Visual Speaker Tracking: Progress, Challenges, and Future Directions
Figure 2 for Audio-Visual Speaker Tracking: Progress, Challenges, and Future Directions
Figure 3 for Audio-Visual Speaker Tracking: Progress, Challenges, and Future Directions
Figure 4 for Audio-Visual Speaker Tracking: Progress, Challenges, and Future Directions
Viaarxiv icon

Towards Robust and Generalizable Training: An Empirical Study of Noisy Slot Filling for Input Perturbations

Add code
Oct 05, 2023
Figure 1 for Towards Robust and Generalizable Training: An Empirical Study of Noisy Slot Filling for Input Perturbations
Figure 2 for Towards Robust and Generalizable Training: An Empirical Study of Noisy Slot Filling for Input Perturbations
Figure 3 for Towards Robust and Generalizable Training: An Empirical Study of Noisy Slot Filling for Input Perturbations
Figure 4 for Towards Robust and Generalizable Training: An Empirical Study of Noisy Slot Filling for Input Perturbations
Viaarxiv icon

Audio Visual Speaker Localization from EgoCentric Views

Add code
Sep 28, 2023
Figure 1 for Audio Visual Speaker Localization from EgoCentric Views
Figure 2 for Audio Visual Speaker Localization from EgoCentric Views
Figure 3 for Audio Visual Speaker Localization from EgoCentric Views
Figure 4 for Audio Visual Speaker Localization from EgoCentric Views
Viaarxiv icon

Generative Zero-Shot Prompt Learning for Cross-Domain Slot Filling with Inverse Prompting

Add code
Jul 06, 2023
Viaarxiv icon

PSSAT: A Perturbed Semantic Structure Awareness Transferring Method for Perturbation-Robust Slot Filling

Aug 31, 2022
Figure 1 for PSSAT: A Perturbed Semantic Structure Awareness Transferring Method for Perturbation-Robust Slot Filling
Figure 2 for PSSAT: A Perturbed Semantic Structure Awareness Transferring Method for Perturbation-Robust Slot Filling
Figure 3 for PSSAT: A Perturbed Semantic Structure Awareness Transferring Method for Perturbation-Robust Slot Filling
Figure 4 for PSSAT: A Perturbed Semantic Structure Awareness Transferring Method for Perturbation-Robust Slot Filling
Viaarxiv icon

A Robust Contrastive Alignment Method For Multi-Domain Text Classification

Apr 26, 2022
Figure 1 for A Robust Contrastive Alignment Method For Multi-Domain Text Classification
Figure 2 for A Robust Contrastive Alignment Method For Multi-Domain Text Classification
Figure 3 for A Robust Contrastive Alignment Method For Multi-Domain Text Classification
Figure 4 for A Robust Contrastive Alignment Method For Multi-Domain Text Classification
Viaarxiv icon

Separate What You Describe: Language-Queried Audio Source Separation

Add code
Mar 28, 2022
Figure 1 for Separate What You Describe: Language-Queried Audio Source Separation
Figure 2 for Separate What You Describe: Language-Queried Audio Source Separation
Figure 3 for Separate What You Describe: Language-Queried Audio Source Separation
Figure 4 for Separate What You Describe: Language-Queried Audio Source Separation
Viaarxiv icon

Leveraging Pre-trained BERT for Audio Captioning

Mar 27, 2022
Figure 1 for Leveraging Pre-trained BERT for Audio Captioning
Figure 2 for Leveraging Pre-trained BERT for Audio Captioning
Figure 3 for Leveraging Pre-trained BERT for Audio Captioning
Figure 4 for Leveraging Pre-trained BERT for Audio Captioning
Viaarxiv icon

Deep Neural Decision Forest for Acoustic Scene Classification

Mar 07, 2022
Figure 1 for Deep Neural Decision Forest for Acoustic Scene Classification
Figure 2 for Deep Neural Decision Forest for Acoustic Scene Classification
Figure 3 for Deep Neural Decision Forest for Acoustic Scene Classification
Figure 4 for Deep Neural Decision Forest for Acoustic Scene Classification
Viaarxiv icon