Alert button
Picture for Huang Xie

Huang Xie

Alert button

Crowdsourcing and Evaluating Text-Based Audio Retrieval Relevances

Jun 16, 2023
Huang Xie, Khazar Khorrami, Okko Räsänen, Tuomas Virtanen

Figure 1 for Crowdsourcing and Evaluating Text-Based Audio Retrieval Relevances
Figure 2 for Crowdsourcing and Evaluating Text-Based Audio Retrieval Relevances
Figure 3 for Crowdsourcing and Evaluating Text-Based Audio Retrieval Relevances
Figure 4 for Crowdsourcing and Evaluating Text-Based Audio Retrieval Relevances
Viaarxiv icon

On Negative Sampling for Contrastive Audio-Text Retrieval

Nov 08, 2022
Huang Xie, Okko Räsänen, Tuomas Virtanen

Figure 1 for On Negative Sampling for Contrastive Audio-Text Retrieval
Figure 2 for On Negative Sampling for Contrastive Audio-Text Retrieval
Viaarxiv icon

Language-based Audio Retrieval Task in DCASE 2022 Challenge

Oct 04, 2022
Huang Xie, Samuel Lipping, Tuomas Virtanen

Figure 1 for Language-based Audio Retrieval Task in DCASE 2022 Challenge
Figure 2 for Language-based Audio Retrieval Task in DCASE 2022 Challenge
Figure 3 for Language-based Audio Retrieval Task in DCASE 2022 Challenge
Figure 4 for Language-based Audio Retrieval Task in DCASE 2022 Challenge
Viaarxiv icon

DCASE 2022 Challenge Task 6B: Language-Based Audio Retrieval

Jun 15, 2022
Huang Xie, Samuel Lipping, Tuomas Virtanen

Figure 1 for DCASE 2022 Challenge Task 6B: Language-Based Audio Retrieval
Figure 2 for DCASE 2022 Challenge Task 6B: Language-Based Audio Retrieval
Figure 3 for DCASE 2022 Challenge Task 6B: Language-Based Audio Retrieval
Viaarxiv icon

Zero-Shot Audio Classification using Image Embeddings

Jun 10, 2022
Duygu Dogan, Huang Xie, Toni Heittola, Tuomas Virtanen

Figure 1 for Zero-Shot Audio Classification using Image Embeddings
Figure 2 for Zero-Shot Audio Classification using Image Embeddings
Figure 3 for Zero-Shot Audio Classification using Image Embeddings
Figure 4 for Zero-Shot Audio Classification using Image Embeddings
Viaarxiv icon

Unsupervised Audio-Caption Aligning Learns Correspondences between Individual Sound Events and Textual Phrases

Oct 06, 2021
Huang Xie, Okko Räsänen, Konstantinos Drossos, Tuomas Virtanen

Figure 1 for Unsupervised Audio-Caption Aligning Learns Correspondences between Individual Sound Events and Textual Phrases
Figure 2 for Unsupervised Audio-Caption Aligning Learns Correspondences between Individual Sound Events and Textual Phrases
Figure 3 for Unsupervised Audio-Caption Aligning Learns Correspondences between Individual Sound Events and Textual Phrases
Figure 4 for Unsupervised Audio-Caption Aligning Learns Correspondences between Individual Sound Events and Textual Phrases
Viaarxiv icon

Zero-Shot Audio Classification Based on Class Label Embeddings

May 06, 2019
Huang Xie, Tuomas Virtanen

Figure 1 for Zero-Shot Audio Classification Based on Class Label Embeddings
Figure 2 for Zero-Shot Audio Classification Based on Class Label Embeddings
Figure 3 for Zero-Shot Audio Classification Based on Class Label Embeddings
Figure 4 for Zero-Shot Audio Classification Based on Class Label Embeddings
Viaarxiv icon