Picture for Yongqing Wang

Yongqing Wang

Enhancing Automated Audio Captioning via Large Language Models with Optimized Audio Encoding

Add code
Jun 19, 2024
Figure 1 for Enhancing Automated Audio Captioning via Large Language Models with Optimized Audio Encoding
Figure 2 for Enhancing Automated Audio Captioning via Large Language Models with Optimized Audio Encoding
Figure 3 for Enhancing Automated Audio Captioning via Large Language Models with Optimized Audio Encoding
Figure 4 for Enhancing Automated Audio Captioning via Large Language Models with Optimized Audio Encoding
Viaarxiv icon

Bridging Language Gaps in Audio-Text Retrieval

Add code
Jun 11, 2024
Figure 1 for Bridging Language Gaps in Audio-Text Retrieval
Figure 2 for Bridging Language Gaps in Audio-Text Retrieval
Figure 3 for Bridging Language Gaps in Audio-Text Retrieval
Figure 4 for Bridging Language Gaps in Audio-Text Retrieval
Viaarxiv icon

Scaling up masked audio encoder learning for general audio classification

Add code
Jun 11, 2024
Viaarxiv icon

Graph Domain Adaptation: Challenges, Progress and Prospects

Add code
Feb 01, 2024
Viaarxiv icon

Causality and Independence Enhancement for Biased Node Classification

Add code
Oct 14, 2023
Figure 1 for Causality and Independence Enhancement for Biased Node Classification
Figure 2 for Causality and Independence Enhancement for Biased Node Classification
Figure 3 for Causality and Independence Enhancement for Biased Node Classification
Figure 4 for Causality and Independence Enhancement for Biased Node Classification
Viaarxiv icon

CED: Consistent ensemble distillation for audio tagging

Add code
Sep 08, 2023
Figure 1 for CED: Consistent ensemble distillation for audio tagging
Figure 2 for CED: Consistent ensemble distillation for audio tagging
Figure 3 for CED: Consistent ensemble distillation for audio tagging
Figure 4 for CED: Consistent ensemble distillation for audio tagging
Viaarxiv icon

OpenGDA: Graph Domain Adaptation Benchmark for Cross-network Learning

Add code
Jul 21, 2023
Figure 1 for OpenGDA: Graph Domain Adaptation Benchmark for Cross-network Learning
Figure 2 for OpenGDA: Graph Domain Adaptation Benchmark for Cross-network Learning
Figure 3 for OpenGDA: Graph Domain Adaptation Benchmark for Cross-network Learning
Figure 4 for OpenGDA: Graph Domain Adaptation Benchmark for Cross-network Learning
Viaarxiv icon

Focus on the Sound around You: Monaural Target Speaker Extraction via Distance and Speaker Information

Add code
Jun 28, 2023
Figure 1 for Focus on the Sound around You: Monaural Target Speaker Extraction via Distance and Speaker Information
Figure 2 for Focus on the Sound around You: Monaural Target Speaker Extraction via Distance and Speaker Information
Figure 3 for Focus on the Sound around You: Monaural Target Speaker Extraction via Distance and Speaker Information
Figure 4 for Focus on the Sound around You: Monaural Target Speaker Extraction via Distance and Speaker Information
Viaarxiv icon

AV-SepFormer: Cross-Attention SepFormer for Audio-Visual Target Speaker Extraction

Add code
Jun 25, 2023
Figure 1 for AV-SepFormer: Cross-Attention SepFormer for Audio-Visual Target Speaker Extraction
Figure 2 for AV-SepFormer: Cross-Attention SepFormer for Audio-Visual Target Speaker Extraction
Figure 3 for AV-SepFormer: Cross-Attention SepFormer for Audio-Visual Target Speaker Extraction
Figure 4 for AV-SepFormer: Cross-Attention SepFormer for Audio-Visual Target Speaker Extraction
Viaarxiv icon

Understanding temporally weakly supervised training: A case study for keyword spotting

Add code
May 30, 2023
Figure 1 for Understanding temporally weakly supervised training: A case study for keyword spotting
Figure 2 for Understanding temporally weakly supervised training: A case study for keyword spotting
Figure 3 for Understanding temporally weakly supervised training: A case study for keyword spotting
Figure 4 for Understanding temporally weakly supervised training: A case study for keyword spotting
Viaarxiv icon