Alert button
Picture for Xuenan Xu

Xuenan Xu

Alert button

A Detailed Audio-Text Data Simulation Pipeline using Single-Event Sounds

Add code
Bookmark button
Alert button
Mar 07, 2024
Xuenan Xu, Xiaohang Xu, Zeyu Xie, Pingyue Zhang, Mengyue Wu, Kai Yu

Figure 1 for A Detailed Audio-Text Data Simulation Pipeline using Single-Event Sounds
Figure 2 for A Detailed Audio-Text Data Simulation Pipeline using Single-Event Sounds
Figure 3 for A Detailed Audio-Text Data Simulation Pipeline using Single-Event Sounds
Figure 4 for A Detailed Audio-Text Data Simulation Pipeline using Single-Event Sounds
Viaarxiv icon

Enhancing Audio Generation Diversity with Visual Information

Add code
Bookmark button
Alert button
Mar 02, 2024
Zeyu Xie, Baihan Li, Xuenan Xu, Mengyue Wu, Kai Yu

Figure 1 for Enhancing Audio Generation Diversity with Visual Information
Figure 2 for Enhancing Audio Generation Diversity with Visual Information
Figure 3 for Enhancing Audio Generation Diversity with Visual Information
Figure 4 for Enhancing Audio Generation Diversity with Visual Information
Viaarxiv icon

Towards Weakly Supervised Text-to-Audio Grounding

Add code
Bookmark button
Alert button
Jan 05, 2024
Xuenan Xu, Ziyang Ma, Mengyue Wu, Kai Yu

Viaarxiv icon

A Large-scale Dataset for Audio-Language Representation Learning

Add code
Bookmark button
Alert button
Oct 03, 2023
Luoyi Sun, Xuenan Xu, Mengyue Wu, Weidi Xie

Figure 1 for A Large-scale Dataset for Audio-Language Representation Learning
Figure 2 for A Large-scale Dataset for Audio-Language Representation Learning
Figure 3 for A Large-scale Dataset for Audio-Language Representation Learning
Figure 4 for A Large-scale Dataset for Audio-Language Representation Learning
Viaarxiv icon

Improving Audio Caption Fluency with Automatic Error Correction

Add code
Bookmark button
Alert button
Jun 16, 2023
Hanxue Zhang, Zeyu Xie, Xuenan Xu, Mengyue Wu, Kai Yu

Figure 1 for Improving Audio Caption Fluency with Automatic Error Correction
Figure 2 for Improving Audio Caption Fluency with Automatic Error Correction
Figure 3 for Improving Audio Caption Fluency with Automatic Error Correction
Figure 4 for Improving Audio Caption Fluency with Automatic Error Correction
Viaarxiv icon

Enhance Temporal Relations in Audio Captioning with Sound Event Detection

Add code
Bookmark button
Alert button
Jun 02, 2023
Zeyu Xie, Xuenan Xu, Mengyue Wu, Kai Yu

Figure 1 for Enhance Temporal Relations in Audio Captioning with Sound Event Detection
Figure 2 for Enhance Temporal Relations in Audio Captioning with Sound Event Detection
Figure 3 for Enhance Temporal Relations in Audio Captioning with Sound Event Detection
Figure 4 for Enhance Temporal Relations in Audio Captioning with Sound Event Detection
Viaarxiv icon

Diverse and Vivid Sound Generation from Text Descriptions

Add code
Bookmark button
Alert button
May 03, 2023
Guangwei Li, Xuenan Xu, Lingfeng Dai, Mengyue Wu, Kai Yu

Figure 1 for Diverse and Vivid Sound Generation from Text Descriptions
Figure 2 for Diverse and Vivid Sound Generation from Text Descriptions
Figure 3 for Diverse and Vivid Sound Generation from Text Descriptions
Figure 4 for Diverse and Vivid Sound Generation from Text Descriptions
Viaarxiv icon

BLAT: Bootstrapping Language-Audio Pre-training based on AudioSet Tag-guided Synthetic Data

Add code
Bookmark button
Alert button
Mar 14, 2023
Xuenan Xu, Zhiling Zhang, Zelin Zhou, Pingyue Zhang, Zeyu Xie, Mengyue Wu, Kenny Q. Zhu

Figure 1 for BLAT: Bootstrapping Language-Audio Pre-training based on AudioSet Tag-guided Synthetic Data
Figure 2 for BLAT: Bootstrapping Language-Audio Pre-training based on AudioSet Tag-guided Synthetic Data
Figure 3 for BLAT: Bootstrapping Language-Audio Pre-training based on AudioSet Tag-guided Synthetic Data
Figure 4 for BLAT: Bootstrapping Language-Audio Pre-training based on AudioSet Tag-guided Synthetic Data
Viaarxiv icon

A Comprehensive Survey of Automated Audio Captioning

Add code
Bookmark button
Alert button
May 11, 2022
Xuenan Xu, Mengyue Wu, Kai Yu

Figure 1 for A Comprehensive Survey of Automated Audio Captioning
Figure 2 for A Comprehensive Survey of Automated Audio Captioning
Figure 3 for A Comprehensive Survey of Automated Audio Captioning
Figure 4 for A Comprehensive Survey of Automated Audio Captioning
Viaarxiv icon