Alert button
Picture for Zeyu Xie

Zeyu Xie

Alert button

A Detailed Audio-Text Data Simulation Pipeline using Single-Event Sounds

Add code
Bookmark button
Alert button
Mar 07, 2024
Xuenan Xu, Xiaohang Xu, Zeyu Xie, Pingyue Zhang, Mengyue Wu, Kai Yu

Figure 1 for A Detailed Audio-Text Data Simulation Pipeline using Single-Event Sounds
Figure 2 for A Detailed Audio-Text Data Simulation Pipeline using Single-Event Sounds
Figure 3 for A Detailed Audio-Text Data Simulation Pipeline using Single-Event Sounds
Figure 4 for A Detailed Audio-Text Data Simulation Pipeline using Single-Event Sounds
Viaarxiv icon

Enhancing Audio Generation Diversity with Visual Information

Add code
Bookmark button
Alert button
Mar 02, 2024
Zeyu Xie, Baihan Li, Xuenan Xu, Mengyue Wu, Kai Yu

Figure 1 for Enhancing Audio Generation Diversity with Visual Information
Figure 2 for Enhancing Audio Generation Diversity with Visual Information
Figure 3 for Enhancing Audio Generation Diversity with Visual Information
Figure 4 for Enhancing Audio Generation Diversity with Visual Information
Viaarxiv icon

Phonetic and Lexical Discovery of a Canine Language using HuBERT

Add code
Bookmark button
Alert button
Feb 25, 2024
Xingyuan Li, Sinong Wang, Zeyu Xie, Mengyue Wu, Kenny Q. Zhu

Viaarxiv icon

Improving Audio Caption Fluency with Automatic Error Correction

Add code
Bookmark button
Alert button
Jun 16, 2023
Hanxue Zhang, Zeyu Xie, Xuenan Xu, Mengyue Wu, Kai Yu

Figure 1 for Improving Audio Caption Fluency with Automatic Error Correction
Figure 2 for Improving Audio Caption Fluency with Automatic Error Correction
Figure 3 for Improving Audio Caption Fluency with Automatic Error Correction
Figure 4 for Improving Audio Caption Fluency with Automatic Error Correction
Viaarxiv icon

Enhance Temporal Relations in Audio Captioning with Sound Event Detection

Add code
Bookmark button
Alert button
Jun 02, 2023
Zeyu Xie, Xuenan Xu, Mengyue Wu, Kai Yu

Figure 1 for Enhance Temporal Relations in Audio Captioning with Sound Event Detection
Figure 2 for Enhance Temporal Relations in Audio Captioning with Sound Event Detection
Figure 3 for Enhance Temporal Relations in Audio Captioning with Sound Event Detection
Figure 4 for Enhance Temporal Relations in Audio Captioning with Sound Event Detection
Viaarxiv icon

BLAT: Bootstrapping Language-Audio Pre-training based on AudioSet Tag-guided Synthetic Data

Add code
Bookmark button
Alert button
Mar 14, 2023
Xuenan Xu, Zhiling Zhang, Zelin Zhou, Pingyue Zhang, Zeyu Xie, Mengyue Wu, Kenny Q. Zhu

Figure 1 for BLAT: Bootstrapping Language-Audio Pre-training based on AudioSet Tag-guided Synthetic Data
Figure 2 for BLAT: Bootstrapping Language-Audio Pre-training based on AudioSet Tag-guided Synthetic Data
Figure 3 for BLAT: Bootstrapping Language-Audio Pre-training based on AudioSet Tag-guided Synthetic Data
Figure 4 for BLAT: Bootstrapping Language-Audio Pre-training based on AudioSet Tag-guided Synthetic Data
Viaarxiv icon

Can Audio Captions Be Evaluated with Image Caption Metrics?

Add code
Bookmark button
Alert button
Oct 10, 2021
Zelin Zhou, Zhiling Zhang, Xuenan Xu, Zeyu Xie, Mengyue Wu, Kenny Q. Zhu

Figure 1 for Can Audio Captions Be Evaluated with Image Caption Metrics?
Figure 2 for Can Audio Captions Be Evaluated with Image Caption Metrics?
Figure 3 for Can Audio Captions Be Evaluated with Image Caption Metrics?
Figure 4 for Can Audio Captions Be Evaluated with Image Caption Metrics?
Viaarxiv icon

Investigating Local and Global Information for Automated Audio Captioning with Transfer Learning

Add code
Bookmark button
Alert button
Feb 23, 2021
Xuenan Xu, Heinrich Dinkel, Mengyue Wu, Zeyu Xie, Kai Yu

Figure 1 for Investigating Local and Global Information for Automated Audio Captioning with Transfer Learning
Figure 2 for Investigating Local and Global Information for Automated Audio Captioning with Transfer Learning
Figure 3 for Investigating Local and Global Information for Automated Audio Captioning with Transfer Learning
Viaarxiv icon