Alert button

"music": models, code, and papers
Alert button

MuLan: A Joint Embedding of Music Audio and Natural Language

Aug 26, 2022
Qingqing Huang, Aren Jansen, Joonseok Lee, Ravi Ganti, Judith Yue Li, Daniel P. W. Ellis

Figure 1 for MuLan: A Joint Embedding of Music Audio and Natural Language
Figure 2 for MuLan: A Joint Embedding of Music Audio and Natural Language
Figure 3 for MuLan: A Joint Embedding of Music Audio and Natural Language
Figure 4 for MuLan: A Joint Embedding of Music Audio and Natural Language
Viaarxiv icon

Inaudible Adversarial Perturbation: Manipulating the Recognition of User Speech in Real Tim

Add code
Bookmark button
Alert button
Aug 02, 2023
Xinfeng Li, Chen Yan, Xuancun Lu, Zihan Zeng, Xiaoyu Ji, Wenyuan Xu

Figure 1 for Inaudible Adversarial Perturbation: Manipulating the Recognition of User Speech in Real Tim
Figure 2 for Inaudible Adversarial Perturbation: Manipulating the Recognition of User Speech in Real Tim
Figure 3 for Inaudible Adversarial Perturbation: Manipulating the Recognition of User Speech in Real Tim
Figure 4 for Inaudible Adversarial Perturbation: Manipulating the Recognition of User Speech in Real Tim
Viaarxiv icon

Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong General Audio Event Taggers

Add code
Bookmark button
Alert button
Jul 06, 2023
Yuan Gong, Sameer Khurana, Leonid Karlinsky, James Glass

Figure 1 for Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong General Audio Event Taggers
Figure 2 for Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong General Audio Event Taggers
Figure 3 for Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong General Audio Event Taggers
Figure 4 for Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong General Audio Event Taggers
Viaarxiv icon

RAWIW: RAW Image Watermarking Robust to ISP Pipeline

Jul 28, 2023
Kang Fu, Xiaohong Liu, Jun Jia, Zicheng Zhang, Yicong Peng, Jia Wang, Guangtao Zhai

Figure 1 for RAWIW: RAW Image Watermarking Robust to ISP Pipeline
Figure 2 for RAWIW: RAW Image Watermarking Robust to ISP Pipeline
Figure 3 for RAWIW: RAW Image Watermarking Robust to ISP Pipeline
Figure 4 for RAWIW: RAW Image Watermarking Robust to ISP Pipeline
Viaarxiv icon

Generating coherent comic with rich story using ChatGPT and Stable Diffusion

May 19, 2023
Ze Jin, Zorina Song

Viaarxiv icon

Computing Melodic Templates in Oral Music Traditions

Sep 27, 2022
Sergey Bereg, José-Miguel Díaz-Báñez, Nadine Kroher, Inmaculada Ventura

Figure 1 for Computing Melodic Templates in Oral Music Traditions
Figure 2 for Computing Melodic Templates in Oral Music Traditions
Figure 3 for Computing Melodic Templates in Oral Music Traditions
Figure 4 for Computing Melodic Templates in Oral Music Traditions
Viaarxiv icon

Improving Choral Music Separation through Expressive Synthesized Data from Sampled Instruments

Add code
Bookmark button
Alert button
Sep 07, 2022
Ke Chen, Hao-Wen Dong, Yi Luo, Julian McAuley, Taylor Berg-Kirkpatrick, Miller Puckette, Shlomo Dubnov

Figure 1 for Improving Choral Music Separation through Expressive Synthesized Data from Sampled Instruments
Figure 2 for Improving Choral Music Separation through Expressive Synthesized Data from Sampled Instruments
Figure 3 for Improving Choral Music Separation through Expressive Synthesized Data from Sampled Instruments
Figure 4 for Improving Choral Music Separation through Expressive Synthesized Data from Sampled Instruments
Viaarxiv icon

Music Recommendation System based on Emotion, Age and Ethnicity

Dec 09, 2022
Ramiz Mammadli, Huma Bilgin, Ali Can Karaca

Figure 1 for Music Recommendation System based on Emotion, Age and Ethnicity
Figure 2 for Music Recommendation System based on Emotion, Age and Ethnicity
Figure 3 for Music Recommendation System based on Emotion, Age and Ethnicity
Figure 4 for Music Recommendation System based on Emotion, Age and Ethnicity
Viaarxiv icon

Supervised and Unsupervised Learning of Audio Representations for Music Understanding

Add code
Bookmark button
Alert button
Oct 07, 2022
Matthew C. McCallum, Filip Korzeniowski, Sergio Oramas, Fabien Gouyon, Andreas F. Ehmann

Figure 1 for Supervised and Unsupervised Learning of Audio Representations for Music Understanding
Figure 2 for Supervised and Unsupervised Learning of Audio Representations for Music Understanding
Figure 3 for Supervised and Unsupervised Learning of Audio Representations for Music Understanding
Figure 4 for Supervised and Unsupervised Learning of Audio Representations for Music Understanding
Viaarxiv icon

WavJourney: Compositional Audio Creation with Large Language Models

Add code
Bookmark button
Alert button
Jul 26, 2023
Xubo Liu, Zhongkai Zhu, Haohe Liu, Yi Yuan, Meng Cui, Qiushi Huang, Jinhua Liang, Yin Cao, Qiuqiang Kong, Mark D. Plumbley, Wenwu Wang

Viaarxiv icon