"music": models, code, and papers

Multi-Channel Target Speaker Extraction with Refinement: The WavLab Submission to the Second Clarity Enhancement Challenge

Feb 15, 2023
Samuele Cornell, Zhong-Qiu Wang, Yoshiki Masuyama, Shinji Watanabe, Manuel Pariente, Nobutaka Ono

SpecTNT: a Time-Frequency Transformer for Music Audio

Oct 18, 2021
Wei-Tsung Lu, Ju-Chiang Wang, Minz Won, Keunwoo Choi, Xuchen Song

Comparison and Analysis of Deep Audio Embeddings for Music Emotion Recognition

Apr 13, 2021
Eunjeong Koh, Shlomo Dubnov

Analysis and Detection of Singing Techniques in Repertoires of J-POP Solo Singers

Oct 31, 2022
Yuya Yamamoto, Juhan Nam, Hiroko Terasawa

Embedding Calibration for Music Semantic Similarity using Auto-regressive Transformer

Mar 13, 2021
Xinran Zhang, Maosong Sun, Jiafeng Liu, Xiaobing Li

TunesFormer: Forming Tunes with Control Codes

Jan 07, 2023
Shangda Wu, Maosong Sun

Learning to Denoise Historical Music

Aug 05, 2020
Yunpeng Li, Beat Gfeller, Marco Tagliasacchi, Dominik Roblek

MR4MR: Mixed Reality for Melody Reincarnation

Sep 15, 2022
Atsuya Kobayashi, Ryogo Ishino, Ryuku Nobusue, Takumi Inoue, Keisuke Okazaki, Shoma Sawa, Nao Tokui

Beurling-Selberg Extremization for Dual-Blind Deconvolution Recovery in Joint Radar-Communications

Nov 18, 2022
Jonathan Monsalve, Edwin Vargas, Kumar Vijay Mishra, Brian M. Sadler, Henry Arguello

CCOM-HuQin: an Annotated Multimodal Chinese Fiddle Performance Dataset

Sep 14, 2022
Yu Zhang, Ziya Zhou, Xiaobing Li, Feng Yu, Maosong Sun
