Attention-based cross-modal fusion for audio-visual voice activity detection in musical video streams

Yuanbo Hou, Zhesong Yu, Xia Liang, Xingjian Du, Bilei Zhu, Zejun Ma, Dick Botteldooren

* Accepted by INTERSPEECH 2021 

MIDI-Sandwich2: RNN-based Hierarchical Multi-modal Fusion Generation VAE networks for multi-track symbolic music generation

Xia Liang, Junmin Wu, Jing Cao

MIDI-Sandwich: Multi-model Multi-task Hierarchical Conditional VAE-GAN networks for Symbolic Single-track Music Generation

Xia Liang, Junmin Wu, Yan Yin

* Submitted to KSEM 2019 on May 3, 2019 (weak reject)
