Alert button
Picture for Shansong Liu

Shansong Liu

Alert button

M$^{2}$UGen: Multi-modal Music Understanding and Generation with the Power of Large Language Models

Add code
Bookmark button
Alert button
Nov 28, 2023
Atin Sakkeer Hussain, Shansong Liu, Chenshuo Sun, Ying Shan

Figure 1 for M$^{2}$UGen: Multi-modal Music Understanding and Generation with the Power of Large Language Models
Figure 2 for M$^{2}$UGen: Multi-modal Music Understanding and Generation with the Power of Large Language Models
Figure 3 for M$^{2}$UGen: Multi-modal Music Understanding and Generation with the Power of Large Language Models
Figure 4 for M$^{2}$UGen: Multi-modal Music Understanding and Generation with the Power of Large Language Models
Viaarxiv icon

HumTrans: A Novel Open-Source Dataset for Humming Melody Transcription and Beyond

Add code
Bookmark button
Alert button
Sep 18, 2023
Shansong Liu, Xu Li, Dian Li, Ying Shan

Figure 1 for HumTrans: A Novel Open-Source Dataset for Humming Melody Transcription and Beyond
Figure 2 for HumTrans: A Novel Open-Source Dataset for Humming Melody Transcription and Beyond
Figure 3 for HumTrans: A Novel Open-Source Dataset for Humming Melody Transcription and Beyond
Figure 4 for HumTrans: A Novel Open-Source Dataset for Humming Melody Transcription and Beyond
Viaarxiv icon

Music Understanding LLaMA: Advancing Text-to-Music Generation with Question Answering and Captioning

Add code
Bookmark button
Alert button
Aug 22, 2023
Shansong Liu, Atin Sakkeer Hussain, Chenshuo Sun, Ying Shan

Figure 1 for Music Understanding LLaMA: Advancing Text-to-Music Generation with Question Answering and Captioning
Figure 2 for Music Understanding LLaMA: Advancing Text-to-Music Generation with Question Answering and Captioning
Figure 3 for Music Understanding LLaMA: Advancing Text-to-Music Generation with Question Answering and Captioning
Figure 4 for Music Understanding LLaMA: Advancing Text-to-Music Generation with Question Answering and Captioning
Viaarxiv icon

A Hierarchical Speaker Representation Framework for One-shot Singing Voice Conversion

Add code
Bookmark button
Alert button
Jul 06, 2022
Xu Li, Shansong Liu, Ying Shan

Figure 1 for A Hierarchical Speaker Representation Framework for One-shot Singing Voice Conversion
Figure 2 for A Hierarchical Speaker Representation Framework for One-shot Singing Voice Conversion
Figure 3 for A Hierarchical Speaker Representation Framework for One-shot Singing Voice Conversion
Figure 4 for A Hierarchical Speaker Representation Framework for One-shot Singing Voice Conversion
Viaarxiv icon

Exploiting Cross Domain Acoustic-to-articulatory Inverted Features For Disordered Speech Recognition

Add code
Bookmark button
Alert button
Mar 19, 2022
Shujie Hu, Shansong Liu, Xurong Xie, Mengzhe Geng, Tianzi Wang, Shoukang Hu, Mingyu Cui, Xunying Liu, Helen Meng

Figure 1 for Exploiting Cross Domain Acoustic-to-articulatory Inverted Features For Disordered Speech Recognition
Figure 2 for Exploiting Cross Domain Acoustic-to-articulatory Inverted Features For Disordered Speech Recognition
Figure 3 for Exploiting Cross Domain Acoustic-to-articulatory Inverted Features For Disordered Speech Recognition
Figure 4 for Exploiting Cross Domain Acoustic-to-articulatory Inverted Features For Disordered Speech Recognition
Viaarxiv icon

Recent Progress in the CUHK Dysarthric Speech Recognition System

Add code
Bookmark button
Alert button
Jan 15, 2022
Shansong Liu, Mengzhe Geng, Shoukang Hu, Xurong Xie, Mingyu Cui, Jianwei Yu, Xunying Liu, Helen Meng

Figure 1 for Recent Progress in the CUHK Dysarthric Speech Recognition System
Figure 2 for Recent Progress in the CUHK Dysarthric Speech Recognition System
Figure 3 for Recent Progress in the CUHK Dysarthric Speech Recognition System
Figure 4 for Recent Progress in the CUHK Dysarthric Speech Recognition System
Viaarxiv icon

Investigation of Data Augmentation Techniques for Disordered Speech Recognition

Add code
Bookmark button
Alert button
Jan 14, 2022
Mengzhe Geng, Xurong Xie, Shansong Liu, Jianwei Yu, Shoukang Hu, Xunying Liu, Helen Meng

Figure 1 for Investigation of Data Augmentation Techniques for Disordered Speech Recognition
Figure 2 for Investigation of Data Augmentation Techniques for Disordered Speech Recognition
Figure 3 for Investigation of Data Augmentation Techniques for Disordered Speech Recognition
Figure 4 for Investigation of Data Augmentation Techniques for Disordered Speech Recognition
Viaarxiv icon

Spectro-Temporal Deep Features for Disordered Speech Assessment and Recognition

Add code
Bookmark button
Alert button
Jan 14, 2022
Mengzhe Geng, Shansong Liu, Jianwei Yu, Xurong Xie, Shoukang Hu, Zi Ye, Zengrui Jin, Xunying Liu, Helen Meng

Figure 1 for Spectro-Temporal Deep Features for Disordered Speech Assessment and Recognition
Figure 2 for Spectro-Temporal Deep Features for Disordered Speech Assessment and Recognition
Figure 3 for Spectro-Temporal Deep Features for Disordered Speech Assessment and Recognition
Figure 4 for Spectro-Temporal Deep Features for Disordered Speech Assessment and Recognition
Viaarxiv icon