Self-supervised Context-aware Style Representation for Expressive Speech Synthesis



Yihan Wu , Xi Wang , Shaofei Zhang , Lei He , Ruihua Song , Jian-Yun Nie

* Accepted by Interspeech 2022 

A Roadmap for Big Model



Sha Yuan , Hanyu Zhao , Shuai Zhao , Jiahong Leng , Yangxiao Liang , Xiaozhi Wang , Jifan Yu , Xin Lv , Zhou Shao , Jiaao He , Yankai Lin , Xu Han , Zhenghao Liu , Ning Ding , Yongming Rao , Yizhao Gao , Liang Zhang , Ming Ding , Cong Fang , Yisen Wang , Mingsheng Long , Jing Zhang , Yinpeng Dong , Tianyu Pang , Peng Cui , Lingxiao Huang , Zheng Liang , Huawei Shen , Hui Zhang , Quanshi Zhang , Qingxiu Dong , Zhixing Tan , Mingxuan Wang , Shuo Wang , Long Zhou , Haoran Li , Junwei Bao , Yingwei Pan , Weinan Zhang , Zhou Yu , Rui Yan , Chence Shi , Minghao Xu , Zuobai Zhang , Guoqiang Wang , Xiang Pan , Mengjie Li , Xiaoyu Chu , Zijun Yao , Fangwei Zhu , Shulin Cao , Weicheng Xue , Zixuan Ma , Zhengyan Zhang , Shengding Hu , Yujia Qin , Chaojun Xiao , Zheni Zeng , Ganqu Cui , Weize Chen , Weilin Zhao , Yuan Yao , Peng Li , Wenzhao Zheng , Wenliang Zhao , Ziyi Wang , Borui Zhang , Nanyi Fei , Anwen Hu , Zenan Ling , Haoyang Li , Boxi Cao , Xianpei Han , Weidong Zhan , Baobao Chang , Hao Sun , Jiawen Deng , Chujie Zheng , Juanzi Li , Lei Hou , Xigang Cao , Jidong Zhai , Zhiyuan Liu , Maosong Sun , Jiwen Lu , Zhiwu Lu , Qin Jin , Ruihua Song , Ji-Rong Wen , Zhouchen Lin , Liwei Wang , Hang Su , Jun Zhu , Zhifang Sui , Jiajun Zhang , Yang Liu , Xiaodong He , Minlie Huang , Jian Tang , Jie Tang

* arXiv admin note: text overlap with arXiv:2107.06499 by other authors 

AdaSpeech 4: Adaptive Text to Speech in Zero-Shot Scenarios



Yihan Wu , Xu Tan , Bohan Li , Lei He , Sheng Zhao , Ruihua Song , Tao Qin , Tie-Yan Liu

* 5 pages, 2 tables, 2 figures. Submitted to Interspeech 2022

Class-aware Sounding Objects Localization via Audiovisual Correspondence



Di Hu , Yake Wei , Rui Qian , Weiyao Lin , Ruihua Song , Ji-Rong Wen

* Accepted by TPAMI 2021. Code: https://github.com/GeWu-Lab/CSOL_TPAMI2021

WenLan 2.0: Make AI Imagine via a Multimodal Foundation Model



Nanyi Fei , Zhiwu Lu , Yizhao Gao , Guoxing Yang , Yuqi Huo , Jingyuan Wen , Haoyu Lu , Ruihua Song , Xin Gao , Tao Xiang , Hao Sun , Ji-Rong Wen


Stylistic Retrieval-based Dialogue System with Unparallel Training Data



Hao Fu , Yan Wang , Ruihua Song , Tianran Hu , Jianyun Nie


Pre-Trained Models: Past, Present and Future



Xu Han , Zhengyan Zhang , Ning Ding , Yuxian Gu , Xiao Liu , Yuqi Huo , Jiezhong Qiu , Liang Zhang , Wentao Han , Minlie Huang , Qin Jin , Yanyan Lan , Yang Liu , Zhiyuan Liu , Zhiwu Lu , Xipeng Qiu , Ruihua Song , Jie Tang , Ji-Rong Wen , Jinhui Yuan , Wayne Xin Zhao , Jun Zhu


WenLan: Bridging Vision and Language by Large-Scale Multi-Modal Pre-Training



Yuqi Huo , Manli Zhang , Guangzhen Liu , Haoyu Lu , Yizhao Gao , Guoxing Yang , Jingyuan Wen , Heng Zhang , Baogui Xu , Weihao Zheng , Zongzheng Xi , Yueqian Yang , Anwen Hu , Jinming Zhao , Ruichen Li , Yida Zhao , Liang Zhang , Yuqing Song , Xin Hong , Wanqing Cui , Danyang Hou , Yingyan Li , Junyi Li , Peiyu Liu , Zheng Gong , Chuhao Jin , Yuchong Sun , Shizhe Chen , Zhiwu Lu , Zhicheng Dou , Qin Jin , Yanyan Lan , Wayne Xin Zhao , Ruihua Song , Ji-Rong Wen

* This paper is the outcome of the Chinese multi-modal pre-training project called 'WenLan' 
