Alert button
Picture for Ruihua Song

Ruihua Song

Alert button

TikTalk: A Multi-Modal Dialogue Dataset for Real-World Chitchat

Add code
Bookmark button
Alert button
Jan 14, 2023
Hongpeng Lin, Ludan Ruan, Wenke Xia, Peiyu Liu, Jingyuan Wen, Yixin Xu, Di Hu, Ruihua Song, Wayne Xin Zhao, Qin Jin, Zhiwu Lu

Figure 1 for TikTalk: A Multi-Modal Dialogue Dataset for Real-World Chitchat
Figure 2 for TikTalk: A Multi-Modal Dialogue Dataset for Real-World Chitchat
Figure 3 for TikTalk: A Multi-Modal Dialogue Dataset for Real-World Chitchat
Figure 4 for TikTalk: A Multi-Modal Dialogue Dataset for Real-World Chitchat
Viaarxiv icon

Text2Poster: Laying out Stylized Texts on Retrieved Images

Add code
Bookmark button
Alert button
Jan 06, 2023
Chuhao Jin, Hongteng Xu, Ruihua Song, Zhiwu Lu

Figure 1 for Text2Poster: Laying out Stylized Texts on Retrieved Images
Figure 2 for Text2Poster: Laying out Stylized Texts on Retrieved Images
Figure 3 for Text2Poster: Laying out Stylized Texts on Retrieved Images
Figure 4 for Text2Poster: Laying out Stylized Texts on Retrieved Images
Viaarxiv icon

Translating Text Synopses to Video Storyboards

Add code
Bookmark button
Alert button
Dec 31, 2022
Xu Gu, Yuchong Sun, Feiyue Ni, Shizhe Chen, Ruihua Song, Boyuan Li, Xiang Cao

Figure 1 for Translating Text Synopses to Video Storyboards
Figure 2 for Translating Text Synopses to Video Storyboards
Figure 3 for Translating Text Synopses to Video Storyboards
Figure 4 for Translating Text Synopses to Video Storyboards
Viaarxiv icon

VideoDubber: Machine Translation with Speech-Aware Length Control for Video Dubbing

Add code
Bookmark button
Alert button
Nov 30, 2022
Yihan Wu, Junliang Guo, Xu Tan, Chen Zhang, Bohan Li, Ruihua Song, Lei He, Sheng Zhao, Arul Menezes, Jiang Bian

Figure 1 for VideoDubber: Machine Translation with Speech-Aware Length Control for Video Dubbing
Figure 2 for VideoDubber: Machine Translation with Speech-Aware Length Control for Video Dubbing
Figure 3 for VideoDubber: Machine Translation with Speech-Aware Length Control for Video Dubbing
Figure 4 for VideoDubber: Machine Translation with Speech-Aware Length Control for Video Dubbing
Viaarxiv icon

Long-Form Video-Language Pre-Training with Multimodal Temporal Contrastive Learning

Add code
Bookmark button
Alert button
Oct 12, 2022
Yuchong Sun, Hongwei Xue, Ruihua Song, Bei Liu, Huan Yang, Jianlong Fu

Figure 1 for Long-Form Video-Language Pre-Training with Multimodal Temporal Contrastive Learning
Figure 2 for Long-Form Video-Language Pre-Training with Multimodal Temporal Contrastive Learning
Figure 3 for Long-Form Video-Language Pre-Training with Multimodal Temporal Contrastive Learning
Figure 4 for Long-Form Video-Language Pre-Training with Multimodal Temporal Contrastive Learning
Viaarxiv icon

CLIP-ViP: Adapting Pre-trained Image-Text Model to Video-Language Representation Alignment

Add code
Bookmark button
Alert button
Sep 23, 2022
Hongwei Xue, Yuchong Sun, Bei Liu, Jianlong Fu, Ruihua Song, Houqiang Li, Jiebo Luo

Figure 1 for CLIP-ViP: Adapting Pre-trained Image-Text Model to Video-Language Representation Alignment
Figure 2 for CLIP-ViP: Adapting Pre-trained Image-Text Model to Video-Language Representation Alignment
Figure 3 for CLIP-ViP: Adapting Pre-trained Image-Text Model to Video-Language Representation Alignment
Figure 4 for CLIP-ViP: Adapting Pre-trained Image-Text Model to Video-Language Representation Alignment
Viaarxiv icon

Multi-Modal Experience Inspired AI Creation

Add code
Bookmark button
Alert button
Sep 02, 2022
Qian Cao, Xu Chen, Ruihua Song, Hao Jiang, Guang Yang, Zhao Cao

Figure 1 for Multi-Modal Experience Inspired AI Creation
Figure 2 for Multi-Modal Experience Inspired AI Creation
Figure 3 for Multi-Modal Experience Inspired AI Creation
Figure 4 for Multi-Modal Experience Inspired AI Creation
Viaarxiv icon

Self-supervised Context-aware Style Representation for Expressive Speech Synthesis

Add code
Bookmark button
Alert button
Jun 25, 2022
Yihan Wu, Xi Wang, Shaofei Zhang, Lei He, Ruihua Song, Jian-Yun Nie

Figure 1 for Self-supervised Context-aware Style Representation for Expressive Speech Synthesis
Figure 2 for Self-supervised Context-aware Style Representation for Expressive Speech Synthesis
Figure 3 for Self-supervised Context-aware Style Representation for Expressive Speech Synthesis
Figure 4 for Self-supervised Context-aware Style Representation for Expressive Speech Synthesis
Viaarxiv icon

A Roadmap for Big Model

Add code
Bookmark button
Alert button
Apr 02, 2022
Sha Yuan, Hanyu Zhao, Shuai Zhao, Jiahong Leng, Yangxiao Liang, Xiaozhi Wang, Jifan Yu, Xin Lv, Zhou Shao, Jiaao He, Yankai Lin, Xu Han, Zhenghao Liu, Ning Ding, Yongming Rao, Yizhao Gao, Liang Zhang, Ming Ding, Cong Fang, Yisen Wang, Mingsheng Long, Jing Zhang, Yinpeng Dong, Tianyu Pang, Peng Cui, Lingxiao Huang, Zheng Liang, Huawei Shen, Hui Zhang, Quanshi Zhang, Qingxiu Dong, Zhixing Tan, Mingxuan Wang, Shuo Wang, Long Zhou, Haoran Li, Junwei Bao, Yingwei Pan, Weinan Zhang, Zhou Yu, Rui Yan, Chence Shi, Minghao Xu, Zuobai Zhang, Guoqiang Wang, Xiang Pan, Mengjie Li, Xiaoyu Chu, Zijun Yao, Fangwei Zhu, Shulin Cao, Weicheng Xue, Zixuan Ma, Zhengyan Zhang, Shengding Hu, Yujia Qin, Chaojun Xiao, Zheni Zeng, Ganqu Cui, Weize Chen, Weilin Zhao, Yuan Yao, Peng Li, Wenzhao Zheng, Wenliang Zhao, Ziyi Wang, Borui Zhang, Nanyi Fei, Anwen Hu, Zenan Ling, Haoyang Li, Boxi Cao, Xianpei Han, Weidong Zhan, Baobao Chang, Hao Sun, Jiawen Deng, Chujie Zheng, Juanzi Li, Lei Hou, Xigang Cao, Jidong Zhai, Zhiyuan Liu, Maosong Sun, Jiwen Lu, Zhiwu Lu, Qin Jin, Ruihua Song, Ji-Rong Wen, Zhouchen Lin, Liwei Wang, Hang Su, Jun Zhu, Zhifang Sui, Jiajun Zhang, Yang Liu, Xiaodong He, Minlie Huang, Jian Tang, Jie Tang

Figure 1 for A Roadmap for Big Model
Figure 2 for A Roadmap for Big Model
Figure 3 for A Roadmap for Big Model
Figure 4 for A Roadmap for Big Model
Viaarxiv icon