Alert button
Picture for Qin Jin

Qin Jin

Alert button

TikTalk: A Multi-Modal Dialogue Dataset for Real-World Chitchat

Add code
Bookmark button
Alert button
Jan 14, 2023
Hongpeng Lin, Ludan Ruan, Wenke Xia, Peiyu Liu, Jingyuan Wen, Yixin Xu, Di Hu, Ruihua Song, Wayne Xin Zhao, Qin Jin, Zhiwu Lu

Figure 1 for TikTalk: A Multi-Modal Dialogue Dataset for Real-World Chitchat
Figure 2 for TikTalk: A Multi-Modal Dialogue Dataset for Real-World Chitchat
Figure 3 for TikTalk: A Multi-Modal Dialogue Dataset for Real-World Chitchat
Figure 4 for TikTalk: A Multi-Modal Dialogue Dataset for Real-World Chitchat
Viaarxiv icon

MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation

Add code
Bookmark button
Alert button
Dec 19, 2022
Ludan Ruan, Yiyang Ma, Huan Yang, Huiguo He, Bei Liu, Jianlong Fu, Nicholas Jing Yuan, Qin Jin, Baining Guo

Figure 1 for MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation
Figure 2 for MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation
Figure 3 for MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation
Figure 4 for MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation
Viaarxiv icon

CapEnrich: Enriching Caption Semantics for Web Images via Cross-modal Pre-trained Knowledge

Add code
Bookmark button
Alert button
Nov 17, 2022
Linli Yao, Weijing Chen, Qin Jin

Figure 1 for CapEnrich: Enriching Caption Semantics for Web Images via Cross-modal Pre-trained Knowledge
Figure 2 for CapEnrich: Enriching Caption Semantics for Web Images via Cross-modal Pre-trained Knowledge
Figure 3 for CapEnrich: Enriching Caption Semantics for Web Images via Cross-modal Pre-trained Knowledge
Figure 4 for CapEnrich: Enriching Caption Semantics for Web Images via Cross-modal Pre-trained Knowledge
Viaarxiv icon

Exploring Anchor-based Detection for Ego4D Natural Language Query

Add code
Bookmark button
Alert button
Aug 10, 2022
Sipeng Zheng, Qi Zhang, Bei Liu, Qin Jin, Jianlong Fu

Figure 1 for Exploring Anchor-based Detection for Ego4D Natural Language Query
Figure 2 for Exploring Anchor-based Detection for Ego4D Natural Language Query
Figure 3 for Exploring Anchor-based Detection for Ego4D Natural Language Query
Figure 4 for Exploring Anchor-based Detection for Ego4D Natural Language Query
Viaarxiv icon

Unifying Event Detection and Captioning as Sequence Generation via Pre-Training

Add code
Bookmark button
Alert button
Jul 18, 2022
Qi Zhang, Yuqing Song, Qin Jin

Figure 1 for Unifying Event Detection and Captioning as Sequence Generation via Pre-Training
Figure 2 for Unifying Event Detection and Captioning as Sequence Generation via Pre-Training
Figure 3 for Unifying Event Detection and Captioning as Sequence Generation via Pre-Training
Figure 4 for Unifying Event Detection and Captioning as Sequence Generation via Pre-Training
Viaarxiv icon

TS2-Net: Token Shift and Selection Transformer for Text-Video Retrieval

Add code
Bookmark button
Alert button
Jul 16, 2022
Yuqi Liu, Pengfei Xiong, Luhui Xu, Shengming Cao, Qin Jin

Figure 1 for TS2-Net: Token Shift and Selection Transformer for Text-Video Retrieval
Figure 2 for TS2-Net: Token Shift and Selection Transformer for Text-Video Retrieval
Figure 3 for TS2-Net: Token Shift and Selection Transformer for Text-Video Retrieval
Figure 4 for TS2-Net: Token Shift and Selection Transformer for Text-Video Retrieval
Viaarxiv icon

M3ED: Multi-modal Multi-scene Multi-label Emotional Dialogue Database

Add code
Bookmark button
Alert button
May 09, 2022
Jinming Zhao, Tenggan Zhang, Jingwen Hu, Yuchen Liu, Qin Jin, Xinchao Wang, Haizhou Li

Figure 1 for M3ED: Multi-modal Multi-scene Multi-label Emotional Dialogue Database
Figure 2 for M3ED: Multi-modal Multi-scene Multi-label Emotional Dialogue Database
Figure 3 for M3ED: Multi-modal Multi-scene Multi-label Emotional Dialogue Database
Figure 4 for M3ED: Multi-modal Multi-scene Multi-label Emotional Dialogue Database
Viaarxiv icon

Muskits: an End-to-End Music Processing Toolkit for Singing Voice Synthesis

Add code
Bookmark button
Alert button
May 09, 2022
Jiatong Shi, Shuai Guo, Tao Qian, Nan Huo, Tomoki Hayashi, Yuning Wu, Frank Xu, Xuankai Chang, Huazhe Li, Peter Wu, Shinji Watanabe, Qin Jin

Figure 1 for Muskits: an End-to-End Music Processing Toolkit for Singing Voice Synthesis
Figure 2 for Muskits: an End-to-End Music Processing Toolkit for Singing Voice Synthesis
Figure 3 for Muskits: an End-to-End Music Processing Toolkit for Singing Voice Synthesis
Figure 4 for Muskits: an End-to-End Music Processing Toolkit for Singing Voice Synthesis
Viaarxiv icon

Progressive Learning for Image Retrieval with Hybrid-Modality Queries

Add code
Bookmark button
Alert button
Apr 24, 2022
Yida Zhao, Yuqing Song, Qin Jin

Figure 1 for Progressive Learning for Image Retrieval with Hybrid-Modality Queries
Figure 2 for Progressive Learning for Image Retrieval with Hybrid-Modality Queries
Figure 3 for Progressive Learning for Image Retrieval with Hybrid-Modality Queries
Figure 4 for Progressive Learning for Image Retrieval with Hybrid-Modality Queries
Viaarxiv icon

A Roadmap for Big Model

Add code
Bookmark button
Alert button
Apr 02, 2022
Sha Yuan, Hanyu Zhao, Shuai Zhao, Jiahong Leng, Yangxiao Liang, Xiaozhi Wang, Jifan Yu, Xin Lv, Zhou Shao, Jiaao He, Yankai Lin, Xu Han, Zhenghao Liu, Ning Ding, Yongming Rao, Yizhao Gao, Liang Zhang, Ming Ding, Cong Fang, Yisen Wang, Mingsheng Long, Jing Zhang, Yinpeng Dong, Tianyu Pang, Peng Cui, Lingxiao Huang, Zheng Liang, Huawei Shen, Hui Zhang, Quanshi Zhang, Qingxiu Dong, Zhixing Tan, Mingxuan Wang, Shuo Wang, Long Zhou, Haoran Li, Junwei Bao, Yingwei Pan, Weinan Zhang, Zhou Yu, Rui Yan, Chence Shi, Minghao Xu, Zuobai Zhang, Guoqiang Wang, Xiang Pan, Mengjie Li, Xiaoyu Chu, Zijun Yao, Fangwei Zhu, Shulin Cao, Weicheng Xue, Zixuan Ma, Zhengyan Zhang, Shengding Hu, Yujia Qin, Chaojun Xiao, Zheni Zeng, Ganqu Cui, Weize Chen, Weilin Zhao, Yuan Yao, Peng Li, Wenzhao Zheng, Wenliang Zhao, Ziyi Wang, Borui Zhang, Nanyi Fei, Anwen Hu, Zenan Ling, Haoyang Li, Boxi Cao, Xianpei Han, Weidong Zhan, Baobao Chang, Hao Sun, Jiawen Deng, Chujie Zheng, Juanzi Li, Lei Hou, Xigang Cao, Jidong Zhai, Zhiyuan Liu, Maosong Sun, Jiwen Lu, Zhiwu Lu, Qin Jin, Ruihua Song, Ji-Rong Wen, Zhouchen Lin, Liwei Wang, Hang Su, Jun Zhu, Zhifang Sui, Jiajun Zhang, Yang Liu, Xiaodong He, Minlie Huang, Jian Tang, Jie Tang

Figure 1 for A Roadmap for Big Model
Figure 2 for A Roadmap for Big Model
Figure 3 for A Roadmap for Big Model
Figure 4 for A Roadmap for Big Model
Viaarxiv icon