Alert button
Picture for Yuqing Song

Yuqing Song

Alert button

Renmin University of China

Accommodating Audio Modality in CLIP for Multimodal Processing

Add code
Bookmark button
Alert button
Mar 12, 2023
Ludan Ruan, Anwen Hu, Yuqing Song, Liang Zhang, Sipeng Zheng, Qin Jin

Figure 1 for Accommodating Audio Modality in CLIP for Multimodal Processing
Figure 2 for Accommodating Audio Modality in CLIP for Multimodal Processing
Figure 3 for Accommodating Audio Modality in CLIP for Multimodal Processing
Figure 4 for Accommodating Audio Modality in CLIP for Multimodal Processing
Viaarxiv icon

Unifying Event Detection and Captioning as Sequence Generation via Pre-Training

Add code
Bookmark button
Alert button
Jul 18, 2022
Qi Zhang, Yuqing Song, Qin Jin

Figure 1 for Unifying Event Detection and Captioning as Sequence Generation via Pre-Training
Figure 2 for Unifying Event Detection and Captioning as Sequence Generation via Pre-Training
Figure 3 for Unifying Event Detection and Captioning as Sequence Generation via Pre-Training
Figure 4 for Unifying Event Detection and Captioning as Sequence Generation via Pre-Training
Viaarxiv icon

Some theoretical results on discrete contour trees

Add code
Bookmark button
Alert button
Jun 24, 2022
Yuqing Song

Figure 1 for Some theoretical results on discrete contour trees
Figure 2 for Some theoretical results on discrete contour trees
Viaarxiv icon

Progressive Learning for Image Retrieval with Hybrid-Modality Queries

Add code
Bookmark button
Alert button
Apr 24, 2022
Yida Zhao, Yuqing Song, Qin Jin

Figure 1 for Progressive Learning for Image Retrieval with Hybrid-Modality Queries
Figure 2 for Progressive Learning for Image Retrieval with Hybrid-Modality Queries
Figure 3 for Progressive Learning for Image Retrieval with Hybrid-Modality Queries
Figure 4 for Progressive Learning for Image Retrieval with Hybrid-Modality Queries
Viaarxiv icon

Product-oriented Machine Translation with Cross-modal Cross-lingual Pre-training

Add code
Bookmark button
Alert button
Aug 25, 2021
Yuqing Song, Shizhe Chen, Qin Jin, Wei Luo, Jun Xie, Fei Huang

Figure 1 for Product-oriented Machine Translation with Cross-modal Cross-lingual Pre-training
Figure 2 for Product-oriented Machine Translation with Cross-modal Cross-lingual Pre-training
Figure 3 for Product-oriented Machine Translation with Cross-modal Cross-lingual Pre-training
Figure 4 for Product-oriented Machine Translation with Cross-modal Cross-lingual Pre-training
Viaarxiv icon

Team RUC_AIM3 Technical Report at ActivityNet 2021: Entities Object Localization

Add code
Bookmark button
Alert button
Jun 11, 2021
Ludan Ruan, Jieting Chen, Yuqing Song, Shizhe Chen, Qin Jin

Figure 1 for Team RUC_AIM3 Technical Report at ActivityNet 2021: Entities Object Localization
Figure 2 for Team RUC_AIM3 Technical Report at ActivityNet 2021: Entities Object Localization
Figure 3 for Team RUC_AIM3 Technical Report at ActivityNet 2021: Entities Object Localization
Figure 4 for Team RUC_AIM3 Technical Report at ActivityNet 2021: Entities Object Localization
Viaarxiv icon

Towards Diverse Paragraph Captioning for Untrimmed Videos

Add code
Bookmark button
Alert button
May 30, 2021
Yuqing Song, Shizhe Chen, Qin Jin

Figure 1 for Towards Diverse Paragraph Captioning for Untrimmed Videos
Figure 2 for Towards Diverse Paragraph Captioning for Untrimmed Videos
Figure 3 for Towards Diverse Paragraph Captioning for Untrimmed Videos
Figure 4 for Towards Diverse Paragraph Captioning for Untrimmed Videos
Viaarxiv icon

WenLan: Bridging Vision and Language by Large-Scale Multi-Modal Pre-Training

Add code
Bookmark button
Alert button
Mar 19, 2021
Yuqi Huo, Manli Zhang, Guangzhen Liu, Haoyu Lu, Yizhao Gao, Guoxing Yang, Jingyuan Wen, Heng Zhang, Baogui Xu, Weihao Zheng, Zongzheng Xi, Yueqian Yang, Anwen Hu, Jinming Zhao, Ruichen Li, Yida Zhao, Liang Zhang, Yuqing Song, Xin Hong, Wanqing Cui, Danyang Hou, Yingyan Li, Junyi Li, Peiyu Liu, Zheng Gong, Chuhao Jin, Yuchong Sun, Shizhe Chen, Zhiwu Lu, Zhicheng Dou, Qin Jin, Yanyan Lan, Wayne Xin Zhao, Ruihua Song, Ji-Rong Wen

Figure 1 for WenLan: Bridging Vision and Language by Large-Scale Multi-Modal Pre-Training
Figure 2 for WenLan: Bridging Vision and Language by Large-Scale Multi-Modal Pre-Training
Figure 3 for WenLan: Bridging Vision and Language by Large-Scale Multi-Modal Pre-Training
Figure 4 for WenLan: Bridging Vision and Language by Large-Scale Multi-Modal Pre-Training
Viaarxiv icon