Alert button
Picture for Zhi-Qi Cheng

Zhi-Qi Cheng

Alert button

MIPS at SemEval-2024 Task 3: Multimodal Emotion-Cause Pair Extraction in Conversations with Multimodal Language Models

Add code
Bookmark button
Alert button
Apr 11, 2024
Zebang Cheng, Fuqiang Niu, Yuxiang Lin, Zhi-Qi Cheng, Bowen Zhang, Xiaojiang Peng

Viaarxiv icon

IVAC-P2L: Leveraging Irregular Repetition Priors for Improving Video Action Counting

Add code
Bookmark button
Alert button
Mar 20, 2024
Hang Wang, Zhi-Qi Cheng, Youtian Du, Lei Zhang

Figure 1 for IVAC-P2L: Leveraging Irregular Repetition Priors for Improving Video Action Counting
Figure 2 for IVAC-P2L: Leveraging Irregular Repetition Priors for Improving Video Action Counting
Figure 3 for IVAC-P2L: Leveraging Irregular Repetition Priors for Improving Video Action Counting
Figure 4 for IVAC-P2L: Leveraging Irregular Repetition Priors for Improving Video Action Counting
Viaarxiv icon

DyRoNet: A Low-Rank Adapter Enhanced Dynamic Routing Network for Streaming Perception

Add code
Bookmark button
Alert button
Mar 15, 2024
Xiang Huang, Zhi-Qi Cheng, Jun-Yan He, Chenyang Li, Wangmeng Xiang, Baigui Sun, Xiao Wu

Figure 1 for DyRoNet: A Low-Rank Adapter Enhanced Dynamic Routing Network for Streaming Perception
Figure 2 for DyRoNet: A Low-Rank Adapter Enhanced Dynamic Routing Network for Streaming Perception
Figure 3 for DyRoNet: A Low-Rank Adapter Enhanced Dynamic Routing Network for Streaming Perception
Figure 4 for DyRoNet: A Low-Rank Adapter Enhanced Dynamic Routing Network for Streaming Perception
Viaarxiv icon

FaceChain-ImagineID: Freely Crafting High-Fidelity Diverse Talking Faces from Disentangled Audio

Add code
Bookmark button
Alert button
Mar 04, 2024
Chao Xu, Yang Liu, Jiazheng Xing, Weida Wang, Mingze Sun, Jun Dan, Tianxin Huang, Siyuan Li, Zhi-Qi Cheng, Ying Tai, Baigui Sun

Figure 1 for FaceChain-ImagineID: Freely Crafting High-Fidelity Diverse Talking Faces from Disentangled Audio
Figure 2 for FaceChain-ImagineID: Freely Crafting High-Fidelity Diverse Talking Faces from Disentangled Audio
Figure 3 for FaceChain-ImagineID: Freely Crafting High-Fidelity Diverse Talking Faces from Disentangled Audio
Figure 4 for FaceChain-ImagineID: Freely Crafting High-Fidelity Diverse Talking Faces from Disentangled Audio
Viaarxiv icon

WordArt Designer API: User-Driven Artistic Typography Synthesis with Large Language Models on ModelScope

Add code
Bookmark button
Alert button
Jan 12, 2024
Jun-Yan He, Zhi-Qi Cheng, Chenyang Li, Jingdong Sun, Wangmeng Xiang, Yusen Hu, Xianhui Lin, Xiaoyang Kang, Zengke Jin, Bin Luo, Yifeng Geng, Xuansong Xie, Jingren Zhou

Viaarxiv icon

Tracking with Human-Intent Reasoning

Add code
Bookmark button
Alert button
Dec 29, 2023
Jiawen Zhu, Zhi-Qi Cheng, Jun-Yan He, Chenyang Li, Bin Luo, Huchuan Lu, Yifeng Geng, Xuansong Xie

Viaarxiv icon

ProS: Prompting-to-simulate Generalized knowledge for Universal Cross-Domain Retrieval

Add code
Bookmark button
Alert button
Dec 19, 2023
Kaipeng Fang, Jingkuan Song, Lianli Gao, Pengpeng Zeng, Zhi-Qi Cheng, Xiyao Li, Heng Tao Shen

Viaarxiv icon

MotionEditor: Editing Video Motion via Content-Aware Diffusion

Add code
Bookmark button
Alert button
Nov 30, 2023
Shuyuan Tu, Qi Dai, Zhi-Qi Cheng, Han Hu, Xintong Han, Zuxuan Wu, Yu-Gang Jiang

Viaarxiv icon