Jinfa Huang

MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators

Apr 07, 2024
Shenghai Yuan, Jinfa Huang, Yujun Shi, Yongqi Xu, Ruijie Zhu, Bin Lin, Xinhua Cheng, Li Yuan, Jiebo Luo

LLMBind: A Unified Modality-Task Integration Framework

Mar 08, 2024
Bin Zhu, Peng Jin, Munan Ning, Bin Lin, Jinfa Huang, Qi Song, Jiaxi Cui, Junwu Zhang, Zhenyu Tang, Mingjun Pan, Xing Zhou, Li Yuan

MoE-LLaVA: Mixture of Experts for Large Vision-Language Models

Feb 04, 2024
Bin Lin, Zhenyu Tang, Yang Ye, Jiaxi Cui, Bin Zhu, Peng Jin, Jinfa Huang, Junwu Zhang, Munan Ning, Li Yuan

Continuous-Multiple Image Outpainting in One-Step via Positional Query and A Diffusion-based Approach

Jan 28, 2024
Shaofeng Zhang, Jinfa Huang, Qiang Zhou, Zhibin Wang, Fan Wang, Jiebo Luo, Junchi Yan

GPT-4V(ision) as A Social Media Analysis Engine

Nov 13, 2023
Hanjia Lyu, Jinfa Huang, Daoan Zhang, Yongsheng Yu, Xinyi Mou, Jinsheng Pan, Zhengyuan Yang, Zhongyu Wei, Jiebo Luo

Improving Scene Graph Generation with Superpixel-Based Interaction Learning

Aug 04, 2023
Jingyi Wang, Can Zhang, Jinfa Huang, Botao Ren, Zhidong Deng

Text-Video Retrieval with Disentangled Conceptualization and Set-to-Set Alignment

May 20, 2023
Peng Jin, Hao Li, Zesen Cheng, Jinfa Huang, Zhennan Wang, Li Yuan, Chang Liu, Jie Chen

Cross-Modality Time-Variant Relation Learning for Generating Dynamic Scene Graphs

May 15, 2023
Jingyi Wang, Jinfa Huang, Can Zhang, Zhidong Deng

Video-Text as Game Players: Hierarchical Banzhaf Interaction for Cross-Modal Representation Learning

Mar 25, 2023
Peng Jin, Jinfa Huang, Pengfei Xiong, Shangxuan Tian, Chang Liu, Xiangyang Ji, Li Yuan, Jie Chen
