Alert button
Picture for Yu-Gang Jiang

Yu-Gang Jiang

Alert button

Instruction-Guided Scene Text Recognition

Jan 31, 2024
Yongkun Du, Zhineng Chen, Yuchen Su, Caiyan Jia, Yu-Gang Jiang

Viaarxiv icon

MouSi: Poly-Visual-Expert Vision-Language Models

Jan 30, 2024
Xiaoran Fan, Tao Ji, Changhao Jiang, Shuo Li, Senjie Jin, Sirui Song, Junke Wang, Boyang Hong, Lu Chen, Guodong Zheng, Ming Zhang, Caishuang Huang, Rui Zheng, Zhiheng Xi, Yuhao Zhou, Shihan Dou, Junjie Ye, Hang Yan, Tao Gui, Qi Zhang, Xipeng Qiu, Xuanjing Huang, Zuxuan Wu, Yu-Gang Jiang

Viaarxiv icon

Multi-Trigger Backdoor Attacks: More Triggers, More Threats

Jan 27, 2024
Yige Li, Xingjun Ma, Jiabo He, Hanxun Huang, Yu-Gang Jiang

Viaarxiv icon

Secrets of RLHF in Large Language Models Part II: Reward Modeling

Jan 12, 2024
Binghai Wang, Rui Zheng, Lu Chen, Yan Liu, Shihan Dou, Caishuang Huang, Wei Shen, Senjie Jin, Enyu Zhou, Chenyu Shi, Songyang Gao, Nuo Xu, Yuhao Zhou, Xiaoran Fan, Zhiheng Xi, Jun Zhao, Xiao Wang, Tao Ji, Hang Yan, Lixing Shen, Zhan Chen, Tao Gui, Qi Zhang, Xipeng Qiu, Xuanjing Huang, Zuxuan Wu, Yu-Gang Jiang

Viaarxiv icon

SonicVisionLM: Playing Sound with Vision Language Models

Jan 09, 2024
Zhifeng Xie, Shengye Yu, Mengtian Li, Qile He, Chaofeng Chen, Yu-Gang Jiang

Viaarxiv icon

FoodLMM: A Versatile Food Assistant using Large Multi-modal Model

Dec 22, 2023
Yuehao Yin, Huiyan Qi, Bin Zhu, Jingjing Chen, Yu-Gang Jiang, Chong-Wah Ngo

Viaarxiv icon

Instance-aware Multi-Camera 3D Object Detection with Structural Priors Mining and Self-Boosting Learning

Dec 13, 2023
Yang Jiao, Zequn Jie, Shaoxiang Chen, Lechao Cheng, Jingjing Chen, Lin Ma, Yu-Gang Jiang

Viaarxiv icon

VideoAssembler: Identity-Consistent Video Generation with Reference Entities using Diffusion Model

Dec 01, 2023
Haoyu Zhao, Tianyi Lu, Jiaxi Gu, Xing Zhang, Zuxuan Wu, Hang Xu, Yu-Gang Jiang

Viaarxiv icon

VIDiff: Translating Videos via Multi-Modal Instructions with Diffusion Models

Nov 30, 2023
Zhen Xing, Qi Dai, Zihao Zhang, Hui Zhang, Han Hu, Zuxuan Wu, Yu-Gang Jiang

Viaarxiv icon

MotionEditor: Editing Video Motion via Content-Aware Diffusion

Nov 30, 2023
Shuyuan Tu, Qi Dai, Zhi-Qi Cheng, Han Hu, Xintong Han, Zuxuan Wu, Yu-Gang Jiang

Viaarxiv icon