Alert button
Picture for Runhui Huang

Runhui Huang

Alert button

LayerDiff: Exploring Text-guided Multi-layered Composable Image Synthesis via Layer-Collaborative Diffusion Model

Add code
Bookmark button
Alert button
Mar 18, 2024
Runhui Huang, Kaixin Cai, Jianhua Han, Xiaodan Liang, Renjing Pei, Guansong Lu, Songcen Xu, Wei Zhang, Hang Xu

Figure 1 for LayerDiff: Exploring Text-guided Multi-layered Composable Image Synthesis via Layer-Collaborative Diffusion Model
Figure 2 for LayerDiff: Exploring Text-guided Multi-layered Composable Image Synthesis via Layer-Collaborative Diffusion Model
Figure 3 for LayerDiff: Exploring Text-guided Multi-layered Composable Image Synthesis via Layer-Collaborative Diffusion Model
Figure 4 for LayerDiff: Exploring Text-guided Multi-layered Composable Image Synthesis via Layer-Collaborative Diffusion Model
Viaarxiv icon

GrowCLIP: Data-aware Automatic Model Growing for Large-scale Contrastive Language-Image Pre-training

Add code
Bookmark button
Alert button
Aug 22, 2023
Xinchi Deng, Han Shi, Runhui Huang, Changlin Li, Hang Xu, Jianhua Han, James Kwok, Shen Zhao, Wei Zhang, Xiaodan Liang

Figure 1 for GrowCLIP: Data-aware Automatic Model Growing for Large-scale Contrastive Language-Image Pre-training
Figure 2 for GrowCLIP: Data-aware Automatic Model Growing for Large-scale Contrastive Language-Image Pre-training
Figure 3 for GrowCLIP: Data-aware Automatic Model Growing for Large-scale Contrastive Language-Image Pre-training
Figure 4 for GrowCLIP: Data-aware Automatic Model Growing for Large-scale Contrastive Language-Image Pre-training
Viaarxiv icon

DiffDis: Empowering Generative Diffusion Model with Cross-Modal Discrimination Capability

Add code
Bookmark button
Alert button
Aug 18, 2023
Runhui Huang, Jianhua Han, Guansong Lu, Xiaodan Liang, Yihan Zeng, Wei Zhang, Hang Xu

Figure 1 for DiffDis: Empowering Generative Diffusion Model with Cross-Modal Discrimination Capability
Figure 2 for DiffDis: Empowering Generative Diffusion Model with Cross-Modal Discrimination Capability
Figure 3 for DiffDis: Empowering Generative Diffusion Model with Cross-Modal Discrimination Capability
Figure 4 for DiffDis: Empowering Generative Diffusion Model with Cross-Modal Discrimination Capability
Viaarxiv icon

UniDiff: Advancing Vision-Language Models with Generative and Discriminative Learning

Add code
Bookmark button
Alert button
Jun 01, 2023
Xiao Dong, Runhui Huang, Xiaoyong Wei, Zequn Jie, Jianxing Yu, Jian Yin, Xiaodan Liang

Figure 1 for UniDiff: Advancing Vision-Language Models with Generative and Discriminative Learning
Figure 2 for UniDiff: Advancing Vision-Language Models with Generative and Discriminative Learning
Figure 3 for UniDiff: Advancing Vision-Language Models with Generative and Discriminative Learning
Figure 4 for UniDiff: Advancing Vision-Language Models with Generative and Discriminative Learning
Viaarxiv icon

Boosting Visual-Language Models by Exploiting Hard Samples

Add code
Bookmark button
Alert button
May 09, 2023
Haonan Wang, Minbin Huang, Runhui Huang, Lanqing Hong, Hang Xu, Tianyang Hu, Xiaodan Liang, Zhenguo Li

Figure 1 for Boosting Visual-Language Models by Exploiting Hard Samples
Figure 2 for Boosting Visual-Language Models by Exploiting Hard Samples
Figure 3 for Boosting Visual-Language Models by Exploiting Hard Samples
Figure 4 for Boosting Visual-Language Models by Exploiting Hard Samples
Viaarxiv icon

NLIP: Noise-robust Language-Image Pre-training

Add code
Bookmark button
Alert button
Jan 04, 2023
Runhui Huang, Yanxin Long, Jianhua Han, Hang Xu, Xiwen Liang, Chunjing Xu, Xiaodan Liang

Figure 1 for NLIP: Noise-robust Language-Image Pre-training
Figure 2 for NLIP: Noise-robust Language-Image Pre-training
Figure 3 for NLIP: Noise-robust Language-Image Pre-training
Figure 4 for NLIP: Noise-robust Language-Image Pre-training
Viaarxiv icon

P$^3$OVD: Fine-grained Visual-Text Prompt-Driven Self-Training for Open-Vocabulary Object Detection

Add code
Bookmark button
Alert button
Nov 02, 2022
Yanxin Long, Jianhua Han, Runhui Huang, Xu Hang, Yi Zhu, Chunjing Xu, Xiaodan Liang

Figure 1 for P$^3$OVD: Fine-grained Visual-Text Prompt-Driven Self-Training for Open-Vocabulary Object Detection
Figure 2 for P$^3$OVD: Fine-grained Visual-Text Prompt-Driven Self-Training for Open-Vocabulary Object Detection
Figure 3 for P$^3$OVD: Fine-grained Visual-Text Prompt-Driven Self-Training for Open-Vocabulary Object Detection
Figure 4 for P$^3$OVD: Fine-grained Visual-Text Prompt-Driven Self-Training for Open-Vocabulary Object Detection
Viaarxiv icon

Wukong: 100 Million Large-scale Chinese Cross-modal Pre-training Dataset and A Foundation Framework

Add code
Bookmark button
Alert button
Mar 10, 2022
Jiaxi Gu, Xiaojun Meng, Guansong Lu, Lu Hou, Minzhe Niu, Xiaodan Liang, Lewei Yao, Runhui Huang, Wei Zhang, Xin Jiang, Chunjing Xu, Hang Xu

Figure 1 for Wukong: 100 Million Large-scale Chinese Cross-modal Pre-training Dataset and A Foundation Framework
Figure 2 for Wukong: 100 Million Large-scale Chinese Cross-modal Pre-training Dataset and A Foundation Framework
Figure 3 for Wukong: 100 Million Large-scale Chinese Cross-modal Pre-training Dataset and A Foundation Framework
Figure 4 for Wukong: 100 Million Large-scale Chinese Cross-modal Pre-training Dataset and A Foundation Framework
Viaarxiv icon

FILIP: Fine-grained Interactive Language-Image Pre-Training

Add code
Bookmark button
Alert button
Nov 09, 2021
Lewei Yao, Runhui Huang, Lu Hou, Guansong Lu, Minzhe Niu, Hang Xu, Xiaodan Liang, Zhenguo Li, Xin Jiang, Chunjing Xu

Figure 1 for FILIP: Fine-grained Interactive Language-Image Pre-Training
Figure 2 for FILIP: Fine-grained Interactive Language-Image Pre-Training
Figure 3 for FILIP: Fine-grained Interactive Language-Image Pre-Training
Figure 4 for FILIP: Fine-grained Interactive Language-Image Pre-Training
Viaarxiv icon