Huaibo Huang

ViTAR: Vision Transformer with Any Resolution

Mar 28, 2024
Qihang Fan, Quanzeng You, Xiaotian Han, Yongfei Liu, Yunzhe Tao, Huaibo Huang, Ran He, Hongxia Yang

DiffMAC: Diffusion Manifold Hallucination Correction for High Generalization Blind Face Restoration

Mar 15, 2024
Nan Gao, Jia Li, Huaibo Huang, Zhi Zeng, Ke Shang, Shuwu Zhang, Ran He

FakeNewsGPT4: Advancing Multimodal Fake News Detection through Knowledge-Augmented LVLMs

Mar 04, 2024
Xuannan Liu, Peipei Li, Huaibo Huang, Zekun Li, Xing Cui, Jiahao Liang, Lixiong Qin, Weihong Deng, Zhaofeng He

InfiMM-HD: A Leap Forward in High-Resolution Multimodal Understanding

Mar 03, 2024
Haogeng Liu, Quanzeng You, Xiaotian Han, Yiqi Wang, Bohan Zhai, Yongfei Liu, Yunzhe Tao, Huaibo Huang, Ran He, Hongxia Yang

Multimodal Prompt Perceiver: Empower Adaptiveness, Generalizability and Fidelity for All-in-One Image Restoration

Dec 05, 2023
Yuang Ai, Huaibo Huang, Xiaoqiang Zhou, Jiexiang Wang, Ran He

Portrait Diffusion: Training-free Face Stylization with Chain-of-Painting

Dec 03, 2023
Jin Liu, Huaibo Huang, Chao Jin, Ran He

Exploring Straighter Trajectories of Flow Matching with Diffusion Guidance

Nov 28, 2023
Siyu Xing, Jie Cao, Huaibo Huang, Xiao-Yu Zhang, Ran He

InstaStyle: Inversion Noise of a Stylized Image is Secretly a Style Adviser

Nov 25, 2023
Xing Cui, Zekun Li, Pei Pei Li, Huaibo Huang, Zhaofeng He

Video-Teller: Enhancing Cross-Modal Generation with Fusion and Decoupling

Oct 11, 2023
Haogeng Liu, Qihang Fan, Tingkai Liu, Linjie Yang, Yunzhe Tao, Huaibo Huang, Ran He, Hongxia Yang

Video-CSR: Complex Video Digest Creation for Visual-Language Models

Oct 08, 2023
Tingkai Liu, Yunzhe Tao, Haogeng Liu, Qihang Fan, Ding Zhou, Huaibo Huang, Ran He, Hongxia Yang
