Haogeng Liu

InfiMM-HD: A Leap Forward in High-Resolution Multimodal Understanding

Mar 03, 2024
Haogeng Liu, Quanzeng You, Xiaotian Han, Yiqi Wang, Bohan Zhai, Yongfei Liu, Yunzhe Tao, Huaibo Huang, Ran He, Hongxia Yang

Via arXiv

Video-Teller: Enhancing Cross-Modal Generation with Fusion and Decoupling

Oct 11, 2023
Haogeng Liu, Qihang Fan, Tingkai Liu, Linjie Yang, Yunzhe Tao, Huaibo Huang, Ran He, Hongxia Yang

Via arXiv

Video-CSR: Complex Video Digest Creation for Visual-Language Models

Oct 08, 2023
Tingkai Liu, Yunzhe Tao, Haogeng Liu, Qihang Fan, Ding Zhou, Huaibo Huang, Ran He, Hongxia Yang

Via arXiv

Boosting Fast and High-Quality Speech Synthesis with Linear Diffusion

Jun 12, 2023
Haogeng Liu, Tao Wang, Jie Cao, Ran He, Jianhua Tao

Via arXiv

UnifySpeech: A Unified Framework for Zero-shot Text-to-Speech and Voice Conversion

Jan 10, 2023
Haogeng Liu, Tao Wang, Ruibo Fu, Jiangyan Yi, Zhengqi Wen, Jianhua Tao

Via arXiv