Gaoang Wang

FlexiFilm: Long Video Generation with Flexible Conditions

Apr 29, 2024
Yichen Ouyang, Jianhao Yuan, Hao Zhao, Gaoang Wang, Bo Zhao

MovieChat+: Question-aware Sparse Memory for Long Video Question Answering

Apr 26, 2024
Enxin Song, Wenhao Chai, Tian Ye, Jenq-Neng Hwang, Xi Li, Gaoang Wang

Do We Really Need a Complex Agent System? Distill Embodied Agent into a Single Model

Apr 06, 2024
Zhonghan Zhao, Ke Ma, Wenhao Chai, Xuan Wang, Kewei Chen, Dongxu Guo, Yanting Zhang, Hongwei Wang, Gaoang Wang

VersaT2I: Improving Text-to-Image Models with Versatile Reward

Mar 27, 2024
Jianshu Guo, Wenhao Chai, Jie Deng, Hsiang-Wei Huang, Tian Ye, Yichen Xu, Jiawei Zhang, Jenq-Neng Hwang, Gaoang Wang

Hierarchical Auto-Organizing System for Open-Ended Multi-Agent Navigation

Mar 18, 2024
Zhonghan Zhao, Kewei Chen, Dongxu Guo, Wenhao Chai, Tian Ye, Yanting Zhang, Gaoang Wang

MedM2G: Unifying Medical Multi-Modal Generation via Cross-Guided Diffusion with Visual Invariant

Mar 07, 2024
Chenlu Zhan, Yu Lin, Gaoang Wang, Hongwei Wang, Jian Wu

Divide and Conquer for Large Language Models Reasoning

Jan 10, 2024
Zijie Meng, Yan Zhang, Zhaopeng Feng, Yang Feng, Gaoang Wang, Joey Tianyi Zhou, Jian Wu, Zuozhu Liu

UniDCP: Unifying Multiple Medical Vision-language Tasks via Dynamic Cross-modal Learnable Prompts

Dec 18, 2023
Chenlu Zhan, Yufei Zhang, Yu Lin, Gaoang Wang, Hongwei Wang

User-Aware Prefix-Tuning is a Good Learner for Personalized Image Captioning

Dec 08, 2023
Xuan Wang, Guanhong Wang, Wenhao Chai, Jiayu Zhou, Gaoang Wang

See and Think: Embodied Agent in Virtual Environment

Dec 03, 2023
Zhonghan Zhao, Wenhao Chai, Xuan Wang, Li Boyi, Shengyu Hao, Shidong Cao, Tian Ye, Jenq-Neng Hwang, Gaoang Wang
