Alert button
Picture for Haotian Liu

Haotian Liu

Alert button

Carrier Aggregation Enabled Integrated Sensing and Communication Signal Design and Processing

Sep 25, 2023
Zhiqing Wei, Haotian Liu, Xinyi Yang, Wangjun Jiang, Huici Wu, Xingwang Li, Zhiyong Feng

Viaarxiv icon

An Empirical Study of Scaling Instruct-Tuned Large Multimodal Models

Sep 18, 2023
Yadong Lu, Chunyuan Li, Haotian Liu, Jianwei Yang, Jianfeng Gao, Yelong Shen

Figure 1 for An Empirical Study of Scaling Instruct-Tuned Large Multimodal Models
Figure 2 for An Empirical Study of Scaling Instruct-Tuned Large Multimodal Models
Figure 3 for An Empirical Study of Scaling Instruct-Tuned Large Multimodal Models
Figure 4 for An Empirical Study of Scaling Instruct-Tuned Large Multimodal Models
Viaarxiv icon

Benchmarking and Analyzing Generative Data for Visual Recognition

Jul 25, 2023
Bo Li, Haotian Liu, Liangyu Chen, Yong Jae Lee, Chunyuan Li, Ziwei Liu

Figure 1 for Benchmarking and Analyzing Generative Data for Visual Recognition
Figure 2 for Benchmarking and Analyzing Generative Data for Visual Recognition
Figure 3 for Benchmarking and Analyzing Generative Data for Visual Recognition
Figure 4 for Benchmarking and Analyzing Generative Data for Visual Recognition
Viaarxiv icon

Generate Anything Anywhere in Any Scene

Jun 29, 2023
Yuheng Li, Haotian Liu, Yangming Wen, Yong Jae Lee

Figure 1 for Generate Anything Anywhere in Any Scene
Figure 2 for Generate Anything Anywhere in Any Scene
Figure 3 for Generate Anything Anywhere in Any Scene
Figure 4 for Generate Anything Anywhere in Any Scene
Viaarxiv icon

LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day

Jun 01, 2023
Chunyuan Li, Cliff Wong, Sheng Zhang, Naoto Usuyama, Haotian Liu, Jianwei Yang, Tristan Naumann, Hoifung Poon, Jianfeng Gao

Figure 1 for LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day
Figure 2 for LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day
Figure 3 for LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day
Figure 4 for LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day
Viaarxiv icon

Visual Instruction Tuning

Apr 17, 2023
Haotian Liu, Chunyuan Li, Qingyang Wu, Yong Jae Lee

Figure 1 for Visual Instruction Tuning
Figure 2 for Visual Instruction Tuning
Figure 3 for Visual Instruction Tuning
Figure 4 for Visual Instruction Tuning
Viaarxiv icon

Data-Efficient Image Quality Assessment with Attention-Panel Decoder

Apr 11, 2023
Guanyi Qin, Runze Hu, Yutao Liu, Xiawu Zheng, Haotian Liu, Xiu Li, Yan Zhang

Figure 1 for Data-Efficient Image Quality Assessment with Attention-Panel Decoder
Figure 2 for Data-Efficient Image Quality Assessment with Attention-Panel Decoder
Figure 3 for Data-Efficient Image Quality Assessment with Attention-Panel Decoder
Figure 4 for Data-Efficient Image Quality Assessment with Attention-Panel Decoder
Viaarxiv icon

TMA: Temporal Motion Aggregation for Event-based Optical Flow

Mar 21, 2023
Haotian Liu, Guang Chen, Sanqing Qu, Yanping Zhang, Zhijun Li, Alois Knoll, Changjun Jiang

Figure 1 for TMA: Temporal Motion Aggregation for Event-based Optical Flow
Figure 2 for TMA: Temporal Motion Aggregation for Event-based Optical Flow
Figure 3 for TMA: Temporal Motion Aggregation for Event-based Optical Flow
Figure 4 for TMA: Temporal Motion Aggregation for Event-based Optical Flow
Viaarxiv icon

Learning Customized Visual Models with Retrieval-Augmented Knowledge

Jan 17, 2023
Haotian Liu, Kilho Son, Jianwei Yang, Ce Liu, Jianfeng Gao, Yong Jae Lee, Chunyuan Li

Figure 1 for Learning Customized Visual Models with Retrieval-Augmented Knowledge
Figure 2 for Learning Customized Visual Models with Retrieval-Augmented Knowledge
Figure 3 for Learning Customized Visual Models with Retrieval-Augmented Knowledge
Figure 4 for Learning Customized Visual Models with Retrieval-Augmented Knowledge
Viaarxiv icon

GLIGEN: Open-Set Grounded Text-to-Image Generation

Jan 17, 2023
Yuheng Li, Haotian Liu, Qingyang Wu, Fangzhou Mu, Jianwei Yang, Jianfeng Gao, Chunyuan Li, Yong Jae Lee

Figure 1 for GLIGEN: Open-Set Grounded Text-to-Image Generation
Figure 2 for GLIGEN: Open-Set Grounded Text-to-Image Generation
Figure 3 for GLIGEN: Open-Set Grounded Text-to-Image Generation
Figure 4 for GLIGEN: Open-Set Grounded Text-to-Image Generation
Viaarxiv icon