Alert button

"Image": models, code, and papers
Alert button

Discrete Messages Improve Communication Efficiency among Isolated Intelligent Agents

Dec 26, 2023
Hang Chen, Yuchuan Jang, Weijie Zhou, Cristian meo, Ziwei Chen, Dianbo Liu

Viaarxiv icon

InfoVisDial: An Informative Visual Dialogue Dataset by Bridging Large Multimodal and Language Models

Dec 21, 2023
Bingbing Wen, Zhengyuan Yang, Jianfeng Wang, Zhe Gan, Bill Howe, Lijuan Wang

Viaarxiv icon

HyperDID: Hyperspectral Intrinsic Image Decomposition with Deep Feature Embedding

Nov 25, 2023
Zhiqiang Gong, Xian Zhou, Wen Yao, Xiaohu Zheng, Ping Zhong

Viaarxiv icon

HMP: Hand Motion Priors for Pose and Shape Estimation from Video

Dec 27, 2023
Enes Duran, Muhammed Kocabas, Vasileios Choutas, Zicong Fan, Michael J. Black

Viaarxiv icon

Enhancing Object Coherence in Layout-to-Image Synthesis

Nov 25, 2023
Yibin Wang, Weizhong Zhang, Jianwei Zheng, Cheng Jin

Viaarxiv icon

SA$^2$VP: Spatially Aligned-and-Adapted Visual Prompt

Dec 16, 2023
Wenjie Pei, Tongqi Xia, Fanglin Chen, Jinsong Li, Jiandong Tian, Guangming Lu

Viaarxiv icon

One Model to Rule them All: Towards Universal Segmentation for Medical Images with Text Prompts

Dec 28, 2023
Ziheng Zhao, Yao Zhang, Chaoyi Wu, Xiaoman Zhang, Ya Zhang, Yanfeng Wang, Weidi Xie

Viaarxiv icon

Replica Tree-based Federated Learning using Limited Data

Dec 28, 2023
Ramona Ghilea, Islem Rekik

Viaarxiv icon

A Dual-way Enhanced Framework from Text Matching Point of View for Multimodal Entity Linking

Dec 19, 2023
Shezheng Song, Shan Zhao, Chengyu Wang, Tianwei Yan, Shasha Li, Xiaoguang Mao, Meng Wang

Viaarxiv icon

Pre-trained Language Models Do Not Help Auto-regressive Text-to-Image Generation

Nov 27, 2023
Yuhui Zhang, Brandon McKinzie, Zhe Gan, Vaishaal Shankar, Alexander Toshev

Figure 1 for Pre-trained Language Models Do Not Help Auto-regressive Text-to-Image Generation
Figure 2 for Pre-trained Language Models Do Not Help Auto-regressive Text-to-Image Generation
Figure 3 for Pre-trained Language Models Do Not Help Auto-regressive Text-to-Image Generation
Figure 4 for Pre-trained Language Models Do Not Help Auto-regressive Text-to-Image Generation
Viaarxiv icon