Alert button

"Text": models, code, and papers
Alert button

RAPHAEL: Text-to-Image Generation via Large Mixture of Diffusion Paths

May 29, 2023
Zeyue Xue, Guanglu Song, Qiushan Guo, Boxiao Liu, Zhuofan Zong, Yu Liu, Ping Luo

Figure 1 for RAPHAEL: Text-to-Image Generation via Large Mixture of Diffusion Paths
Figure 2 for RAPHAEL: Text-to-Image Generation via Large Mixture of Diffusion Paths
Figure 3 for RAPHAEL: Text-to-Image Generation via Large Mixture of Diffusion Paths
Figure 4 for RAPHAEL: Text-to-Image Generation via Large Mixture of Diffusion Paths
Viaarxiv icon

CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model

May 23, 2023
Shuai Zhao, Xiaohan Wang, Linchao Zhu, Yi Yang

Figure 1 for CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model
Figure 2 for CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model
Figure 3 for CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model
Figure 4 for CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model
Viaarxiv icon

Tiny LVLM-eHub: Early Multimodal Experiments with Bard

Aug 07, 2023
Wenqi Shao, Yutao Hu, Peng Gao, Meng Lei, Kaipeng Zhang, Fanqing Meng, Peng Xu, Siyuan Huang, Hongsheng Li, Yu Qiao, Ping Luo

Figure 1 for Tiny LVLM-eHub: Early Multimodal Experiments with Bard
Figure 2 for Tiny LVLM-eHub: Early Multimodal Experiments with Bard
Figure 3 for Tiny LVLM-eHub: Early Multimodal Experiments with Bard
Figure 4 for Tiny LVLM-eHub: Early Multimodal Experiments with Bard
Viaarxiv icon

RepCL: Exploring Effective Representation for Continual Text Classification

May 12, 2023
Yifan Song, Peiyi Wang, Dawei Zhu, Tianyu Liu, Zhifang Sui, Sujian Li

Figure 1 for RepCL: Exploring Effective Representation for Continual Text Classification
Figure 2 for RepCL: Exploring Effective Representation for Continual Text Classification
Figure 3 for RepCL: Exploring Effective Representation for Continual Text Classification
Figure 4 for RepCL: Exploring Effective Representation for Continual Text Classification
Viaarxiv icon

A Neural Space-Time Representation for Text-to-Image Personalization

May 24, 2023
Yuval Alaluf, Elad Richardson, Gal Metzer, Daniel Cohen-Or

Figure 1 for A Neural Space-Time Representation for Text-to-Image Personalization
Figure 2 for A Neural Space-Time Representation for Text-to-Image Personalization
Figure 3 for A Neural Space-Time Representation for Text-to-Image Personalization
Figure 4 for A Neural Space-Time Representation for Text-to-Image Personalization
Viaarxiv icon

Unsupervised Improvement of Audio-Text Cross-Modal Representations

May 05, 2023
Zhepei Wang, Cem Subakan, Krishna Subramani, Junkai Wu, Tiago Tavares, Fabio Ayres, Paris Smaragdis

Figure 1 for Unsupervised Improvement of Audio-Text Cross-Modal Representations
Figure 2 for Unsupervised Improvement of Audio-Text Cross-Modal Representations
Figure 3 for Unsupervised Improvement of Audio-Text Cross-Modal Representations
Figure 4 for Unsupervised Improvement of Audio-Text Cross-Modal Representations
Viaarxiv icon

PUMGPT: A Large Vision-Language Model for Product Understanding

Aug 18, 2023
Shuhui Wu, Zengming Tang, Zongyi Guo, Weiwei Zhang, Baoliang Cui, Haihong Tang, Weiming Lu

Figure 1 for PUMGPT: A Large Vision-Language Model for Product Understanding
Figure 2 for PUMGPT: A Large Vision-Language Model for Product Understanding
Figure 3 for PUMGPT: A Large Vision-Language Model for Product Understanding
Figure 4 for PUMGPT: A Large Vision-Language Model for Product Understanding
Viaarxiv icon

Language-Guided Diffusion Model for Visual Grounding

Aug 18, 2023
Sijia Chen, Baochun Li

Figure 1 for Language-Guided Diffusion Model for Visual Grounding
Figure 2 for Language-Guided Diffusion Model for Visual Grounding
Figure 3 for Language-Guided Diffusion Model for Visual Grounding
Figure 4 for Language-Guided Diffusion Model for Visual Grounding
Viaarxiv icon

Accelerated materials language processing enabled by GPT

Aug 18, 2023
Jaewoong Choi, Byungju Lee

Figure 1 for Accelerated materials language processing enabled by GPT
Figure 2 for Accelerated materials language processing enabled by GPT
Figure 3 for Accelerated materials language processing enabled by GPT
Figure 4 for Accelerated materials language processing enabled by GPT
Viaarxiv icon

A Frustratingly Simple Decoding Method for Neural Text Generation

May 22, 2023
Haoran Yang, Deng Cai, Huayang Li, Wei Bi, Wai Lam, Shuming Shi

Figure 1 for A Frustratingly Simple Decoding Method for Neural Text Generation
Figure 2 for A Frustratingly Simple Decoding Method for Neural Text Generation
Figure 3 for A Frustratingly Simple Decoding Method for Neural Text Generation
Figure 4 for A Frustratingly Simple Decoding Method for Neural Text Generation
Viaarxiv icon