Alert button

"Text": models, code, and papers
Alert button

Fast Text-to-3D-Aware Face Generation and Manipulation via Direct Cross-modal Mapping and Geometric Regularization

Mar 11, 2024
Jinlu Zhang, Yiyi Zhou, Qiancheng Zheng, Xiaoxiong Du, Gen Luo, Jun Peng, Xiaoshuai Sun, Rongrong Ji

Viaarxiv icon

DivCon: Divide and Conquer for Progressive Text-to-Image Generation

Mar 11, 2024
Yuhao Jia, Wenhan Tan

Viaarxiv icon

Controllable Generation with Text-to-Image Diffusion Models: A Survey

Mar 07, 2024
Pu Cao, Feng Zhou, Qing Song, Lu Yang

Viaarxiv icon

PET-SQL: A Prompt-enhanced Two-stage Text-to-SQL Framework with Cross-consistency

Mar 13, 2024
Zhishuai Li, Xiang Wang, Jingjing Zhao, Sun Yang, Guoqing Du, Xiaoru Hu, Bin Zhang, Yuxiao Ye, Ziyue Li, Rui Zhao, Hangyu Mao

Viaarxiv icon

Evaluating Text-to-Image Generative Models: An Empirical Study on Human Image Synthesis

Mar 08, 2024
Muxi Chen, Yi Liu, Jian Yi, Changran Xu, Qiuxia Lai, Hongliang Wang, Tsung-Yi Ho, Qiang Xu

Figure 1 for Evaluating Text-to-Image Generative Models: An Empirical Study on Human Image Synthesis
Figure 2 for Evaluating Text-to-Image Generative Models: An Empirical Study on Human Image Synthesis
Figure 3 for Evaluating Text-to-Image Generative Models: An Empirical Study on Human Image Synthesis
Figure 4 for Evaluating Text-to-Image Generative Models: An Empirical Study on Human Image Synthesis
Viaarxiv icon

RealCustom: Narrowing Real Text Word for Real-Time Open-Domain Text-to-Image Customization

Mar 01, 2024
Mengqi Huang, Zhendong Mao, Mingcong Liu, Qian He, Yongdong Zhang

Figure 1 for RealCustom: Narrowing Real Text Word for Real-Time Open-Domain Text-to-Image Customization
Figure 2 for RealCustom: Narrowing Real Text Word for Real-Time Open-Domain Text-to-Image Customization
Figure 3 for RealCustom: Narrowing Real Text Word for Real-Time Open-Domain Text-to-Image Customization
Figure 4 for RealCustom: Narrowing Real Text Word for Real-Time Open-Domain Text-to-Image Customization
Viaarxiv icon

The Case for Evaluating Multimodal Translation Models on Text Datasets

Mar 05, 2024
Vipin Vijayan, Braeden Bowen, Scott Grigsby, Timothy Anderson, Jeremy Gwinnup

Figure 1 for The Case for Evaluating Multimodal Translation Models on Text Datasets
Figure 2 for The Case for Evaluating Multimodal Translation Models on Text Datasets
Viaarxiv icon

Socratic Reasoning Improves Positive Text Rewriting

Mar 05, 2024
Anmol Goel, Nico Daheim, Iryna Gurevych

Figure 1 for Socratic Reasoning Improves Positive Text Rewriting
Figure 2 for Socratic Reasoning Improves Positive Text Rewriting
Figure 3 for Socratic Reasoning Improves Positive Text Rewriting
Figure 4 for Socratic Reasoning Improves Positive Text Rewriting
Viaarxiv icon

PromptCharm: Text-to-Image Generation through Multi-modal Prompting and Refinement

Mar 06, 2024
Zhijie Wang, Yuheng Huang, Da Song, Lei Ma, Tianyi Zhang

Figure 1 for PromptCharm: Text-to-Image Generation through Multi-modal Prompting and Refinement
Figure 2 for PromptCharm: Text-to-Image Generation through Multi-modal Prompting and Refinement
Figure 3 for PromptCharm: Text-to-Image Generation through Multi-modal Prompting and Refinement
Figure 4 for PromptCharm: Text-to-Image Generation through Multi-modal Prompting and Refinement
Viaarxiv icon

Neural Image Compression with Text-guided Encoding for both Pixel-level and Perceptual Fidelity

Mar 05, 2024
Hagyeong Lee, Minkyu Kim, Jun-Hyuk Kim, Seungeon Kim, Dokwan Oh, Jaeho Lee

Figure 1 for Neural Image Compression with Text-guided Encoding for both Pixel-level and Perceptual Fidelity
Figure 2 for Neural Image Compression with Text-guided Encoding for both Pixel-level and Perceptual Fidelity
Figure 3 for Neural Image Compression with Text-guided Encoding for both Pixel-level and Perceptual Fidelity
Figure 4 for Neural Image Compression with Text-guided Encoding for both Pixel-level and Perceptual Fidelity
Viaarxiv icon