Alert button

"Text": models, code, and papers
Alert button

Towards Effective Usage of Human-Centric Priors in Diffusion Models for Text-based Human Image Generation

Mar 08, 2024
Junyan Wang, Zhenhong Sun, Zhiyu Tan, Xuanbai Chen, Weihua Chen, Hao Li, Cheng Zhang, Yang Song

Figure 1 for Towards Effective Usage of Human-Centric Priors in Diffusion Models for Text-based Human Image Generation
Figure 2 for Towards Effective Usage of Human-Centric Priors in Diffusion Models for Text-based Human Image Generation
Figure 3 for Towards Effective Usage of Human-Centric Priors in Diffusion Models for Text-based Human Image Generation
Figure 4 for Towards Effective Usage of Human-Centric Priors in Diffusion Models for Text-based Human Image Generation
Viaarxiv icon

TextMonkey: An OCR-Free Large Multimodal Model for Understanding Document

Mar 15, 2024
Yuliang Liu, Biao Yang, Qiang Liu, Zhang Li, Zhiyin Ma, Shuo Zhang, Xiang Bai

Figure 1 for TextMonkey: An OCR-Free Large Multimodal Model for Understanding Document
Figure 2 for TextMonkey: An OCR-Free Large Multimodal Model for Understanding Document
Figure 3 for TextMonkey: An OCR-Free Large Multimodal Model for Understanding Document
Figure 4 for TextMonkey: An OCR-Free Large Multimodal Model for Understanding Document
Viaarxiv icon

A Survey of AI-generated Text Forensic Systems: Detection, Attribution, and Characterization

Mar 02, 2024
Tharindu Kumarage, Garima Agrawal, Paras Sheth, Raha Moraffah, Aman Chadha, Joshua Garland, Huan Liu

Viaarxiv icon

Towards Understanding Cross and Self-Attention in Stable Diffusion for Text-Guided Image Editing

Mar 06, 2024
Bingyan Liu, Chengyu Wang, Tingfeng Cao, Kui Jia, Jun Huang

Figure 1 for Towards Understanding Cross and Self-Attention in Stable Diffusion for Text-Guided Image Editing
Figure 2 for Towards Understanding Cross and Self-Attention in Stable Diffusion for Text-Guided Image Editing
Figure 3 for Towards Understanding Cross and Self-Attention in Stable Diffusion for Text-Guided Image Editing
Figure 4 for Towards Understanding Cross and Self-Attention in Stable Diffusion for Text-Guided Image Editing
Viaarxiv icon

DeepEraser: Deep Iterative Context Mining for Generic Text Eraser

Feb 29, 2024
Hao Feng, Wendi Wang, Shaokai Liu, Jiajun Deng, Wengang Zhou, Houqiang Li

Viaarxiv icon

RORA: Robust Free-Text Rationale Evaluation

Mar 01, 2024
Zhengping Jiang, Yining Lu, Hanjie Chen, Daniel Khashabi, Benjamin Van Durme, Anqi Liu

Viaarxiv icon

Towards Implicit Prompt For Text-To-Image Models

Mar 08, 2024
Yue Yang, Yuqi lin, Hong Liu, Wenqi Shao, Runjian Chen, Hailong Shang, Yu Wang, Yu Qiao, Kaipeng Zhang, Ping Luo

Figure 1 for Towards Implicit Prompt For Text-To-Image Models
Figure 2 for Towards Implicit Prompt For Text-To-Image Models
Figure 3 for Towards Implicit Prompt For Text-To-Image Models
Figure 4 for Towards Implicit Prompt For Text-To-Image Models
Viaarxiv icon

Finetuned Multimodal Language Models Are High-Quality Image-Text Data Filters

Mar 05, 2024
Weizhi Wang, Khalil Mrini, Linjie Yang, Sateesh Kumar, Yu Tian, Xifeng Yan, Heng Wang

Figure 1 for Finetuned Multimodal Language Models Are High-Quality Image-Text Data Filters
Figure 2 for Finetuned Multimodal Language Models Are High-Quality Image-Text Data Filters
Figure 3 for Finetuned Multimodal Language Models Are High-Quality Image-Text Data Filters
Figure 4 for Finetuned Multimodal Language Models Are High-Quality Image-Text Data Filters
Viaarxiv icon

Doubly Abductive Counterfactual Inference for Text-based Image Editing

Mar 05, 2024
Xue Song, Jiequan Cui, Hanwang Zhang, Jingjing Chen, Richang Hong, Yu-Gang Jiang

Figure 1 for Doubly Abductive Counterfactual Inference for Text-based Image Editing
Figure 2 for Doubly Abductive Counterfactual Inference for Text-based Image Editing
Figure 3 for Doubly Abductive Counterfactual Inference for Text-based Image Editing
Figure 4 for Doubly Abductive Counterfactual Inference for Text-based Image Editing
Viaarxiv icon

Comprehensive Implementation of TextCNN for Enhanced Collaboration between Natural Language Processing and System Recommendation

Mar 12, 2024
Xiaonan Xu, Zheng Xu, Zhipeng Ling, Zhengyu Jin, ShuQian Du

Viaarxiv icon