Alert button

"Text": models, code, and papers
Alert button

SSD-LM: Semi-autoregressive Simplex-based Diffusion Language Model for Text Generation and Modular Control

Oct 31, 2022
Xiaochuang Han, Sachin Kumar, Yulia Tsvetkov

Figure 1 for SSD-LM: Semi-autoregressive Simplex-based Diffusion Language Model for Text Generation and Modular Control
Figure 2 for SSD-LM: Semi-autoregressive Simplex-based Diffusion Language Model for Text Generation and Modular Control
Figure 3 for SSD-LM: Semi-autoregressive Simplex-based Diffusion Language Model for Text Generation and Modular Control
Figure 4 for SSD-LM: Semi-autoregressive Simplex-based Diffusion Language Model for Text Generation and Modular Control
Viaarxiv icon

Disentangled Text Representation Learning with Information-Theoretic Perspective for Adversarial Robustness

Oct 26, 2022
Jiahao Zhao, Wenji Mao

Figure 1 for Disentangled Text Representation Learning with Information-Theoretic Perspective for Adversarial Robustness
Figure 2 for Disentangled Text Representation Learning with Information-Theoretic Perspective for Adversarial Robustness
Figure 3 for Disentangled Text Representation Learning with Information-Theoretic Perspective for Adversarial Robustness
Figure 4 for Disentangled Text Representation Learning with Information-Theoretic Perspective for Adversarial Robustness
Viaarxiv icon

HumanDiffusion: a Coarse-to-Fine Alignment Diffusion Framework for Controllable Text-Driven Person Image Generation

Nov 11, 2022
Kaiduo Zhang, Muyi Sun, Jianxin Sun, Binghao Zhao, Kunbo Zhang, Zhenan Sun, Tieniu Tan

Figure 1 for HumanDiffusion: a Coarse-to-Fine Alignment Diffusion Framework for Controllable Text-Driven Person Image Generation
Figure 2 for HumanDiffusion: a Coarse-to-Fine Alignment Diffusion Framework for Controllable Text-Driven Person Image Generation
Figure 3 for HumanDiffusion: a Coarse-to-Fine Alignment Diffusion Framework for Controllable Text-Driven Person Image Generation
Figure 4 for HumanDiffusion: a Coarse-to-Fine Alignment Diffusion Framework for Controllable Text-Driven Person Image Generation
Viaarxiv icon

Language Is Not All You Need: Aligning Perception with Language Models

Mar 01, 2023
Shaohan Huang, Li Dong, Wenhui Wang, Yaru Hao, Saksham Singhal, Shuming Ma, Tengchao Lv, Lei Cui, Owais Khan Mohammed, Barun Patra, Qiang Liu, Kriti Aggarwal, Zewen Chi, Johan Bjorck, Vishrav Chaudhary, Subhojit Som, Xia Song, Furu Wei

Figure 1 for Language Is Not All You Need: Aligning Perception with Language Models
Figure 2 for Language Is Not All You Need: Aligning Perception with Language Models
Figure 3 for Language Is Not All You Need: Aligning Perception with Language Models
Figure 4 for Language Is Not All You Need: Aligning Perception with Language Models
Viaarxiv icon

StrucTexTv2: Masked Visual-Textual Prediction for Document Image Pre-training

Mar 01, 2023
Yuechen Yu, Yulin Li, Chengquan Zhang, Xiaoqiang Zhang, Zengyuan Guo, Xiameng Qin, Kun Yao, Junyu Han, Errui Ding, Jingdong Wang

Figure 1 for StrucTexTv2: Masked Visual-Textual Prediction for Document Image Pre-training
Figure 2 for StrucTexTv2: Masked Visual-Textual Prediction for Document Image Pre-training
Figure 3 for StrucTexTv2: Masked Visual-Textual Prediction for Document Image Pre-training
Figure 4 for StrucTexTv2: Masked Visual-Textual Prediction for Document Image Pre-training
Viaarxiv icon

LDEdit: Towards Generalized Text Guided Image Manipulation via Latent Diffusion Models

Oct 05, 2022
Paramanand Chandramouli, Kanchana Vaishnavi Gandikota

Figure 1 for LDEdit: Towards Generalized Text Guided Image Manipulation via Latent Diffusion Models
Figure 2 for LDEdit: Towards Generalized Text Guided Image Manipulation via Latent Diffusion Models
Figure 3 for LDEdit: Towards Generalized Text Guided Image Manipulation via Latent Diffusion Models
Figure 4 for LDEdit: Towards Generalized Text Guided Image Manipulation via Latent Diffusion Models
Viaarxiv icon

SwissBERT: The Multilingual Language Model for Switzerland

Mar 23, 2023
Jannis Vamvas, Johannes Graën, Rico Sennrich

Figure 1 for SwissBERT: The Multilingual Language Model for Switzerland
Figure 2 for SwissBERT: The Multilingual Language Model for Switzerland
Figure 3 for SwissBERT: The Multilingual Language Model for Switzerland
Figure 4 for SwissBERT: The Multilingual Language Model for Switzerland
Viaarxiv icon

Towards Flexible Multi-modal Document Models

Mar 31, 2023
Naoto Inoue, Kotaro Kikuchi, Edgar Simo-Serra, Mayu Otani, Kota Yamaguchi

Figure 1 for Towards Flexible Multi-modal Document Models
Figure 2 for Towards Flexible Multi-modal Document Models
Figure 3 for Towards Flexible Multi-modal Document Models
Figure 4 for Towards Flexible Multi-modal Document Models
Viaarxiv icon

Reference-based Image Composition with Sketch via Structure-aware Diffusion Model

Mar 31, 2023
Kangyeol Kim, Sunghyun Park, Junsoo Lee, Jaegul Choo

Figure 1 for Reference-based Image Composition with Sketch via Structure-aware Diffusion Model
Figure 2 for Reference-based Image Composition with Sketch via Structure-aware Diffusion Model
Figure 3 for Reference-based Image Composition with Sketch via Structure-aware Diffusion Model
Figure 4 for Reference-based Image Composition with Sketch via Structure-aware Diffusion Model
Viaarxiv icon

Swinv2-Imagen: Hierarchical Vision Transformer Diffusion Models for Text-to-Image Generation

Oct 18, 2022
Ruijun Li, Weihua Li, Yi Yang, Hanyu Wei, Jianhua Jiang, Quan Bai

Figure 1 for Swinv2-Imagen: Hierarchical Vision Transformer Diffusion Models for Text-to-Image Generation
Figure 2 for Swinv2-Imagen: Hierarchical Vision Transformer Diffusion Models for Text-to-Image Generation
Figure 3 for Swinv2-Imagen: Hierarchical Vision Transformer Diffusion Models for Text-to-Image Generation
Figure 4 for Swinv2-Imagen: Hierarchical Vision Transformer Diffusion Models for Text-to-Image Generation
Viaarxiv icon