Alert button

"Text": models, code, and papers
Alert button

Collaborative Generative AI: Integrating GPT-k for Efficient Editing in Text-to-Image Generation

May 18, 2023
Wanrong Zhu, Xinyi Wang, Yujie Lu, Tsu-Jui Fu, Xin Eric Wang, Miguel Eckstein, William Yang Wang

Figure 1 for Collaborative Generative AI: Integrating GPT-k for Efficient Editing in Text-to-Image Generation
Figure 2 for Collaborative Generative AI: Integrating GPT-k for Efficient Editing in Text-to-Image Generation
Figure 3 for Collaborative Generative AI: Integrating GPT-k for Efficient Editing in Text-to-Image Generation
Figure 4 for Collaborative Generative AI: Integrating GPT-k for Efficient Editing in Text-to-Image Generation
Viaarxiv icon

Towards Unified Scene Text Spotting based on Sequence Generation

Apr 07, 2023
Taeho Kil, Seonghyeon Kim, Sukmin Seo, Yoonsik Kim, Daehee Kim

Figure 1 for Towards Unified Scene Text Spotting based on Sequence Generation
Figure 2 for Towards Unified Scene Text Spotting based on Sequence Generation
Figure 3 for Towards Unified Scene Text Spotting based on Sequence Generation
Figure 4 for Towards Unified Scene Text Spotting based on Sequence Generation
Viaarxiv icon

TextDeformer: Geometry Manipulation using Text Guidance

Apr 26, 2023
William Gao, Noam Aigerman, Thibault Groueix, Vladimir G. Kim, Rana Hanocka

Figure 1 for TextDeformer: Geometry Manipulation using Text Guidance
Figure 2 for TextDeformer: Geometry Manipulation using Text Guidance
Figure 3 for TextDeformer: Geometry Manipulation using Text Guidance
Figure 4 for TextDeformer: Geometry Manipulation using Text Guidance
Viaarxiv icon

LISA: Reasoning Segmentation via Large Language Model

Aug 01, 2023
Xin Lai, Zhuotao Tian, Yukang Chen, Yanwei Li, Yuhui Yuan, Shu Liu, Jiaya Jia

Figure 1 for LISA: Reasoning Segmentation via Large Language Model
Figure 2 for LISA: Reasoning Segmentation via Large Language Model
Figure 3 for LISA: Reasoning Segmentation via Large Language Model
Figure 4 for LISA: Reasoning Segmentation via Large Language Model
Viaarxiv icon

Relational Extraction on Wikipedia Tables using Convolutional and Memory Networks

Jul 11, 2023
Arif Shahriar, Rohan Saha, Denilson Barbosa

Figure 1 for Relational Extraction on Wikipedia Tables using Convolutional and Memory Networks
Figure 2 for Relational Extraction on Wikipedia Tables using Convolutional and Memory Networks
Figure 3 for Relational Extraction on Wikipedia Tables using Convolutional and Memory Networks
Figure 4 for Relational Extraction on Wikipedia Tables using Convolutional and Memory Networks
Viaarxiv icon

Degeneration-Tuning: Using Scrambled Grid shield Unwanted Concepts from Stable Diffusion

Aug 02, 2023
Zixuan Ni, Longhui Wei, Jiachen Li, Siliang Tang, Yueting Zhuang, Qi Tian

Figure 1 for Degeneration-Tuning: Using Scrambled Grid shield Unwanted Concepts from Stable Diffusion
Figure 2 for Degeneration-Tuning: Using Scrambled Grid shield Unwanted Concepts from Stable Diffusion
Figure 3 for Degeneration-Tuning: Using Scrambled Grid shield Unwanted Concepts from Stable Diffusion
Figure 4 for Degeneration-Tuning: Using Scrambled Grid shield Unwanted Concepts from Stable Diffusion
Viaarxiv icon

Dual Governance: The intersection of centralized regulation and crowdsourced safety mechanisms for Generative AI

Aug 02, 2023
Avijit Ghosh, Dhanya Lakshmi

Viaarxiv icon

WOUAF: Weight Modulation for User Attribution and Fingerprinting in Text-to-Image Diffusion Models

Jun 07, 2023
Changhoon Kim, Kyle Min, Maitreya Patel, Sheng Cheng, Yezhou Yang

Figure 1 for WOUAF: Weight Modulation for User Attribution and Fingerprinting in Text-to-Image Diffusion Models
Figure 2 for WOUAF: Weight Modulation for User Attribution and Fingerprinting in Text-to-Image Diffusion Models
Figure 3 for WOUAF: Weight Modulation for User Attribution and Fingerprinting in Text-to-Image Diffusion Models
Figure 4 for WOUAF: Weight Modulation for User Attribution and Fingerprinting in Text-to-Image Diffusion Models
Viaarxiv icon

OxfordVGG Submission to the EGO4D AV Transcription Challenge

Jul 18, 2023
Jaesung Huh, Max Bain, Andrew Zisserman

Figure 1 for OxfordVGG Submission to the EGO4D AV Transcription Challenge
Figure 2 for OxfordVGG Submission to the EGO4D AV Transcription Challenge
Viaarxiv icon

Towards Real-time Text-driven Image Manipulation with Unconditional Diffusion Models

Apr 10, 2023
Nikita Starodubcev, Dmitry Baranchuk, Valentin Khrulkov, Artem Babenko

Figure 1 for Towards Real-time Text-driven Image Manipulation with Unconditional Diffusion Models
Figure 2 for Towards Real-time Text-driven Image Manipulation with Unconditional Diffusion Models
Figure 3 for Towards Real-time Text-driven Image Manipulation with Unconditional Diffusion Models
Figure 4 for Towards Real-time Text-driven Image Manipulation with Unconditional Diffusion Models
Viaarxiv icon