Alert button

"Text": models, code, and papers
Alert button

Imagen Editor and EditBench: Advancing and Evaluating Text-Guided Image Inpainting

Dec 13, 2022
Su Wang, Chitwan Saharia, Ceslee Montgomery, Jordi Pont-Tuset, Shai Noy, Stefano Pellegrini, Yasumasa Onoe, Sarah Laszlo, David J. Fleet, Radu Soricut, Jason Baldridge, Mohammad Norouzi, Peter Anderson, William Chan

Figure 1 for Imagen Editor and EditBench: Advancing and Evaluating Text-Guided Image Inpainting
Figure 2 for Imagen Editor and EditBench: Advancing and Evaluating Text-Guided Image Inpainting
Figure 3 for Imagen Editor and EditBench: Advancing and Evaluating Text-Guided Image Inpainting
Figure 4 for Imagen Editor and EditBench: Advancing and Evaluating Text-Guided Image Inpainting
Viaarxiv icon

A Pilot Study of Query-Free Adversarial Attack against Stable Diffusion

Apr 03, 2023
Haomin Zhuang, Yihua Zhang, Sijia Liu

Figure 1 for A Pilot Study of Query-Free Adversarial Attack against Stable Diffusion
Figure 2 for A Pilot Study of Query-Free Adversarial Attack against Stable Diffusion
Figure 3 for A Pilot Study of Query-Free Adversarial Attack against Stable Diffusion
Figure 4 for A Pilot Study of Query-Free Adversarial Attack against Stable Diffusion
Viaarxiv icon

Learning to Generate Poetic Chinese Landscape Painting with Calligraphy

May 08, 2023
Shaozu Yuan, Aijun Dai, Zhiling Yan, Ruixue Liu, Meng Chen, Baoyang Chen, Zhijie Qiu, Xiaodong He

Figure 1 for Learning to Generate Poetic Chinese Landscape Painting with Calligraphy
Figure 2 for Learning to Generate Poetic Chinese Landscape Painting with Calligraphy
Figure 3 for Learning to Generate Poetic Chinese Landscape Painting with Calligraphy
Figure 4 for Learning to Generate Poetic Chinese Landscape Painting with Calligraphy
Viaarxiv icon

Re-imagine the Negative Prompt Algorithm: Transform 2D Diffusion into 3D, alleviate Janus problem and Beyond

Apr 15, 2023
Mohammadreza Armandpour, Huangjie Zheng, Ali Sadeghian, Amir Sadeghian, Mingyuan Zhou

Figure 1 for Re-imagine the Negative Prompt Algorithm: Transform 2D Diffusion into 3D, alleviate Janus problem and Beyond
Figure 2 for Re-imagine the Negative Prompt Algorithm: Transform 2D Diffusion into 3D, alleviate Janus problem and Beyond
Figure 3 for Re-imagine the Negative Prompt Algorithm: Transform 2D Diffusion into 3D, alleviate Janus problem and Beyond
Figure 4 for Re-imagine the Negative Prompt Algorithm: Transform 2D Diffusion into 3D, alleviate Janus problem and Beyond
Viaarxiv icon

Perception Test: A Diagnostic Benchmark for Multimodal Video Models

May 23, 2023
Viorica Pătrăucean, Lucas Smaira, Ankush Gupta, Adrià Recasens Continente, Larisa Markeeva, Dylan Banarse, Skanda Koppula, Joseph Heyward, Mateusz Malinowski, Yi Yang, Carl Doersch, Tatiana Matejovicova, Yury Sulsky, Antoine Miech, Alex Frechette, Hanna Klimczak, Raphael Koster, Junlin Zhang, Stephanie Winkler, Yusuf Aytar, Simon Osindero, Dima Damen, Andrew Zisserman, João Carreira

Figure 1 for Perception Test: A Diagnostic Benchmark for Multimodal Video Models
Figure 2 for Perception Test: A Diagnostic Benchmark for Multimodal Video Models
Figure 3 for Perception Test: A Diagnostic Benchmark for Multimodal Video Models
Figure 4 for Perception Test: A Diagnostic Benchmark for Multimodal Video Models
Viaarxiv icon

Learning Transferable Pedestrian Representation from Multimodal Information Supervision

Apr 12, 2023
Liping Bao, Longhui Wei, Xiaoyu Qiu, Wengang Zhou, Houqiang Li, Qi Tian

Figure 1 for Learning Transferable Pedestrian Representation from Multimodal Information Supervision
Figure 2 for Learning Transferable Pedestrian Representation from Multimodal Information Supervision
Figure 3 for Learning Transferable Pedestrian Representation from Multimodal Information Supervision
Figure 4 for Learning Transferable Pedestrian Representation from Multimodal Information Supervision
Viaarxiv icon

ELITE: Encoding Visual Concepts into Textual Embeddings for Customized Text-to-Image Generation

Feb 27, 2023
Yuxiang Wei, Yabo Zhang, Zhilong Ji, Jinfeng Bai, Lei Zhang, Wangmeng Zuo

Figure 1 for ELITE: Encoding Visual Concepts into Textual Embeddings for Customized Text-to-Image Generation
Figure 2 for ELITE: Encoding Visual Concepts into Textual Embeddings for Customized Text-to-Image Generation
Figure 3 for ELITE: Encoding Visual Concepts into Textual Embeddings for Customized Text-to-Image Generation
Figure 4 for ELITE: Encoding Visual Concepts into Textual Embeddings for Customized Text-to-Image Generation
Viaarxiv icon

What does CLIP know about a red circle? Visual prompt engineering for VLMs

Apr 13, 2023
Aleksandar Shtedritski, Christian Rupprecht, Andrea Vedaldi

Figure 1 for What does CLIP know about a red circle? Visual prompt engineering for VLMs
Figure 2 for What does CLIP know about a red circle? Visual prompt engineering for VLMs
Figure 3 for What does CLIP know about a red circle? Visual prompt engineering for VLMs
Figure 4 for What does CLIP know about a red circle? Visual prompt engineering for VLMs
Viaarxiv icon

Counterfactual Evaluation of Peer-Review Assignment Policies

May 27, 2023
Martin Saveski, Steven Jecmen, Nihar B. Shah, Johan Ugander

Figure 1 for Counterfactual Evaluation of Peer-Review Assignment Policies
Figure 2 for Counterfactual Evaluation of Peer-Review Assignment Policies
Figure 3 for Counterfactual Evaluation of Peer-Review Assignment Policies
Figure 4 for Counterfactual Evaluation of Peer-Review Assignment Policies
Viaarxiv icon

Semi-Parametric Video-Grounded Text Generation

Jan 27, 2023
Sungdong Kim, Jin-Hwa Kim, Jiyoung Lee, Minjoon Seo

Figure 1 for Semi-Parametric Video-Grounded Text Generation
Figure 2 for Semi-Parametric Video-Grounded Text Generation
Figure 3 for Semi-Parametric Video-Grounded Text Generation
Figure 4 for Semi-Parametric Video-Grounded Text Generation
Viaarxiv icon