Alert button

"Image": models, code, and papers
Alert button

PixArt-$α$: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Add code
Bookmark button
Alert button
Oct 16, 2023
Junsong Chen, Jincheng Yu, Chongjian Ge, Lewei Yao, Enze Xie, Yue Wu, Zhongdao Wang, James Kwok, Ping Luo, Huchuan Lu, Zhenguo Li

Figure 1 for PixArt-$α$: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
Figure 2 for PixArt-$α$: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
Figure 3 for PixArt-$α$: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
Figure 4 for PixArt-$α$: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
Viaarxiv icon

Deep Learning based CNN Model for Classification and Detection of Individuals Wearing Face Mask

Nov 17, 2023
R. Chinnaiyan, Iyyappan M, Al Raiyan Shariff A, Kondaveeti Sai, Mallikarjunaiah B M, P Bharath

Viaarxiv icon

SafeSea: Synthetic Data Generation for Adverse & Low Probability Maritime Conditions

Add code
Bookmark button
Alert button
Nov 24, 2023
Martin Tran, Jordan Shipard, Hermawan Mulyono, Arnold Wiliem, Clinton Fookes

Viaarxiv icon

Continual Self-supervised Learning: Towards Universal Multi-modal Medical Data Representation Learning

Add code
Bookmark button
Alert button
Nov 30, 2023
Yiwen Ye, Yutong Xie, Jianpeng Zhang, Ziyang Chen, Qi Wu, Yong Xia

Viaarxiv icon

X-Dreamer: Creating High-quality 3D Content by Bridging the Domain Gap Between Text-to-2D and Text-to-3D Generation

Add code
Bookmark button
Alert button
Nov 30, 2023
Yiwei Ma, Yijun Fan, Jiayi Ji, Haowei Wang, Xiaoshuai Sun, Guannan Jiang, Annan Shu, Rongrong Ji

Viaarxiv icon

SimulFlow: Simultaneously Extracting Feature and Identifying Target for Unsupervised Video Object Segmentation

Nov 30, 2023
Lingyi Hong, Wei Zhang, Shuyong Gao, Hong Lu, WenQiang Zhang

Viaarxiv icon

Scene Summarization: Clustering Scene Videos into Spatially Diverse Frames

Nov 28, 2023
Chao Chen, Mingzhi Zhu, Ankush Pratap Singh, Yu Yan, Felix Juefei Xu, Chen Feng

Viaarxiv icon

Zero-shot Referring Expression Comprehension via Structural Similarity Between Images and Captions

Nov 28, 2023
Zeyu Han, Fangrui Zhu, Qianru Lao, Huaizu Jiang

Viaarxiv icon

RichDreamer: A Generalizable Normal-Depth Diffusion Model for Detail Richness in Text-to-3D

Add code
Bookmark button
Alert button
Nov 28, 2023
Lingteng Qiu, Guanying Chen, Xiaodong Gu, Qi Zuo, Mutian Xu, Yushuang Wu, Weihao Yuan, Zilong Dong, Liefeng Bo, Xiaoguang Han

Viaarxiv icon

SARA: Controllable Makeup Transfer with Spatial Alignment and Region-Adaptive Normalization

Nov 28, 2023
Xiaojing Zhong, Xinyi Huang, Zhonghua Wu, Guosheng Lin, Qingyao Wu

Viaarxiv icon