Alert button

"Text": models, code, and papers
Alert button

GenTron: Delving Deep into Diffusion Transformers for Image and Video Generation

Dec 07, 2023
Shoufa Chen, Mengmeng Xu, Jiawei Ren, Yuren Cong, Sen He, Yanping Xie, Animesh Sinha, Ping Luo, Tao Xiang, Juan-Manuel Perez-Rua

Figure 1 for GenTron: Delving Deep into Diffusion Transformers for Image and Video Generation
Figure 2 for GenTron: Delving Deep into Diffusion Transformers for Image and Video Generation
Figure 3 for GenTron: Delving Deep into Diffusion Transformers for Image and Video Generation
Figure 4 for GenTron: Delving Deep into Diffusion Transformers for Image and Video Generation
Viaarxiv icon

Weakly Supervised Open-Vocabulary Object Detection

Dec 19, 2023
Jianghang Lin, Yunhang Shen, Bingquan Wang, Shaohui Lin, Ke Li, Liujuan Cao

Viaarxiv icon

Dynamic Weighted Combiner for Mixed-Modal Image Retrieval

Dec 11, 2023
Fuxiang Huang, Lei Zhang, Xiaowei Fu, Suqi Song

Viaarxiv icon

ArchiGuesser -- AI Art Architecture Educational Game

Dec 14, 2023
Joern Ploennigs, Markus Berger, Eva Carnein

Viaarxiv icon

De-identification of clinical free text using natural language processing: A systematic review of current approaches

Nov 28, 2023
Aleksandar Kovačević, Bojana Bašaragin, Nikola Milošević, Goran Nenadić

Viaarxiv icon

Evolving Domain Adaptation of Pretrained Language Models for Text Classification

Nov 16, 2023
Yun-Shiuan Chuang, Yi Wu, Dhruv Gupta, Rheeya Uppaal, Ananya Kumar, Luhang Sun, Makesh Narsimhan Sreedhar, Sijia Yang, Timothy T. Rogers, Junjie Hu

Figure 1 for Evolving Domain Adaptation of Pretrained Language Models for Text Classification
Figure 2 for Evolving Domain Adaptation of Pretrained Language Models for Text Classification
Figure 3 for Evolving Domain Adaptation of Pretrained Language Models for Text Classification
Figure 4 for Evolving Domain Adaptation of Pretrained Language Models for Text Classification
Viaarxiv icon

Fine-grained Controllable Video Generation via Object Appearance and Context

Dec 05, 2023
Hsin-Ping Huang, Yu-Chuan Su, Deqing Sun, Lu Jiang, Xuhui Jia, Yukun Zhu, Ming-Hsuan Yang

Viaarxiv icon

Customize-It-3D: High-Quality 3D Creation from A Single Image Using Subject-Specific Knowledge Prior

Dec 15, 2023
Nan Huang, Ting Zhang, Yuhui Yuan, Dong Chen, Shanghang Zhang

Viaarxiv icon

Using Large Language Models to Accelerate Communication for Users with Severe Motor Impairments

Dec 03, 2023
Shanqing Cai, Subhashini Venugopalan, Katie Seaver, Xiang Xiao, Katrin Tomanek, Sri Jalasutram, Meredith Ringel Morris, Shaun Kane, Ajit Narayanan, Robert L. MacDonald, Emily Kornman, Daniel Vance, Blair Casey, Steve M. Gleason, Philip Q. Nelson, Michael P. Brenner

Viaarxiv icon

Latent Space Editing in Transformer-Based Flow Matching

Dec 17, 2023
Vincent Tao Hu, David W Zhang, Pascal Mettes, Meng Tang, Deli Zhao, Cees G. M. Snoek

Viaarxiv icon