"Text": models, code, and papers

Customizing Motion in Text-to-Video Diffusion Models

Dec 07, 2023
Joanna Materzynska, Josef Sivic, Eli Shechtman, Antonio Torralba, Richard Zhang, Bryan Russell

Via arXiv

Bridging Synthetic and Real Worlds for Pre-training Scene Text Detectors

Dec 08, 2023
Tongkun Guan, Wei Shen, Xue Yang, Xuehui Wang, Xiaokang Yang

Via arXiv

A Vision Check-up for Language Models

Jan 03, 2024
Pratyusha Sharma, Tamar Rott Shaham, Manel Baradad, Stephanie Fu, Adrian Rodriguez-Munoz, Shivam Duggal, Phillip Isola, Antonio Torralba

Via arXiv

Physio: An LLM-Based Physiotherapy Advisor

Jan 03, 2024
Rúben Almeida, Hugo Sousa, Luís F. Cunha, Nuno Guimarães, Ricardo Campos, Alípio Jorge

Via arXiv

Autocompletion of Chief Complaints in the Electronic Health Records using Large Language Models

Jan 11, 2024
K M Sajjadul Islam, Ayesha Siddika Nipu, Praveen Madiraju, Priya Deshpande

Via arXiv

GE-AdvGAN: Improving the transferability of adversarial samples by gradient editing-based adversarial generative model

Jan 11, 2024
Zhiyu Zhu, Huaming Chen, Xinyi Wang, Jiayu Zhang, Zhibo Jin, Kim-Kwang Raymond Choo

Via arXiv

Cross-Modal Adaptive Dual Association for Text-to-Image Person Retrieval

Dec 04, 2023
Dixuan Lin, Yixing Peng, Jingke Meng, Wei-Shi Zheng

Via arXiv

UDiffText: A Unified Framework for High-quality Text Synthesis in Arbitrary Images via Character-aware Diffusion Models

Dec 08, 2023
Yiming Zhao, Zhouhui Lian

Via arXiv

Brain-Conditional Multimodal Synthesis: A Survey and Taxonomy

Jan 03, 2024
Weijian Mai, Jian Zhang, Pengfei Fang, Zhijun Zhang

Via arXiv

Morphable Diffusion: 3D-Consistent Diffusion for Single-image Avatar Creation

Jan 09, 2024
Xiyi Chen, Marko Mihajlovic, Shaofei Wang, Sergey Prokudin, Siyu Tang

Via arXiv