Alert button

"Image": models, code, and papers
Alert button

Improving Text-to-Image Consistency via Automatic Prompt Optimization

Mar 26, 2024
Oscar Mañas, Pietro Astolfi, Melissa Hall, Candace Ross, Jack Urbanek, Adina Williams, Aishwarya Agrawal, Adriana Romero-Soriano, Michal Drozdzal

Viaarxiv icon

ECNet: Effective Controllable Text-to-Image Diffusion Models

Mar 27, 2024
Sicheng Li, Keqiang Sun, Zhixin Lai, Xiaoshi Wu, Feng Qiu, Haoran Xie, Kazunori Miyata, Hongsheng Li

Viaarxiv icon

FairRAG: Fair Human Generation via Fair Retrieval Augmentation

Apr 05, 2024
Robik Shrestha, Yang Zou, Qiuyu Chen, Zhiheng Li, Yusheng Xie, Siqi Deng

Viaarxiv icon

Quantum Circuit $C^*$-algebra Net

Apr 09, 2024
Yuka Hashimoto, Ryuichiro Hataya

Viaarxiv icon

Vision-Language Model-based Physical Reasoning for Robot Liquid Perception

Apr 10, 2024
Wenqiang Lai, Yuan Gao, Tin Lun Lam

Viaarxiv icon

An inclusive review on deep learning techniques and their scope in handwriting recognition

Apr 10, 2024
Sukhdeep Singh, Sudhir Rohilla, Anuj Sharma

Viaarxiv icon

Unsupervised Tumor-Aware Distillation for Multi-Modal Brain Image Translation

Add code
Bookmark button
Alert button
Mar 29, 2024
Chuan Huang, Jia Wei, Rui Li

Viaarxiv icon

Dynamic Deep Learning Based Super-Resolution For The Shallow Water Equations

Apr 09, 2024
Maximilian Witte, Fabricio Rodrigues Lapolli, Philip Freese, Sebastian Götschel, Daniel Ruprecht, Peter Korn, Christopher Kadow

Viaarxiv icon

GOAT-Bench: A Benchmark for Multi-Modal Lifelong Navigation

Apr 09, 2024
Mukul Khanna, Ram Ramrakhya, Gunjan Chhablani, Sriram Yenamandra, Theophile Gervet, Matthew Chang, Zsolt Kira, Devendra Singh Chaplot, Dhruv Batra, Roozbeh Mottaghi

Viaarxiv icon

NeuroPictor: Refining fMRI-to-Image Reconstruction via Multi-individual Pretraining and Multi-level Modulation

Add code
Bookmark button
Alert button
Mar 27, 2024
Jingyang Huo, Yikai Wang, Xuelin Qian, Yun Wang, Chong Li, Jianfeng Feng, Yanwei Fu

Viaarxiv icon