Alert button

"Image": models, code, and papers
Alert button

Fast Registration of Photorealistic Avatars for VR Facial Animation

Add code
Bookmark button
Alert button
Jan 19, 2024
Chaitanya Patel, Shaojie Bai, Te-Li Wang, Jason Saragih, Shih-En Wei

Viaarxiv icon

Semantic Draw Engineering for Text-to-Image Creation

Dec 23, 2023
Yang Li, Huaqiang Jiang, Yangkai Wu

Viaarxiv icon

Transcending the Limit of Local Window: Advanced Super-Resolution Transformer with Adaptive Token Dictionary

Jan 16, 2024
Leheng Zhang, Yawei Li, Xingyu Zhou, Xiaorui Zhao, Shuhang Gu

Viaarxiv icon

MISS: A Generative Pretraining and Finetuning Approach for Med-VQA

Jan 18, 2024
Jiawei Chen, Dingkang Yang, Yue Jiang, Yuxuan Lei, Lihua Zhang

Viaarxiv icon

EEND-M2F: Masked-attention mask transformers for speaker diarization

Jan 23, 2024
Marc Härkönen, Samuel J. Broughton, Lahiru Samarakoon

Viaarxiv icon

Source-Free and Image-Only Unsupervised Domain Adaptation for Category Level Object Pose Estimation

Jan 19, 2024
Prakhar Kaushik, Aayush Mishra, Adam Kortylewski, Alan Yuille

Viaarxiv icon

CLIP Model for Images to Textual Prompts Based on Top-k Neighbors

Jan 18, 2024
Xin Zhang, Xin Zhang, YeMing Cai, Tianzhi Jia

Viaarxiv icon

Energy-Based Concept Bottleneck Models: Unifying Prediction, Concept Intervention, and Conditional Interpretations

Jan 25, 2024
Xinyue Xu, Yi Qin, Lu Mi, Hao Wang, Xiaomeng Li

Viaarxiv icon

Diffusion Enhancement for Cloud Removal in Ultra-Resolution Remote Sensing Imagery

Jan 25, 2024
Jialu Sui, Yiyang Ma, Wenhan Yang, Xiaokang Zhang, Man-On Pun, Jiaying Liu

Viaarxiv icon

Seeing the Unseen: Visual Common Sense for Semantic Placement

Jan 15, 2024
Ram Ramrakhya, Aniruddha Kembhavi, Dhruv Batra, Zsolt Kira, Kuo-Hao Zeng, Luca Weihs

Viaarxiv icon