"Image": models, code, and papers

CL2CM: Improving Cross-Lingual Cross-Modal Retrieval via Cross-Lingual Knowledge Transfer

Dec 14, 2023
Yabing Wang, Fan Wang, Jianfeng Dong, Hao Luo

HeadRecon: High-Fidelity 3D Head Reconstruction from Monocular Video

Dec 14, 2023
Xueying Wang, Juyong Zhang

GOEnFusion: Gradient Origin Encodings for 3D Forward Diffusion Models

Dec 14, 2023
Animesh Karnewar, Andrea Vedaldi, Niloy J. Mitra, David Novotny

ResEnsemble-DDPM: Residual Denoising Diffusion Probabilistic Models for Ensemble Learning

Dec 04, 2023
Shi Zhenning, Dong Changsheng, Xie Xueshuo, Pan Bin, He Along, Li Tao

Implicit Learning of Scene Geometry from Poses for Global Localization

Dec 04, 2023
Mohammad Altillawi, Shile Li, Sai Manoj Prakhya, Ziyuan Liu, Joan Serrat

VCISR: Blind Single Image Super-Resolution with Video Compression Synthetic Data

Nov 02, 2023
Boyang Wang, Bowen Liu, Shiyu Liu, Fengyu Yang

Filling the Image Information Gap for VQA: Prompting Large Language Models to Proactively Ask Questions

Nov 20, 2023
Ziyue Wang, Chi Chen, Peng Li, Yang Liu

Deep learning as a tool for quantum error reduction in quantum image processing

Nov 08, 2023
Krzysztof Werner, Kamil Wereszczyński, Rafał Potempa, Krzysztof Cyran

MixerFlow for Image Modelling

Oct 25, 2023
Eshant English, Matthias Kirchler, Christoph Lippert

A Simple Interpretable Transformer for Fine-Grained Image Classification and Analysis

Nov 07, 2023
Dipanjyoti Paul, Arpita Chowdhury, Xinqi Xiong, Feng-Ju Chang, David Carlyn, Samuel Stevens, Kaiya Provost, Anuj Karpatne, Bryan Carstens, Daniel Rubenstein, Charles Stewart, Tanya Berger-Wolf, Yu Su, Wei-Lun Chao
