Alert button

"Image": models, code, and papers
Alert button

A proposed new metric for the conceptual diversity of a text

Dec 27, 2023
İlknur Dönmez Phd, Mehmet Haklıdır Phd

Viaarxiv icon

Learning to Embed Time Series Patches Independently

Dec 27, 2023
Seunghan Lee, Taeyoung Park, Kibok Lee

Viaarxiv icon

Variational Bayes image restoration with compressive autoencoders

Nov 29, 2023
Maud Biquard, Marie Chabert, Thomas Oberlin

Viaarxiv icon

Unveiling Backbone Effects in CLIP: Exploring Representational Synergies and Variances

Dec 22, 2023
Cristian Rodriguez-Opazo, Edison Marrese-Taylor, Ehsan Abbasnejad, Hamed Damirchi, Ignacio M. Jara, Felipe Bravo-Marquez, Anton van den Hengel

Viaarxiv icon

MobileCLIP: Fast Image-Text Models through Multi-Modal Reinforced Training

Nov 28, 2023
Pavan Kumar Anasosalu Vasu, Hadi Pouransari, Fartash Faghri, Raviteja Vemulapalli, Oncel Tuzel

Viaarxiv icon

VCoder: Versatile Vision Encoders for Multimodal Large Language Models

Dec 21, 2023
Jitesh Jain, Jianwei Yang, Humphrey Shi

Viaarxiv icon

Parrot Captions Teach CLIP to Spot Text

Dec 21, 2023
Yiqi Lin, Conghui He, Alex Jinpeng Wang, Bin Wang, Weijia Li, Mike Zheng Shou

Viaarxiv icon

DiffiT: Diffusion Vision Transformers for Image Generation

Dec 04, 2023
Ali Hatamizadeh, Jiaming Song, Guilin Liu, Jan Kautz, Arash Vahdat

Viaarxiv icon

Soulstyler: Using Large Language Model to Guide Image Style Transfer for Target Object

Nov 29, 2023
Junhao Chen, Peng Rong, Jingbo Sun, Chao Li, Xiang Li, Hongwu Lv

Figure 1 for Soulstyler: Using Large Language Model to Guide Image Style Transfer for Target Object
Figure 2 for Soulstyler: Using Large Language Model to Guide Image Style Transfer for Target Object
Figure 3 for Soulstyler: Using Large Language Model to Guide Image Style Transfer for Target Object
Figure 4 for Soulstyler: Using Large Language Model to Guide Image Style Transfer for Target Object
Viaarxiv icon

FerKD: Surgical Label Adaptation for Efficient Distillation

Dec 29, 2023
Zhiqiang Shen

Viaarxiv icon