Alert button

"Image": models, code, and papers
Alert button

TagCLIP: A Local-to-Global Framework to Enhance Open-Vocabulary Multi-Label Classification of CLIP Without Training

Add code
Bookmark button
Alert button
Dec 20, 2023
Yuqi Lin, Minghao Chen, Kaipeng Zhang, Hengjia Li, Mingming Li, Zheng Yang, Dongqin Lv, Binbin Lin, Haifeng Liu, Deng Cai

Viaarxiv icon

MetaSegNet: Metadata-collaborative Vision-Language Representation Learning for Semantic Segmentation of Remote Sensing Images

Dec 20, 2023
Libo Wang, Sijun Dong, Ying Chen, Xiaoliang Meng, Shenghui Fang

Viaarxiv icon

Unveiling Objects with SOLA: An Annotation-Free Image Search on the Object Level for Automotive Data Sets

Dec 04, 2023
Philipp Rigoll, Jacob Langner, Eric Sax

Viaarxiv icon

MVPatch: More Vivid Patch for Adversarial Camouflaged Attacks on Object Detectors in the Physical World

Dec 29, 2023
Zheng Zhou, Hongbo Zhao, Ju Liu, Qiaosheng Zhang, Guangbiao Wang, Chunlei Wang, Wenquan Feng

Viaarxiv icon

VCoder: Versatile Vision Encoders for Multimodal Large Language Models

Add code
Bookmark button
Alert button
Dec 21, 2023
Jitesh Jain, Jianwei Yang, Humphrey Shi

Viaarxiv icon

Parrot Captions Teach CLIP to Spot Text

Add code
Bookmark button
Alert button
Dec 21, 2023
Yiqi Lin, Conghui He, Alex Jinpeng Wang, Bin Wang, Weijia Li, Mike Zheng Shou

Viaarxiv icon

Generalizable Visual Reinforcement Learning with Segment Anything Model

Dec 28, 2023
Ziyu Wang, Yanjie Ze, Yifei Sun, Zhecheng Yuan, Huazhe Xu

Viaarxiv icon

Compression of end-to-end non-autoregressive image-to-speech system for low-resourced devices

Nov 30, 2023
Gokul Srinivasagan, Michael Deisher, Munir Georges

Viaarxiv icon

Unveiling Backbone Effects in CLIP: Exploring Representational Synergies and Variances

Dec 22, 2023
Cristian Rodriguez-Opazo, Edison Marrese-Taylor, Ehsan Abbasnejad, Hamed Damirchi, Ignacio M. Jara, Felipe Bravo-Marquez, Anton van den Hengel

Viaarxiv icon

Decouple Content and Motion for Conditional Image-to-Video Generation

Nov 24, 2023
Cuifeng Shen, Yulu Gan, Chen Chen, Xiongwei Zhu, Lele Cheng, Jinzhi Wang

Viaarxiv icon