Alert button

"Image": models, code, and papers
Alert button

SyCoCa: Symmetrizing Contrastive Captioners with Attentive Masking for Multimodal Alignment

Jan 04, 2024
Ziping Ma, Furong Xu, Jian Liu, Ming Yang, Qingpei Guo

Viaarxiv icon

Spatial-Semantic Collaborative Cropping for User Generated Content

Add code
Bookmark button
Alert button
Jan 16, 2024
Yukun Su, Yiwen Cao, Jingliang Deng, Fengyun Rao, Qingyao Wu

Viaarxiv icon

Robust Sclera Segmentation for Skin-tone Agnostic Face Image Quality Assessment

Dec 22, 2023
Wassim Kabbani, Christoph Busch, Kiran Raja

Viaarxiv icon

DarkShot: Lighting Dark Images with Low-Compute and High-Quality

Jan 10, 2024
Jiazhang Zheng, Lei Li, Qiuping Liao, Cheng Li, Li Li, Yangxing Liu

Viaarxiv icon

Automatic 3D Multi-modal Ultrasound Segmentation of Human Placenta using Fusion Strategies and Deep Learning

Jan 17, 2024
Sonit Singh, Gordon Stevenson, Brendan Mein, Alec Welsh, Arcot Sowmya

Viaarxiv icon

A gradient-based approach to fast and accurate head motion compensation in cone-beam CT

Jan 17, 2024
Mareike Thies, Fabian Wagner, Noah Maul, Haijun Yu, Manuela Meier, Linda-Sophie Schneider, Mingxuan Gu, Siyuan Mei, Lukas Folle, Andreas Maier

Viaarxiv icon

VQA4CIR: Boosting Composed Image Retrieval with Visual Question Answering

Dec 19, 2023
Chun-Mei Feng, Yang Bai, Tao Luo, Zhen Li, Salman Khan, Wangmeng Zuo, Xinxing Xu, Rick Siow Mong Goh, Yong Liu

Viaarxiv icon

One for All: Toward Unified Foundation Models for Earth Vision

Jan 15, 2024
Zhitong Xiong, Yi Wang, Fahong Zhang, Xiao Xiang Zhu

Viaarxiv icon

BA-SAM: Scalable Bias-Mode Attention Mask for Segment Anything Model

Jan 08, 2024
Yiran Song, Qianyu Zhou, Xiangtai Li, Deng-Ping Fan, Xuequan Lu, Lizhuang Ma

Viaarxiv icon

Make Prompts Adaptable: Bayesian Modeling for Vision-Language Prompt Learning with Data-Dependent Prior

Add code
Bookmark button
Alert button
Jan 09, 2024
Youngjae Cho, HeeSun Bae, Seungjae Shin, Yeo Dong Youn, Weonyoung Joo, Il-Chul Moon

Viaarxiv icon