Alert button

"Image": models, code, and papers
Alert button

SalFoM: Dynamic Saliency Prediction with Video Foundation Models

Apr 03, 2024
Morteza Moradi, Mohammad Moradi, Francesco Rundo, Concetto Spampinato, Ali Borji, Simone Palazzo

Viaarxiv icon

Latent Neural Cellular Automata for Resource-Efficient Image Restoration

Mar 22, 2024
Andrea Menta, Alberto Archetti, Matteo Matteucci

Viaarxiv icon

VideoDistill: Language-aware Vision Distillation for Video Question Answering

Apr 01, 2024
Bo Zou, Chao Yang, Yu Qiao, Chengbin Quan, Youjian Zhao

Viaarxiv icon

FE-DeTr: Keypoint Detection and Tracking in Low-quality Image Frames with Events

Add code
Bookmark button
Alert button
Mar 18, 2024
Xiangyuan Wang, Kuangyi Chen, Wen Yang, Lei Yu, Yannan Xing, Huai Yu

Figure 1 for FE-DeTr: Keypoint Detection and Tracking in Low-quality Image Frames with Events
Figure 2 for FE-DeTr: Keypoint Detection and Tracking in Low-quality Image Frames with Events
Figure 3 for FE-DeTr: Keypoint Detection and Tracking in Low-quality Image Frames with Events
Figure 4 for FE-DeTr: Keypoint Detection and Tracking in Low-quality Image Frames with Events
Viaarxiv icon

3DStyleGLIP: Part-Tailored Text-Guided 3D Neural Stylization

Apr 03, 2024
SeungJeh Chung, JooHyun Park, Hyewon Kan, HyeongYeop Kang

Viaarxiv icon

Neural Radiance Fields with Torch Units

Apr 03, 2024
Bingnan Ni, Huanyu Wang, Dongfeng Bai, Minghe Weng, Dexin Qi, Weichao Qiu, Bingbing Liu

Viaarxiv icon

Data-Efficient Contrastive Language-Image Pretraining: Prioritizing Data Quality over Quantity

Add code
Bookmark button
Alert button
Mar 20, 2024
Siddharth Joshi, Arnav Jain, Ali Payani, Baharan Mirzasoleiman

Figure 1 for Data-Efficient Contrastive Language-Image Pretraining: Prioritizing Data Quality over Quantity
Figure 2 for Data-Efficient Contrastive Language-Image Pretraining: Prioritizing Data Quality over Quantity
Figure 3 for Data-Efficient Contrastive Language-Image Pretraining: Prioritizing Data Quality over Quantity
Figure 4 for Data-Efficient Contrastive Language-Image Pretraining: Prioritizing Data Quality over Quantity
Viaarxiv icon

mChartQA: A universal benchmark for multimodal Chart Question Answer based on Vision-Language Alignment and Reasoning

Apr 02, 2024
Jingxuan Wei, Nan Xu, Guiyong Chang, Yin Luo, BiHui Yu, Ruifeng Guo

Viaarxiv icon

Low-Trace Adaptation of Zero-shot Self-supervised Blind Image Denoising

Mar 19, 2024
Jintong Hu, Bin Xia, Bingchen Li, Wenming Yang

Figure 1 for Low-Trace Adaptation of Zero-shot Self-supervised Blind Image Denoising
Figure 2 for Low-Trace Adaptation of Zero-shot Self-supervised Blind Image Denoising
Figure 3 for Low-Trace Adaptation of Zero-shot Self-supervised Blind Image Denoising
Figure 4 for Low-Trace Adaptation of Zero-shot Self-supervised Blind Image Denoising
Viaarxiv icon

VLM-CPL: Consensus Pseudo Labels from Vision-Language Models for Human Annotation-Free Pathological Image Classification

Add code
Bookmark button
Alert button
Mar 23, 2024
Lanfeng Zhong, Xin Liao, Shaoting Zhang, Xiaofan Zhang, Guotai Wang

Viaarxiv icon