Alert button

"Image": models, code, and papers
Alert button

Optical Text Recognition in Nepali and Bengali: A Transformer-based Approach

Apr 03, 2024
S M Rakib Hasan, Aakar Dhakal, Md Humaion Kabir Mehedi, Annajiat Alim Rasel

Viaarxiv icon

BCAmirs at SemEval-2024 Task 4: Beyond Words: A Multimodal and Multilingual Exploration of Persuasion in Memes

Apr 03, 2024
Amirhossein Abaskohi, Amirhossein Dabiriaghdam, Lele Wang, Giuseppe Carenini

Viaarxiv icon

MatAtlas: Text-driven Consistent Geometry Texturing and Material Assignment

Apr 03, 2024
Duygu Ceylan, Valentin Deschaintre, Thibault Groueix, Rosalie Martin, Chun-Hao Huang, Romain Rouffet, Vladimir Kim, Gaëtan Lassagne

Viaarxiv icon

Model-agnostic Origin Attribution of Generated Images with Few-shot Examples

Apr 03, 2024
Fengyuan Liu, Haochen Luo, Yiming Li, Philip Torr, Jindong Gu

Viaarxiv icon

A Recommender System for NFT Collectibles with Item Feature

Apr 03, 2024
Minjoo Choi, Seonmi Kim, Yejin Kim, Youngbin Lee, Joohwan Hong, Yongjae Lee

Viaarxiv icon

The Devil is in the Edges: Monocular Depth Estimation with Edge-aware Consistency Fusion

Add code
Bookmark button
Alert button
Mar 30, 2024
Pengzhi Li, Yikang Ding, Haohan Wang, Chengshuai Tang, Zhiheng Li

Viaarxiv icon

Your Image is My Video: Reshaping the Receptive Field via Image-To-Video Differentiable AutoAugmentation and Fusion

Mar 22, 2024
Sofia Casarin, Cynthia I. Ugwu, Sergio Escalera, Oswald Lanz

Viaarxiv icon

Diffusion Attack: Leveraging Stable Diffusion for Naturalistic Image Attacking

Mar 21, 2024
Qianyu Guo, Jiaming Fu, Yawen Lu, Dongming Gan

Viaarxiv icon

Roadside Monocular 3D Detection via 2D Detection Prompting

Apr 04, 2024
Yechi Ma, Shuoquan Wei, Churun Zhang, Wei Hua, Yanan Li, Shu Kong

Viaarxiv icon

Prompt Learning for Oriented Power Transmission Tower Detection in High-Resolution SAR Images

Apr 01, 2024
Tianyang Li, Chao Wang, Hong Zhang

Viaarxiv icon