Alert button

"Image": models, code, and papers
Alert button

MaskINT: Video Editing via Interpolative Non-autoregressive Masked Transformers

Dec 19, 2023
Haoyu Ma, Shahin Mahdizadehaghdam, Bichen Wu, Zhipeng Fan, Yuchao Gu, Wenliang Zhao, Lior Shapira, Xiaohui Xie

Viaarxiv icon

Optimization of Image Processing Algorithms for Character Recognition in Cultural Typewritten Documents

Nov 27, 2023
Mariana Dias, Carla Teixeira Lopes

Viaarxiv icon

Clean Label Disentangling for Medical Image Segmentation with Noisy Labels

Nov 28, 2023
Zicheng Wang, Zhen Zhao, Erjian Guo, Luping Zhou

Viaarxiv icon

A Multimodal Approach for Advanced Pest Detection and Classification

Dec 18, 2023
Jinli Duan, Haoyu Ding, Sung Kim

Viaarxiv icon

Self-Discovering Interpretable Diffusion Latent Directions for Responsible Text-to-Image Generation

Nov 28, 2023
Hang Li, Chengzhi Shen, Philip Torr, Volker Tresp, Jindong Gu

Viaarxiv icon

Image segmentation with traveling waves in an exactly solvable recurrent neural network

Nov 28, 2023
Luisa H. B. Liboni, Roberto C. Budzinski, Alexandra N. Busch, Sindy Löwe, Thomas A. Keller, Max Welling, Lyle E. Muller

Viaarxiv icon

DiffCAD: Weakly-Supervised Probabilistic CAD Model Retrieval and Alignment from an RGB Image

Nov 30, 2023
Daoyi Gao, Dávid Rozenberszki, Stefan Leutenegger, Angela Dai

Viaarxiv icon

Generative AI and the History of Architecture

Dec 22, 2023
Joern Ploennigs, Markus Berger

Viaarxiv icon

The Rate-Distortion-Perception-Classification Tradeoff: Joint Source Coding and Modulation via Inverse-Domain GANs

Dec 22, 2023
Junli Fang, João F. C. Mota, Baoshan Lu, Weicheng Zhang, Xuemin Hong

Viaarxiv icon

ECAMP: Entity-centered Context-aware Medical Vision Language Pre-training

Dec 20, 2023
Rongsheng Wang, Qingsong Yao, Haoran Lai, Zhiyang He, Xiaodong Tao, Zihang Jiang, S. Kevin Zhou

Viaarxiv icon