Alert button

"Image": models, code, and papers
Alert button

DLoRA-TrOCR: Mixed Text Mode Optical Character Recognition Based On Transformer

Apr 19, 2024
Da Chang, Yu Li

Viaarxiv icon

VMambaMorph: a Visual Mamba-based Framework with Cross-Scan Module for Deformable 3D Image Registration

Apr 07, 2024
Ziyang Wang, Jian-Qing Zheng, Chao Ma, Tao Guo

Viaarxiv icon

NOISe: Nuclei-Aware Osteoclast Instance Segmentation for Mouse-to-Human Domain Transfer

Apr 15, 2024
Sai Kumar Reddy Manne, Brendan Martin, Tyler Roy, Ryan Neilson, Rebecca Peters, Meghana Chillara, Christine W. Lary, Katherine J. Motyl, Michael Wan

Viaarxiv icon

Fine color guidance in diffusion models and its application to image compression at extremely low bitrates

Apr 10, 2024
Tom Bordin, Thomas Maugey

Viaarxiv icon

Correcting Diffusion-Based Perceptual Image Compression with Privileged End-to-End Decoder

Apr 07, 2024
Yiyang Ma, Wenhan Yang, Jiaying Liu

Viaarxiv icon

RISE: 3D Perception Makes Real-World Robot Imitation Simple and Effective

Apr 18, 2024
Chenxi Wang, Hongjie Fang, Hao-Shu Fang, Cewu Lu

Viaarxiv icon

Dynamic Modality and View Selection for Multimodal Emotion Recognition with Missing Modalities

Apr 18, 2024
Luciana Trinkaus Menon, Luiz Carlos Ribeiro Neduziak, Jean Paul Barddal, Alessandro Lameiras Koerich, Alceu de Souza Britto Jr

Viaarxiv icon

AG-NeRF: Attention-guided Neural Radiance Fields for Multi-height Large-scale Outdoor Scene Rendering

Apr 18, 2024
Jingfeng Guo, Xiaohan Zhang, Baozhu Zhao, Qi Liu

Viaarxiv icon

A comprehensive liver CT landmark pair dataset for evaluating deformable image registration algorithms

Apr 05, 2024
Zhendong Zhang, Edward Robert Criscuolo, Yao Hao, Deshan Yang

Viaarxiv icon

Coreset Selection for Object Detection

Apr 14, 2024
Hojun Lee, Suyoung Kim, Junhoo Lee, Jaeyoung Yoo, Nojun Kwak

Viaarxiv icon