Alert button

"Image": models, code, and papers
Alert button

Advancing Generative Model Evaluation: A Novel Algorithm for Realistic Image Synthesis and Comparison in OCR System

Mar 01, 2024
Majid Memari, Khaled R. Ahmed, Shahram Rahimi, Noorbakhsh Amiri Golilarz

Viaarxiv icon

Variable-Rate Learned Image Compression with Multi-Objective Optimization and Quantization-Reconstruction Offsets

Feb 29, 2024
Fatih Kamisli, Fabien Racape, Hyomin Choi

Viaarxiv icon

Low-dose CT Denoising with Language-engaged Dual-space Alignment

Mar 10, 2024
Zhihao Chen, Tao Chen, Chenhui Wang, Chuang Niu, Ge Wang, Hongming Shan

Viaarxiv icon

ODM: A Text-Image Further Alignment Pre-training Approach for Scene Text Detection and Spotting

Mar 01, 2024
Chen Duan, Pei Fu, Shan Guo, Qianyi Jiang, Xiaoming Wei

Figure 1 for ODM: A Text-Image Further Alignment Pre-training Approach for Scene Text Detection and Spotting
Figure 2 for ODM: A Text-Image Further Alignment Pre-training Approach for Scene Text Detection and Spotting
Figure 3 for ODM: A Text-Image Further Alignment Pre-training Approach for Scene Text Detection and Spotting
Figure 4 for ODM: A Text-Image Further Alignment Pre-training Approach for Scene Text Detection and Spotting
Viaarxiv icon

StyleGaussian: Instant 3D Style Transfer with Gaussian Splatting

Mar 12, 2024
Kunhao Liu, Fangneng Zhan, Muyu Xu, Christian Theobalt, Ling Shao, Shijian Lu

Viaarxiv icon

SSM Meets Video Diffusion Models: Efficient Video Generation with Structured State Spaces

Mar 12, 2024
Yuta Oshima, Shohei Taniguchi, Masahiro Suzuki, Yutaka Matsuo

Viaarxiv icon

Denoising Autoregressive Representation Learning

Mar 08, 2024
Yazhe Li, Jorg Bornschein, Ting Chen

Viaarxiv icon

Training Generative Image Super-Resolution Models by Wavelet-Domain Losses Enables Better Control of Artifacts

Feb 29, 2024
Cansu Korkmaz, A. Murat Tekalp, Zafer Dogan

Viaarxiv icon

Solving the bongard-logo problem by modeling a probabilistic model

Mar 09, 2024
Ruizhuo Song, Beiming Yuan

Figure 1 for Solving the bongard-logo problem by modeling a probabilistic model
Figure 2 for Solving the bongard-logo problem by modeling a probabilistic model
Figure 3 for Solving the bongard-logo problem by modeling a probabilistic model
Figure 4 for Solving the bongard-logo problem by modeling a probabilistic model
Viaarxiv icon

Multiple Instance Learning with random sampling for Whole Slide Image Classification

Mar 08, 2024
H. Keshvarikhojasteh, J. P. W. Pluim, M. Veta

Figure 1 for Multiple Instance Learning with random sampling for Whole Slide Image Classification
Figure 2 for Multiple Instance Learning with random sampling for Whole Slide Image Classification
Figure 3 for Multiple Instance Learning with random sampling for Whole Slide Image Classification
Figure 4 for Multiple Instance Learning with random sampling for Whole Slide Image Classification
Viaarxiv icon