Alert button

"Image": models, code, and papers
Alert button

Can We Solve 3D Vision Tasks Starting from A 2D Vision Transformer?

Add code
Bookmark button
Alert button
Sep 18, 2022
Yi Wang, Zhiwen Fan, Tianlong Chen, Hehe Fan, Zhangyang Wang

Figure 1 for Can We Solve 3D Vision Tasks Starting from A 2D Vision Transformer?
Figure 2 for Can We Solve 3D Vision Tasks Starting from A 2D Vision Transformer?
Figure 3 for Can We Solve 3D Vision Tasks Starting from A 2D Vision Transformer?
Figure 4 for Can We Solve 3D Vision Tasks Starting from A 2D Vision Transformer?
Viaarxiv icon

Image preprocessing and modified adaptive thresholding for improving OCR

Nov 30, 2021
Rohan Lal Kshetry

Figure 1 for Image preprocessing and modified adaptive thresholding for improving OCR
Figure 2 for Image preprocessing and modified adaptive thresholding for improving OCR
Figure 3 for Image preprocessing and modified adaptive thresholding for improving OCR
Figure 4 for Image preprocessing and modified adaptive thresholding for improving OCR
Viaarxiv icon

Recognition-Aware Learned Image Compression

Feb 01, 2022
Maxime Kawawa-Beaudan, Ryan Roggenkemper, Avideh Zakhor

Figure 1 for Recognition-Aware Learned Image Compression
Figure 2 for Recognition-Aware Learned Image Compression
Figure 3 for Recognition-Aware Learned Image Compression
Figure 4 for Recognition-Aware Learned Image Compression
Viaarxiv icon

Class-Aware Visual Prompt Tuning for Vision-Language Pre-Trained Model

Add code
Bookmark button
Alert button
Aug 22, 2022
Yinghui Xing, Qirui Wu, De Cheng, Shizhou Zhang, Guoqiang Liang, Yanning Zhang

Figure 1 for Class-Aware Visual Prompt Tuning for Vision-Language Pre-Trained Model
Figure 2 for Class-Aware Visual Prompt Tuning for Vision-Language Pre-Trained Model
Figure 3 for Class-Aware Visual Prompt Tuning for Vision-Language Pre-Trained Model
Figure 4 for Class-Aware Visual Prompt Tuning for Vision-Language Pre-Trained Model
Viaarxiv icon

Generative Category-Level Shape and Pose Estimation with Semantic Primitives

Add code
Bookmark button
Alert button
Oct 03, 2022
Guanglin Li, Yifeng Li, Zhichao Ye, Qihang Zhang, Tao Kong, Zhaopeng Cui, Guofeng Zhang

Figure 1 for Generative Category-Level Shape and Pose Estimation with Semantic Primitives
Figure 2 for Generative Category-Level Shape and Pose Estimation with Semantic Primitives
Figure 3 for Generative Category-Level Shape and Pose Estimation with Semantic Primitives
Figure 4 for Generative Category-Level Shape and Pose Estimation with Semantic Primitives
Viaarxiv icon

EraseNet: A Recurrent Residual Network for Supervised Document Cleaning

Oct 03, 2022
Yashowardhan Shinde, Kishore Kulkarni

Figure 1 for EraseNet: A Recurrent Residual Network for Supervised Document Cleaning
Figure 2 for EraseNet: A Recurrent Residual Network for Supervised Document Cleaning
Figure 3 for EraseNet: A Recurrent Residual Network for Supervised Document Cleaning
Figure 4 for EraseNet: A Recurrent Residual Network for Supervised Document Cleaning
Viaarxiv icon

Mastering Spatial Graph Prediction of Road Networks

Oct 03, 2022
Sotiris Anagnostidis, Aurelien Lucchi, Thomas Hofmann

Figure 1 for Mastering Spatial Graph Prediction of Road Networks
Figure 2 for Mastering Spatial Graph Prediction of Road Networks
Figure 3 for Mastering Spatial Graph Prediction of Road Networks
Figure 4 for Mastering Spatial Graph Prediction of Road Networks
Viaarxiv icon

Privacy-Preserving Feature Coding for Machines

Add code
Bookmark button
Alert button
Oct 03, 2022
Bardia Azizian, Ivan V. Bajić

Figure 1 for Privacy-Preserving Feature Coding for Machines
Figure 2 for Privacy-Preserving Feature Coding for Machines
Figure 3 for Privacy-Preserving Feature Coding for Machines
Figure 4 for Privacy-Preserving Feature Coding for Machines
Viaarxiv icon

Can you recommend content to creatives instead of final consumers? A RecSys based on user's preferred visual styles

Aug 23, 2022
Raul Gomez Bruballa, Lauren Burnham-King, Alessandra Sala

Figure 1 for Can you recommend content to creatives instead of final consumers? A RecSys based on user's preferred visual styles
Figure 2 for Can you recommend content to creatives instead of final consumers? A RecSys based on user's preferred visual styles
Figure 3 for Can you recommend content to creatives instead of final consumers? A RecSys based on user's preferred visual styles
Figure 4 for Can you recommend content to creatives instead of final consumers? A RecSys based on user's preferred visual styles
Viaarxiv icon

LSC-GAN: Latent Style Code Modeling for Continuous Image-to-image Translation

Add code
Bookmark button
Alert button
Oct 11, 2021
Qiusheng Huang, Xueqi Hu, Li Sun, Qingli Li

Figure 1 for LSC-GAN: Latent Style Code Modeling for Continuous Image-to-image Translation
Figure 2 for LSC-GAN: Latent Style Code Modeling for Continuous Image-to-image Translation
Figure 3 for LSC-GAN: Latent Style Code Modeling for Continuous Image-to-image Translation
Figure 4 for LSC-GAN: Latent Style Code Modeling for Continuous Image-to-image Translation
Viaarxiv icon