Alert button

"Image": models, code, and papers
Alert button

Bootstrapping Objectness from Videos by Relaxed Common Fate and Visual Grouping

Apr 17, 2023
Long Lian, Zhirong Wu, Stella X. Yu

Figure 1 for Bootstrapping Objectness from Videos by Relaxed Common Fate and Visual Grouping
Figure 2 for Bootstrapping Objectness from Videos by Relaxed Common Fate and Visual Grouping
Figure 3 for Bootstrapping Objectness from Videos by Relaxed Common Fate and Visual Grouping
Figure 4 for Bootstrapping Objectness from Videos by Relaxed Common Fate and Visual Grouping
Viaarxiv icon

LinkGAN: Linking GAN Latents to Pixels for Controllable Image Synthesis

Add code
Bookmark button
Alert button
Jan 11, 2023
Jiapeng Zhu, Ceyuan Yang, Yujun Shen, Zifan Shi, Deli Zhao, Qifeng Chen

Figure 1 for LinkGAN: Linking GAN Latents to Pixels for Controllable Image Synthesis
Figure 2 for LinkGAN: Linking GAN Latents to Pixels for Controllable Image Synthesis
Figure 3 for LinkGAN: Linking GAN Latents to Pixels for Controllable Image Synthesis
Figure 4 for LinkGAN: Linking GAN Latents to Pixels for Controllable Image Synthesis
Viaarxiv icon

Deep Visual-Genetic Biometrics for Taxonomic Classification of Rare Species

May 20, 2023
Tayfun Karaderi, Tilo Burghardt, Raphael Morard, Daniela Schmidt

Figure 1 for Deep Visual-Genetic Biometrics for Taxonomic Classification of Rare Species
Figure 2 for Deep Visual-Genetic Biometrics for Taxonomic Classification of Rare Species
Figure 3 for Deep Visual-Genetic Biometrics for Taxonomic Classification of Rare Species
Figure 4 for Deep Visual-Genetic Biometrics for Taxonomic Classification of Rare Species
Viaarxiv icon

AVFace: Towards Detailed Audio-Visual 4D Face Reconstruction

Add code
Bookmark button
Alert button
May 11, 2023
Aggelina Chatziagapi, Dimitris Samaras

Figure 1 for AVFace: Towards Detailed Audio-Visual 4D Face Reconstruction
Figure 2 for AVFace: Towards Detailed Audio-Visual 4D Face Reconstruction
Figure 3 for AVFace: Towards Detailed Audio-Visual 4D Face Reconstruction
Figure 4 for AVFace: Towards Detailed Audio-Visual 4D Face Reconstruction
Viaarxiv icon

Invariant Scattering Transform for Medical Imaging

Apr 20, 2023
Md Manjurul Ahsan, Shivakumar Raman, Zahed Siddique

Figure 1 for Invariant Scattering Transform for Medical Imaging
Figure 2 for Invariant Scattering Transform for Medical Imaging
Figure 3 for Invariant Scattering Transform for Medical Imaging
Figure 4 for Invariant Scattering Transform for Medical Imaging
Viaarxiv icon

CLUSTSEG: Clustering for Universal Segmentation

Add code
Bookmark button
Alert button
May 03, 2023
James Liang, Tianfei Zhou, Dongfang Liu, Wenguan Wang

Figure 1 for CLUSTSEG: Clustering for Universal Segmentation
Figure 2 for CLUSTSEG: Clustering for Universal Segmentation
Figure 3 for CLUSTSEG: Clustering for Universal Segmentation
Figure 4 for CLUSTSEG: Clustering for Universal Segmentation
Viaarxiv icon

Using Spatio-Temporal Dual-Stream Network with Self-Supervised Learning for Lung Tumor Classification on Radial Probe Endobronchial Ultrasound Video

May 04, 2023
Ching-Kai Lin, Chin-Wen Chen, Yun-Chien Cheng

Figure 1 for Using Spatio-Temporal Dual-Stream Network with Self-Supervised Learning for Lung Tumor Classification on Radial Probe Endobronchial Ultrasound Video
Figure 2 for Using Spatio-Temporal Dual-Stream Network with Self-Supervised Learning for Lung Tumor Classification on Radial Probe Endobronchial Ultrasound Video
Figure 3 for Using Spatio-Temporal Dual-Stream Network with Self-Supervised Learning for Lung Tumor Classification on Radial Probe Endobronchial Ultrasound Video
Figure 4 for Using Spatio-Temporal Dual-Stream Network with Self-Supervised Learning for Lung Tumor Classification on Radial Probe Endobronchial Ultrasound Video
Viaarxiv icon

PIP: Positional-encoding Image Prior

Add code
Bookmark button
Alert button
Nov 25, 2022
Nimrod Shabtay, Eli Schwartz, Raja Giryes

Figure 1 for PIP: Positional-encoding Image Prior
Figure 2 for PIP: Positional-encoding Image Prior
Figure 3 for PIP: Positional-encoding Image Prior
Figure 4 for PIP: Positional-encoding Image Prior
Viaarxiv icon

Accuracy Improvement of Object Detection in VVC Coded Video Using YOLO-v7 Features

Apr 03, 2023
Takahiro Shindo, Taiju Watanabe, Kein Yamada, Hiroshi Watanabe

Figure 1 for Accuracy Improvement of Object Detection in VVC Coded Video Using YOLO-v7 Features
Figure 2 for Accuracy Improvement of Object Detection in VVC Coded Video Using YOLO-v7 Features
Figure 3 for Accuracy Improvement of Object Detection in VVC Coded Video Using YOLO-v7 Features
Figure 4 for Accuracy Improvement of Object Detection in VVC Coded Video Using YOLO-v7 Features
Viaarxiv icon

What does CLIP know about a red circle? Visual prompt engineering for VLMs

Apr 13, 2023
Aleksandar Shtedritski, Christian Rupprecht, Andrea Vedaldi

Figure 1 for What does CLIP know about a red circle? Visual prompt engineering for VLMs
Figure 2 for What does CLIP know about a red circle? Visual prompt engineering for VLMs
Figure 3 for What does CLIP know about a red circle? Visual prompt engineering for VLMs
Figure 4 for What does CLIP know about a red circle? Visual prompt engineering for VLMs
Viaarxiv icon