Alert button

"Image": models, code, and papers
Alert button

Bridging Vision and Language from the Video-to-Text Perspective: A Comprehensive Review

Add code
Bookmark button
Alert button
Mar 27, 2021
Jesus Perez-Martin, Benjamin Bustos, Silvio Jamil F. Guimarães, Ivan Sipiran, Jorge Pérez, Grethel Coello Said

Figure 1 for Bridging Vision and Language from the Video-to-Text Perspective: A Comprehensive Review
Figure 2 for Bridging Vision and Language from the Video-to-Text Perspective: A Comprehensive Review
Figure 3 for Bridging Vision and Language from the Video-to-Text Perspective: A Comprehensive Review
Figure 4 for Bridging Vision and Language from the Video-to-Text Perspective: A Comprehensive Review
Viaarxiv icon

Multi-Scale Cascading Network with Compact Feature Learning for RGB-Infrared Person Re-Identification

Dec 12, 2020
Can Zhang, Hong Liu, Wei Guo, Mang Ye

Figure 1 for Multi-Scale Cascading Network with Compact Feature Learning for RGB-Infrared Person Re-Identification
Figure 2 for Multi-Scale Cascading Network with Compact Feature Learning for RGB-Infrared Person Re-Identification
Figure 3 for Multi-Scale Cascading Network with Compact Feature Learning for RGB-Infrared Person Re-Identification
Figure 4 for Multi-Scale Cascading Network with Compact Feature Learning for RGB-Infrared Person Re-Identification
Viaarxiv icon

Real-time image-based instrument classification for laparoscopic surgery

Aug 01, 2018
Sebastian Bodenstedt, Antonia Ohnemus, Darko Katic, Anna-Laura Wekerle, Martin Wagner, Hannes Kenngott, Beat Müller-Stich, Rüdiger Dillmann, Stefanie Speidel

Figure 1 for Real-time image-based instrument classification for laparoscopic surgery
Figure 2 for Real-time image-based instrument classification for laparoscopic surgery
Figure 3 for Real-time image-based instrument classification for laparoscopic surgery
Figure 4 for Real-time image-based instrument classification for laparoscopic surgery
Viaarxiv icon

Improving the Classification of Rare Chords with Unlabeled Data

Add code
Bookmark button
Alert button
Dec 13, 2020
Marcelo Bortolozzo, Rodrigo Schramm, Claudio R. Jung

Figure 1 for Improving the Classification of Rare Chords with Unlabeled Data
Figure 2 for Improving the Classification of Rare Chords with Unlabeled Data
Figure 3 for Improving the Classification of Rare Chords with Unlabeled Data
Figure 4 for Improving the Classification of Rare Chords with Unlabeled Data
Viaarxiv icon

Image-Text Multi-Modal Representation Learning by Adversarial Backpropagation

Dec 26, 2016
Gwangbeen Park, Woobin Im

Figure 1 for Image-Text Multi-Modal Representation Learning by Adversarial Backpropagation
Figure 2 for Image-Text Multi-Modal Representation Learning by Adversarial Backpropagation
Figure 3 for Image-Text Multi-Modal Representation Learning by Adversarial Backpropagation
Figure 4 for Image-Text Multi-Modal Representation Learning by Adversarial Backpropagation
Viaarxiv icon

Camera Distance-aware Top-down Approach for 3D Multi-person Pose Estimation from a Single RGB Image

Add code
Bookmark button
Alert button
Aug 17, 2019
Gyeongsik Moon, Ju Yong Chang, Kyoung Mu Lee

Figure 1 for Camera Distance-aware Top-down Approach for 3D Multi-person Pose Estimation from a Single RGB Image
Figure 2 for Camera Distance-aware Top-down Approach for 3D Multi-person Pose Estimation from a Single RGB Image
Figure 3 for Camera Distance-aware Top-down Approach for 3D Multi-person Pose Estimation from a Single RGB Image
Figure 4 for Camera Distance-aware Top-down Approach for 3D Multi-person Pose Estimation from a Single RGB Image
Viaarxiv icon

An Ultra Lightweight CNN for Low Resource Circuit Component Recognition

Add code
Bookmark button
Alert button
Oct 01, 2020
Yingnan Ju, Yue Chen

Figure 1 for An Ultra Lightweight CNN for Low Resource Circuit Component Recognition
Figure 2 for An Ultra Lightweight CNN for Low Resource Circuit Component Recognition
Figure 3 for An Ultra Lightweight CNN for Low Resource Circuit Component Recognition
Figure 4 for An Ultra Lightweight CNN for Low Resource Circuit Component Recognition
Viaarxiv icon

Anomaly localization by modeling perceptual features

Add code
Bookmark button
Alert button
Aug 12, 2020
David Dehaene, Pierre Eline

Figure 1 for Anomaly localization by modeling perceptual features
Figure 2 for Anomaly localization by modeling perceptual features
Figure 3 for Anomaly localization by modeling perceptual features
Figure 4 for Anomaly localization by modeling perceptual features
Viaarxiv icon

NeRF--: Neural Radiance Fields Without Known Camera Parameters

Feb 19, 2021
Zirui Wang, Shangzhe Wu, Weidi Xie, Min Chen, Victor Adrian Prisacariu

Figure 1 for NeRF--: Neural Radiance Fields Without Known Camera Parameters
Figure 2 for NeRF--: Neural Radiance Fields Without Known Camera Parameters
Figure 3 for NeRF--: Neural Radiance Fields Without Known Camera Parameters
Figure 4 for NeRF--: Neural Radiance Fields Without Known Camera Parameters
Viaarxiv icon

A Novel Deep ML Architecture by Integrating Visual Simultaneous Localization and Mapping (vSLAM) into Mask R-CNN for Real-time Surgical Video Analysis

Apr 09, 2021
Ella Selina Lan

Figure 1 for A Novel Deep ML Architecture by Integrating Visual Simultaneous Localization and Mapping (vSLAM) into Mask R-CNN for Real-time Surgical Video Analysis
Figure 2 for A Novel Deep ML Architecture by Integrating Visual Simultaneous Localization and Mapping (vSLAM) into Mask R-CNN for Real-time Surgical Video Analysis
Figure 3 for A Novel Deep ML Architecture by Integrating Visual Simultaneous Localization and Mapping (vSLAM) into Mask R-CNN for Real-time Surgical Video Analysis
Figure 4 for A Novel Deep ML Architecture by Integrating Visual Simultaneous Localization and Mapping (vSLAM) into Mask R-CNN for Real-time Surgical Video Analysis
Viaarxiv icon