Alert button

"Image": models, code, and papers
Alert button

CLIP on Wheels: Zero-Shot Object Navigation as Object Localization and Exploration

Mar 20, 2022
Samir Yitzhak Gadre, Mitchell Wortsman, Gabriel Ilharco, Ludwig Schmidt, Shuran Song

Figure 1 for CLIP on Wheels: Zero-Shot Object Navigation as Object Localization and Exploration
Figure 2 for CLIP on Wheels: Zero-Shot Object Navigation as Object Localization and Exploration
Figure 3 for CLIP on Wheels: Zero-Shot Object Navigation as Object Localization and Exploration
Figure 4 for CLIP on Wheels: Zero-Shot Object Navigation as Object Localization and Exploration
Viaarxiv icon

Parallel Training of GRU Networks with a Multi-Grid Solver for Long Sequences

Mar 07, 2022
Gordon Euhyun Moon, Eric C. Cyr

Figure 1 for Parallel Training of GRU Networks with a Multi-Grid Solver for Long Sequences
Figure 2 for Parallel Training of GRU Networks with a Multi-Grid Solver for Long Sequences
Figure 3 for Parallel Training of GRU Networks with a Multi-Grid Solver for Long Sequences
Figure 4 for Parallel Training of GRU Networks with a Multi-Grid Solver for Long Sequences
Viaarxiv icon

Dimensions of Motion: Learning to Predict a Subspace of Optical Flow from a Single Image

Jan 06, 2022
Richard Strong Bowen, Richard Tucker, Ramin Zabih, Noah Snavely

Figure 1 for Dimensions of Motion: Learning to Predict a Subspace of Optical Flow from a Single Image
Figure 2 for Dimensions of Motion: Learning to Predict a Subspace of Optical Flow from a Single Image
Figure 3 for Dimensions of Motion: Learning to Predict a Subspace of Optical Flow from a Single Image
Figure 4 for Dimensions of Motion: Learning to Predict a Subspace of Optical Flow from a Single Image
Viaarxiv icon

A Multi-Scale Time-Frequency Spectrogram Discriminator for GAN-based Non-Autoregressive TTS

Mar 02, 2022
Haohan Guo, Hui Lu, Xixin Wu, Helen Meng

Figure 1 for A Multi-Scale Time-Frequency Spectrogram Discriminator for GAN-based Non-Autoregressive TTS
Figure 2 for A Multi-Scale Time-Frequency Spectrogram Discriminator for GAN-based Non-Autoregressive TTS
Figure 3 for A Multi-Scale Time-Frequency Spectrogram Discriminator for GAN-based Non-Autoregressive TTS
Figure 4 for A Multi-Scale Time-Frequency Spectrogram Discriminator for GAN-based Non-Autoregressive TTS
Viaarxiv icon

Universal adversarial perturbation for remote sensing images

Feb 22, 2022
Zhaoxia Yin, Qingyu Wang, Jin Tang, Bin Luo

Figure 1 for Universal adversarial perturbation for remote sensing images
Figure 2 for Universal adversarial perturbation for remote sensing images
Figure 3 for Universal adversarial perturbation for remote sensing images
Figure 4 for Universal adversarial perturbation for remote sensing images
Viaarxiv icon

HoughCL: Finding Better Positive Pairs in Dense Self-supervised Learning

Nov 21, 2021
Yunsung Lee, Teakgyu Hong, Han-Cheol Cho, Junbum Cha, Seungryong Kim

Figure 1 for HoughCL: Finding Better Positive Pairs in Dense Self-supervised Learning
Figure 2 for HoughCL: Finding Better Positive Pairs in Dense Self-supervised Learning
Figure 3 for HoughCL: Finding Better Positive Pairs in Dense Self-supervised Learning
Figure 4 for HoughCL: Finding Better Positive Pairs in Dense Self-supervised Learning
Viaarxiv icon

Spot the Difference: A Cooperative Object-Referring Game in Non-Perfectly Co-Observable Scene

Mar 16, 2022
Duo Zheng, Fandong Meng, Qingyi Si, Hairun Fan, Zipeng Xu, Jie Zhou, Fangxiang Feng, Xiaojie Wang

Figure 1 for Spot the Difference: A Cooperative Object-Referring Game in Non-Perfectly Co-Observable Scene
Figure 2 for Spot the Difference: A Cooperative Object-Referring Game in Non-Perfectly Co-Observable Scene
Figure 3 for Spot the Difference: A Cooperative Object-Referring Game in Non-Perfectly Co-Observable Scene
Figure 4 for Spot the Difference: A Cooperative Object-Referring Game in Non-Perfectly Co-Observable Scene
Viaarxiv icon

An Efficient Technique for Image Captioning using Deep Neural Network

Sep 05, 2020
Borneel Bikash Phukan, Amiya Ranjan Panda

Figure 1 for An Efficient Technique for Image Captioning using Deep Neural Network
Figure 2 for An Efficient Technique for Image Captioning using Deep Neural Network
Figure 3 for An Efficient Technique for Image Captioning using Deep Neural Network
Figure 4 for An Efficient Technique for Image Captioning using Deep Neural Network
Viaarxiv icon

Blind Image Quality Assessment for MRI with A Deep Three-dimensional content-adaptive Hyper-Network

Jul 13, 2021
Kehan Qi, Haoran Li, Chuyu Rong, Yu Gong, Cheng Li, Hairong Zheng, Shanshan Wang

Figure 1 for Blind Image Quality Assessment for MRI with A Deep Three-dimensional content-adaptive Hyper-Network
Figure 2 for Blind Image Quality Assessment for MRI with A Deep Three-dimensional content-adaptive Hyper-Network
Figure 3 for Blind Image Quality Assessment for MRI with A Deep Three-dimensional content-adaptive Hyper-Network
Figure 4 for Blind Image Quality Assessment for MRI with A Deep Three-dimensional content-adaptive Hyper-Network
Viaarxiv icon

Comparing radiologists' gaze and saliency maps generated by interpretability methods for chest x-rays

Dec 22, 2021
Ricardo Bigolin Lanfredi, Ambuj Arora, Trafton Drew, Joyce D. Schroeder, Tolga Tasdizen

Figure 1 for Comparing radiologists' gaze and saliency maps generated by interpretability methods for chest x-rays
Figure 2 for Comparing radiologists' gaze and saliency maps generated by interpretability methods for chest x-rays
Figure 3 for Comparing radiologists' gaze and saliency maps generated by interpretability methods for chest x-rays
Figure 4 for Comparing radiologists' gaze and saliency maps generated by interpretability methods for chest x-rays
Viaarxiv icon