Alert button

"Image": models, code, and papers
Alert button

The Five-Dollar Model: Generating Game Maps and Sprites from Sentence Embeddings

Aug 08, 2023
Timothy Merino, Roman Negri, Dipika Rajesh, M Charity, Julian Togelius

Viaarxiv icon

Making the V in Text-VQA Matter

Aug 01, 2023
Shamanthak Hegde, Soumya Jahagirdar, Shankar Gangisetty

Figure 1 for Making the V in Text-VQA Matter
Figure 2 for Making the V in Text-VQA Matter
Figure 3 for Making the V in Text-VQA Matter
Figure 4 for Making the V in Text-VQA Matter
Viaarxiv icon

S2R: Exploring a Double-Win Transformer-Based Framework for Ideal and Blind Super-Resolution

Add code
Bookmark button
Alert button
Aug 16, 2023
Minghao She, Wendong Mao, Huihong Shi, Zhongfeng Wang

Figure 1 for S2R: Exploring a Double-Win Transformer-Based Framework for Ideal and Blind Super-Resolution
Figure 2 for S2R: Exploring a Double-Win Transformer-Based Framework for Ideal and Blind Super-Resolution
Figure 3 for S2R: Exploring a Double-Win Transformer-Based Framework for Ideal and Blind Super-Resolution
Figure 4 for S2R: Exploring a Double-Win Transformer-Based Framework for Ideal and Blind Super-Resolution
Viaarxiv icon

Robotic Scene Segmentation with Memory Network for Runtime Surgical Context Inference

Add code
Bookmark button
Alert button
Aug 24, 2023
Zongyu Li, Ian Reyes, Homa Alemzadeh

Figure 1 for Robotic Scene Segmentation with Memory Network for Runtime Surgical Context Inference
Figure 2 for Robotic Scene Segmentation with Memory Network for Runtime Surgical Context Inference
Figure 3 for Robotic Scene Segmentation with Memory Network for Runtime Surgical Context Inference
Figure 4 for Robotic Scene Segmentation with Memory Network for Runtime Surgical Context Inference
Viaarxiv icon

WS-SfMLearner: Self-supervised Monocular Depth and Ego-motion Estimation on Surgical Videos with Unknown Camera Parameters

Aug 22, 2023
Ange Lou, Jack Noble

Figure 1 for WS-SfMLearner: Self-supervised Monocular Depth and Ego-motion Estimation on Surgical Videos with Unknown Camera Parameters
Figure 2 for WS-SfMLearner: Self-supervised Monocular Depth and Ego-motion Estimation on Surgical Videos with Unknown Camera Parameters
Figure 3 for WS-SfMLearner: Self-supervised Monocular Depth and Ego-motion Estimation on Surgical Videos with Unknown Camera Parameters
Figure 4 for WS-SfMLearner: Self-supervised Monocular Depth and Ego-motion Estimation on Surgical Videos with Unknown Camera Parameters
Viaarxiv icon

Coarse-to-Fine Multi-Scene Pose Regression with Transformers

Add code
Bookmark button
Alert button
Aug 22, 2023
Yoli Shavit, Ron Ferens, Yosi Keller

Figure 1 for Coarse-to-Fine Multi-Scene Pose Regression with Transformers
Figure 2 for Coarse-to-Fine Multi-Scene Pose Regression with Transformers
Figure 3 for Coarse-to-Fine Multi-Scene Pose Regression with Transformers
Figure 4 for Coarse-to-Fine Multi-Scene Pose Regression with Transformers
Viaarxiv icon

WMFormer++: Nested Transformer for Visible Watermark Removal via Implict Joint Learning

Aug 22, 2023
Dongjian Huo, Zehong Zhang, Hanjing Su, Guanbin Li, Chaowei Fang, Qingyao Wu

Figure 1 for WMFormer++: Nested Transformer for Visible Watermark Removal via Implict Joint Learning
Figure 2 for WMFormer++: Nested Transformer for Visible Watermark Removal via Implict Joint Learning
Figure 3 for WMFormer++: Nested Transformer for Visible Watermark Removal via Implict Joint Learning
Figure 4 for WMFormer++: Nested Transformer for Visible Watermark Removal via Implict Joint Learning
Viaarxiv icon

Three Towers: Flexible Contrastive Learning with Pretrained Image Models

Add code
Bookmark button
Alert button
May 29, 2023
Jannik Kossen, Mark Collier, Basil Mustafa, Xiao Wang, Xiaohua Zhai, Lucas Beyer, Andreas Steiner, Jesse Berent, Rodolphe Jenatton, Efi Kokiopoulou

Figure 1 for Three Towers: Flexible Contrastive Learning with Pretrained Image Models
Figure 2 for Three Towers: Flexible Contrastive Learning with Pretrained Image Models
Figure 3 for Three Towers: Flexible Contrastive Learning with Pretrained Image Models
Figure 4 for Three Towers: Flexible Contrastive Learning with Pretrained Image Models
Viaarxiv icon

Self-Supervised Learning for Endoscopic Video Analysis

Add code
Bookmark button
Alert button
Aug 23, 2023
Roy Hirsch, Mathilde Caron, Regev Cohen, Amir Livne, Ron Shapiro, Tomer Golany, Roman Goldenberg, Daniel Freedman, Ehud Rivlin

Figure 1 for Self-Supervised Learning for Endoscopic Video Analysis
Figure 2 for Self-Supervised Learning for Endoscopic Video Analysis
Figure 3 for Self-Supervised Learning for Endoscopic Video Analysis
Figure 4 for Self-Supervised Learning for Endoscopic Video Analysis
Viaarxiv icon

Anisotropic Hybrid Networks for liver tumor segmentation with uncertainty quantification

Aug 23, 2023
Benjamin Lambert, Pauline Roca, Florence Forbes, Senan Doyle, Michel Dojat

Figure 1 for Anisotropic Hybrid Networks for liver tumor segmentation with uncertainty quantification
Figure 2 for Anisotropic Hybrid Networks for liver tumor segmentation with uncertainty quantification
Figure 3 for Anisotropic Hybrid Networks for liver tumor segmentation with uncertainty quantification
Figure 4 for Anisotropic Hybrid Networks for liver tumor segmentation with uncertainty quantification
Viaarxiv icon