Alert button

"Image": models, code, and papers
Alert button

Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles

Add code
Bookmark button
Alert button
Jun 01, 2023
Chaitanya Ryali, Yuan-Ting Hu, Daniel Bolya, Chen Wei, Haoqi Fan, Po-Yao Huang, Vaibhav Aggarwal, Arkabandhu Chowdhury, Omid Poursaeed, Judy Hoffman, Jitendra Malik, Yanghao Li, Christoph Feichtenhofer

Figure 1 for Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles
Figure 2 for Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles
Figure 3 for Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles
Figure 4 for Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles
Viaarxiv icon

Correspondence-Free Domain Alignment for Unsupervised Cross-Domain Image Retrieval

Feb 13, 2023
Xu Wang, Dezhong Peng, Ming Yan, Peng Hu

Figure 1 for Correspondence-Free Domain Alignment for Unsupervised Cross-Domain Image Retrieval
Figure 2 for Correspondence-Free Domain Alignment for Unsupervised Cross-Domain Image Retrieval
Figure 3 for Correspondence-Free Domain Alignment for Unsupervised Cross-Domain Image Retrieval
Figure 4 for Correspondence-Free Domain Alignment for Unsupervised Cross-Domain Image Retrieval
Viaarxiv icon

The Tunnel Effect: Building Data Representations in Deep Neural Networks

May 31, 2023
Wojciech Masarczyk, Mateusz Ostaszewski, Ehsan Imani, Razvan Pascanu, Piotr Miłoś, Tomasz Trzciński

Figure 1 for The Tunnel Effect: Building Data Representations in Deep Neural Networks
Figure 2 for The Tunnel Effect: Building Data Representations in Deep Neural Networks
Figure 3 for The Tunnel Effect: Building Data Representations in Deep Neural Networks
Figure 4 for The Tunnel Effect: Building Data Representations in Deep Neural Networks
Viaarxiv icon

Image to Sphere: Learning Equivariant Features for Efficient Pose Prediction

Add code
Bookmark button
Alert button
Feb 27, 2023
David M. Klee, Ondrej Biza, Robert Platt, Robin Walters

Figure 1 for Image to Sphere: Learning Equivariant Features for Efficient Pose Prediction
Figure 2 for Image to Sphere: Learning Equivariant Features for Efficient Pose Prediction
Figure 3 for Image to Sphere: Learning Equivariant Features for Efficient Pose Prediction
Figure 4 for Image to Sphere: Learning Equivariant Features for Efficient Pose Prediction
Viaarxiv icon

Improving Scene Text Image Super-Resolution via Dual Prior Modulation Network

Add code
Bookmark button
Alert button
Feb 21, 2023
Shipeng Zhu, Zuoyan Zhao, Pengfei Fang, Hui Xue

Figure 1 for Improving Scene Text Image Super-Resolution via Dual Prior Modulation Network
Figure 2 for Improving Scene Text Image Super-Resolution via Dual Prior Modulation Network
Figure 3 for Improving Scene Text Image Super-Resolution via Dual Prior Modulation Network
Figure 4 for Improving Scene Text Image Super-Resolution via Dual Prior Modulation Network
Viaarxiv icon

STAIR: Learning Sparse Text and Image Representation in Grounded Tokens

Feb 08, 2023
Chen Chen, Bowen Zhang, Liangliang Cao, Jiguang Shen, Tom Gunter, Albin Madappally Jose, Alexander Toshev, Jonathon Shlens, Ruoming Pang, Yinfei Yang

Figure 1 for STAIR: Learning Sparse Text and Image Representation in Grounded Tokens
Figure 2 for STAIR: Learning Sparse Text and Image Representation in Grounded Tokens
Figure 3 for STAIR: Learning Sparse Text and Image Representation in Grounded Tokens
Figure 4 for STAIR: Learning Sparse Text and Image Representation in Grounded Tokens
Viaarxiv icon

OPE-SR: Orthogonal Position Encoding for Designing a Parameter-free Upsampling Module in Arbitrary-scale Image Super-Resolution

Mar 02, 2023
Gaochao Song, Luo Zhang, Ran Su, Jianfeng Shi, Ying He, Qian Sun

Figure 1 for OPE-SR: Orthogonal Position Encoding for Designing a Parameter-free Upsampling Module in Arbitrary-scale Image Super-Resolution
Figure 2 for OPE-SR: Orthogonal Position Encoding for Designing a Parameter-free Upsampling Module in Arbitrary-scale Image Super-Resolution
Figure 3 for OPE-SR: Orthogonal Position Encoding for Designing a Parameter-free Upsampling Module in Arbitrary-scale Image Super-Resolution
Figure 4 for OPE-SR: Orthogonal Position Encoding for Designing a Parameter-free Upsampling Module in Arbitrary-scale Image Super-Resolution
Viaarxiv icon

Learning to Imagine: Visually-Augmented Natural Language Generation

Add code
Bookmark button
Alert button
Jun 04, 2023
Tianyi Tang, Yushuo Chen, Yifan Du, Junyi Li, Wayne Xin Zhao, Ji-Rong Wen

Figure 1 for Learning to Imagine: Visually-Augmented Natural Language Generation
Figure 2 for Learning to Imagine: Visually-Augmented Natural Language Generation
Figure 3 for Learning to Imagine: Visually-Augmented Natural Language Generation
Figure 4 for Learning to Imagine: Visually-Augmented Natural Language Generation
Viaarxiv icon

ReliableSwap: Boosting General Face Swapping Via Reliable Supervision

Add code
Bookmark button
Alert button
Jun 08, 2023
Ge Yuan, Maomao Li, Yong Zhang, Huicheng Zheng

Viaarxiv icon

Accurate Gigapixel Crowd Counting by Iterative Zooming and Refinement

Add code
Bookmark button
Alert button
May 16, 2023
Arian Bakhtiarnia, Qi Zhang, Alexandros Iosifidis

Figure 1 for Accurate Gigapixel Crowd Counting by Iterative Zooming and Refinement
Figure 2 for Accurate Gigapixel Crowd Counting by Iterative Zooming and Refinement
Figure 3 for Accurate Gigapixel Crowd Counting by Iterative Zooming and Refinement
Figure 4 for Accurate Gigapixel Crowd Counting by Iterative Zooming and Refinement
Viaarxiv icon