Alert button

"Image": models, code, and papers
Alert button

Aligning Text-to-Image Models using Human Feedback

Add code
Bookmark button
Alert button
Feb 23, 2023
Kimin Lee, Hao Liu, Moonkyung Ryu, Olivia Watkins, Yuqing Du, Craig Boutilier, Pieter Abbeel, Mohammad Ghavamzadeh, Shixiang Shane Gu

Figure 1 for Aligning Text-to-Image Models using Human Feedback
Figure 2 for Aligning Text-to-Image Models using Human Feedback
Figure 3 for Aligning Text-to-Image Models using Human Feedback
Figure 4 for Aligning Text-to-Image Models using Human Feedback
Viaarxiv icon

A Comprehensive Review of Image Line Segment Detection and Description: Taxonomies, Comparisons, and Challenges

Add code
Bookmark button
Alert button
Apr 29, 2023
Xinyu Lin, Yingjie Zhou, Yipeng Liu, Ce Zhu

Figure 1 for A Comprehensive Review of Image Line Segment Detection and Description: Taxonomies, Comparisons, and Challenges
Figure 2 for A Comprehensive Review of Image Line Segment Detection and Description: Taxonomies, Comparisons, and Challenges
Figure 3 for A Comprehensive Review of Image Line Segment Detection and Description: Taxonomies, Comparisons, and Challenges
Figure 4 for A Comprehensive Review of Image Line Segment Detection and Description: Taxonomies, Comparisons, and Challenges
Viaarxiv icon

Make-It-3D: High-Fidelity 3D Creation from A Single Image with Diffusion Prior

Add code
Bookmark button
Alert button
Mar 24, 2023
Junshu Tang, Tengfei Wang, Bo Zhang, Ting Zhang, Ran Yi, Lizhuang Ma, Dong Chen

Figure 1 for Make-It-3D: High-Fidelity 3D Creation from A Single Image with Diffusion Prior
Figure 2 for Make-It-3D: High-Fidelity 3D Creation from A Single Image with Diffusion Prior
Figure 3 for Make-It-3D: High-Fidelity 3D Creation from A Single Image with Diffusion Prior
Figure 4 for Make-It-3D: High-Fidelity 3D Creation from A Single Image with Diffusion Prior
Viaarxiv icon

PFT-SSR: Parallax Fusion Transformer for Stereo Image Super-Resolution

Add code
Bookmark button
Alert button
Mar 24, 2023
Hansheng Guo, Juncheng Li, Guangwei Gao, Zhi Li, Tieyong Zeng

Figure 1 for PFT-SSR: Parallax Fusion Transformer for Stereo Image Super-Resolution
Figure 2 for PFT-SSR: Parallax Fusion Transformer for Stereo Image Super-Resolution
Figure 3 for PFT-SSR: Parallax Fusion Transformer for Stereo Image Super-Resolution
Figure 4 for PFT-SSR: Parallax Fusion Transformer for Stereo Image Super-Resolution
Viaarxiv icon

S-CLIP: Semi-supervised Vision-Language Pre-training using Few Specialist Captions

Add code
Bookmark button
Alert button
May 23, 2023
Sangwoo Mo, Minkyu Kim, Kyungmin Lee, Jinwoo Shin

Figure 1 for S-CLIP: Semi-supervised Vision-Language Pre-training using Few Specialist Captions
Figure 2 for S-CLIP: Semi-supervised Vision-Language Pre-training using Few Specialist Captions
Figure 3 for S-CLIP: Semi-supervised Vision-Language Pre-training using Few Specialist Captions
Figure 4 for S-CLIP: Semi-supervised Vision-Language Pre-training using Few Specialist Captions
Viaarxiv icon

Harnessing the Spatial-Temporal Attention of Diffusion Models for High-Fidelity Text-to-Image Synthesis

Add code
Bookmark button
Alert button
Apr 07, 2023
Qiucheng Wu, Yujian Liu, Handong Zhao, Trung Bui, Zhe Lin, Yang Zhang, Shiyu Chang

Figure 1 for Harnessing the Spatial-Temporal Attention of Diffusion Models for High-Fidelity Text-to-Image Synthesis
Figure 2 for Harnessing the Spatial-Temporal Attention of Diffusion Models for High-Fidelity Text-to-Image Synthesis
Figure 3 for Harnessing the Spatial-Temporal Attention of Diffusion Models for High-Fidelity Text-to-Image Synthesis
Figure 4 for Harnessing the Spatial-Temporal Attention of Diffusion Models for High-Fidelity Text-to-Image Synthesis
Viaarxiv icon

PuMer: Pruning and Merging Tokens for Efficient Vision Language Models

Add code
Bookmark button
Alert button
May 27, 2023
Qingqing Cao, Bhargavi Paranjape, Hannaneh Hajishirzi

Figure 1 for PuMer: Pruning and Merging Tokens for Efficient Vision Language Models
Figure 2 for PuMer: Pruning and Merging Tokens for Efficient Vision Language Models
Figure 3 for PuMer: Pruning and Merging Tokens for Efficient Vision Language Models
Figure 4 for PuMer: Pruning and Merging Tokens for Efficient Vision Language Models
Viaarxiv icon

Make the Most Out of Your Net: Alternating Between Canonical and Hard Datasets for Improved Image Demosaicing

Mar 28, 2023
Yuval Becker, Raz Z. Nossek, Tomer Peleg

Figure 1 for Make the Most Out of Your Net: Alternating Between Canonical and Hard Datasets for Improved Image Demosaicing
Figure 2 for Make the Most Out of Your Net: Alternating Between Canonical and Hard Datasets for Improved Image Demosaicing
Figure 3 for Make the Most Out of Your Net: Alternating Between Canonical and Hard Datasets for Improved Image Demosaicing
Figure 4 for Make the Most Out of Your Net: Alternating Between Canonical and Hard Datasets for Improved Image Demosaicing
Viaarxiv icon

Image-based Indian Sign Language Recognition: A Practical Review using Deep Neural Networks

Apr 28, 2023
Mallikharjuna Rao K, Harleen Kaur, Sanjam Kaur Bedi, M A Lekhana

Figure 1 for Image-based Indian Sign Language Recognition: A Practical Review using Deep Neural Networks
Figure 2 for Image-based Indian Sign Language Recognition: A Practical Review using Deep Neural Networks
Figure 3 for Image-based Indian Sign Language Recognition: A Practical Review using Deep Neural Networks
Figure 4 for Image-based Indian Sign Language Recognition: A Practical Review using Deep Neural Networks
Viaarxiv icon

3D-aware Conditional Image Synthesis

Add code
Bookmark button
Alert button
Feb 16, 2023
Kangle Deng, Gengshan Yang, Deva Ramanan, Jun-Yan Zhu

Figure 1 for 3D-aware Conditional Image Synthesis
Figure 2 for 3D-aware Conditional Image Synthesis
Figure 3 for 3D-aware Conditional Image Synthesis
Figure 4 for 3D-aware Conditional Image Synthesis
Viaarxiv icon