Alert button

"Image": models, code, and papers
Alert button

SkyGPT: Probabilistic Short-term Solar Forecasting Using Synthetic Sky Videos from Physics-constrained VideoGPT

Add code
Bookmark button
Alert button
Jun 20, 2023
Yuhao Nie, Eric Zelikman, Andea Scott, Quentin Paletta, Adam Brandt

Viaarxiv icon

VisText: A Benchmark for Semantically Rich Chart Captioning

Add code
Bookmark button
Alert button
Jun 28, 2023
Benny J. Tang, Angie Boggust, Arvind Satyanarayan

Figure 1 for VisText: A Benchmark for Semantically Rich Chart Captioning
Figure 2 for VisText: A Benchmark for Semantically Rich Chart Captioning
Figure 3 for VisText: A Benchmark for Semantically Rich Chart Captioning
Figure 4 for VisText: A Benchmark for Semantically Rich Chart Captioning
Viaarxiv icon

Theater Aid System for the Visually Impaired Through Transfer Learning of Spatio-Temporal Graph Convolution Networks

Jun 28, 2023
Leyla Benhamida, Slimane Larabi

Figure 1 for Theater Aid System for the Visually Impaired Through Transfer Learning of Spatio-Temporal Graph Convolution Networks
Figure 2 for Theater Aid System for the Visually Impaired Through Transfer Learning of Spatio-Temporal Graph Convolution Networks
Figure 3 for Theater Aid System for the Visually Impaired Through Transfer Learning of Spatio-Temporal Graph Convolution Networks
Figure 4 for Theater Aid System for the Visually Impaired Through Transfer Learning of Spatio-Temporal Graph Convolution Networks
Viaarxiv icon

RQAT-INR: Improved Implicit Neural Image Compression

Mar 06, 2023
Bharath Bhushan Damodaran, Muhammet Balcilar, Franck Galpin, Pierre Hellier

Figure 1 for RQAT-INR: Improved Implicit Neural Image Compression
Figure 2 for RQAT-INR: Improved Implicit Neural Image Compression
Figure 3 for RQAT-INR: Improved Implicit Neural Image Compression
Figure 4 for RQAT-INR: Improved Implicit Neural Image Compression
Viaarxiv icon

Global and Local Semantic Completion Learning for Vision-Language Pre-training

Add code
Bookmark button
Alert button
Jun 12, 2023
Rong-Cheng Tu, Yatai Ji, Jie Jiang, Weijie Kong, Chengfei Cai, Wenzhe Zhao, Hongfa Wang, Yujiu Yang, Wei Liu

Figure 1 for Global and Local Semantic Completion Learning for Vision-Language Pre-training
Figure 2 for Global and Local Semantic Completion Learning for Vision-Language Pre-training
Figure 3 for Global and Local Semantic Completion Learning for Vision-Language Pre-training
Figure 4 for Global and Local Semantic Completion Learning for Vision-Language Pre-training
Viaarxiv icon

Ablating Concepts in Text-to-Image Diffusion Models

Add code
Bookmark button
Alert button
Mar 23, 2023
Nupur Kumari, Bingliang Zhang, Sheng-Yu Wang, Eli Shechtman, Richard Zhang, Jun-Yan Zhu

Figure 1 for Ablating Concepts in Text-to-Image Diffusion Models
Figure 2 for Ablating Concepts in Text-to-Image Diffusion Models
Figure 3 for Ablating Concepts in Text-to-Image Diffusion Models
Figure 4 for Ablating Concepts in Text-to-Image Diffusion Models
Viaarxiv icon

NeuroCLIP: Neuromorphic Data Understanding by CLIP and SNN

Add code
Bookmark button
Alert button
Jun 21, 2023
Yufei Guo, Yuanpei Chen

Figure 1 for NeuroCLIP: Neuromorphic Data Understanding by CLIP and SNN
Figure 2 for NeuroCLIP: Neuromorphic Data Understanding by CLIP and SNN
Viaarxiv icon

Instruct-Video2Avatar: Video-to-Avatar Generation with Instructions

Add code
Bookmark button
Alert button
Jun 05, 2023
Shaoxu Li

Figure 1 for Instruct-Video2Avatar: Video-to-Avatar Generation with Instructions
Figure 2 for Instruct-Video2Avatar: Video-to-Avatar Generation with Instructions
Figure 3 for Instruct-Video2Avatar: Video-to-Avatar Generation with Instructions
Figure 4 for Instruct-Video2Avatar: Video-to-Avatar Generation with Instructions
Viaarxiv icon

StrucTexTv2: Masked Visual-Textual Prediction for Document Image Pre-training

Add code
Bookmark button
Alert button
Mar 01, 2023
Yuechen Yu, Yulin Li, Chengquan Zhang, Xiaoqiang Zhang, Zengyuan Guo, Xiameng Qin, Kun Yao, Junyu Han, Errui Ding, Jingdong Wang

Figure 1 for StrucTexTv2: Masked Visual-Textual Prediction for Document Image Pre-training
Figure 2 for StrucTexTv2: Masked Visual-Textual Prediction for Document Image Pre-training
Figure 3 for StrucTexTv2: Masked Visual-Textual Prediction for Document Image Pre-training
Figure 4 for StrucTexTv2: Masked Visual-Textual Prediction for Document Image Pre-training
Viaarxiv icon

Highly Personalized Text Embedding for Image Manipulation by Stable Diffusion

Add code
Bookmark button
Alert button
Mar 15, 2023
Inhwa Han, Serin Yang, Taesung Kwon, Jong Chul Ye

Figure 1 for Highly Personalized Text Embedding for Image Manipulation by Stable Diffusion
Figure 2 for Highly Personalized Text Embedding for Image Manipulation by Stable Diffusion
Figure 3 for Highly Personalized Text Embedding for Image Manipulation by Stable Diffusion
Figure 4 for Highly Personalized Text Embedding for Image Manipulation by Stable Diffusion
Viaarxiv icon