"Image": models, code, and papers

Learning transformer-based heterogeneously salient graph representation for multimodal fusion classification of hyperspectral image and LiDAR data

Nov 17, 2023
Jiaqi Yang, Bo Du, Liangpei Zhang

Lego: Learning to Disentangle and Invert Concepts Beyond Object Appearance in Text-to-Image Diffusion Models

Nov 23, 2023
Saman Motamed, Danda Pani Paudel, Luc Van Gool

IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks

Dec 04, 2023
Jiarui Xu, Yossi Gandelsman, Amir Bar, Jianwei Yang, Jianfeng Gao, Trevor Darrell, Xiaolong Wang

NDELS: A Novel Approach for Nighttime Dehazing, Low-Light Enhancement, and Light Suppression

Dec 11, 2023
Silvano A. Bernabel, Sos S. Agaian

Shape Matters: Detecting Vertebral Fractures Using Differentiable Point-Based Shape Decoding

Dec 08, 2023
Hellena Hempe, Alexander Bigalke, Mattias P. Heinrich

HiFi Tuner: High-Fidelity Subject-Driven Fine-Tuning for Diffusion Models

Nov 30, 2023
Zhonghao Wang, Wei Wei, Yang Zhao, Zhisheng Xiao, Mark Hasegawa-Johnson, Humphrey Shi, Tingbo Hou

Make-A-Storyboard: A General Framework for Storyboard with Disentangled and Merged Control

Dec 06, 2023
Sitong Su, Litao Guo, Lianli Gao, Heng Tao Shen, Jingkuan Song

BenchLMM: Benchmarking Cross-style Visual Capability of Large Multimodal Models

Dec 06, 2023
Rizhao Cai, Zirui Song, Dayan Guan, Zhenhao Chen, Xing Luo, Chenyu Yi, Alex Kot

GCFA: Geodesic Curve Feature Augmentation via Shape Space Theory

Dec 06, 2023
Yuexing Han, Guanxin Wan, Bing Wang

Novel class discovery meets foundation models for 3D semantic segmentation

Dec 06, 2023
Luigi Riz, Cristiano Saltori, Yiming Wang, Elisa Ricci, Fabio Poiesi
