"Image": models, code, and papers

How Well Does GPT-4V(ision) Adapt to Distribution Shifts? A Preliminary Investigation

Dec 13, 2023
Zhongyi Han, Guanglin Zhou, Rundong He, Jindong Wang, Tailin Wu, Yilong Yin, Salman Khan, Lina Yao, Tongliang Liu, Kun Zhang

Via arXiv

IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks

Dec 04, 2023
Jiarui Xu, Yossi Gandelsman, Amir Bar, Jianwei Yang, Jianfeng Gao, Trevor Darrell, Xiaolong Wang

Via arXiv

Plasticine3D: Non-rigid 3D editting with text guidance

Dec 15, 2023
Yige Chen, Ang Chen, Siyuan Chen, Ran Yi

Via arXiv

Make-A-Storyboard: A General Framework for Storyboard with Disentangled and Merged Control

Dec 06, 2023
Sitong Su, Litao Guo, Lianli Gao, Heng Tao Shen, Jingkuan Song

Via arXiv

Novel class discovery meets foundation models for 3D semantic segmentation

Dec 06, 2023
Luigi Riz, Cristiano Saltori, Yiming Wang, Elisa Ricci, Fabio Poiesi

Via arXiv

GCFA: Geodesic Curve Feature Augmentation via Shape Space Theory

Dec 06, 2023
Yuexing Han, Guanxin Wan, Bing Wang

Via arXiv

FAAC: Facial Animation Generation with Anchor Frame and Conditional Control for Superior Fidelity and Editability

Dec 06, 2023
Linze Li, Sunqi Fan, Hengjun Pu, Zhaodong Bing, Yao Tang, Tianzhu Ye, Tong Yang, Liangyu Chen, Jiajun Liang

Via arXiv

BenchLMM: Benchmarking Cross-style Visual Capability of Large Multimodal Models

Dec 06, 2023
Rizhao Cai, Zirui Song, Dayan Guan, Zhenhao Chen, Xing Luo, Chenyu Yi, Alex Kot

Via arXiv

Shape Matters: Detecting Vertebral Fractures Using Differentiable Point-Based Shape Decoding

Dec 08, 2023
Hellena Hempe, Alexander Bigalke, Mattias P. Heinrich

Via arXiv

An unsupervised approach towards promptable defect segmentation in laser-based additive manufacturing by Segment Anything

Dec 07, 2023
Israt Zarin Era, Imtiaz Ahmed, Zhichao Liu, Srinjoy Das

Via arXiv