Alert button

"Image": models, code, and papers
Alert button

TouchStone: Evaluating Vision-Language Models by Language Models

Add code
Bookmark button
Alert button
Sep 04, 2023
Shuai Bai, Shusheng Yang, Jinze Bai, Peng Wang, Xingxuan Zhang, Junyang Lin, Xinggang Wang, Chang Zhou, Jingren Zhou

Figure 1 for TouchStone: Evaluating Vision-Language Models by Language Models
Figure 2 for TouchStone: Evaluating Vision-Language Models by Language Models
Figure 3 for TouchStone: Evaluating Vision-Language Models by Language Models
Figure 4 for TouchStone: Evaluating Vision-Language Models by Language Models
Viaarxiv icon

Few-shot Diagnosis of Chest x-rays Using an Ensemble of Random Discriminative Subspaces

Add code
Bookmark button
Alert button
Aug 31, 2023
Kshitiz, Garvit Garg, Angshuman Paul

Figure 1 for Few-shot Diagnosis of Chest x-rays Using an Ensemble of Random Discriminative Subspaces
Figure 2 for Few-shot Diagnosis of Chest x-rays Using an Ensemble of Random Discriminative Subspaces
Figure 3 for Few-shot Diagnosis of Chest x-rays Using an Ensemble of Random Discriminative Subspaces
Figure 4 for Few-shot Diagnosis of Chest x-rays Using an Ensemble of Random Discriminative Subspaces
Viaarxiv icon

MVDream: Multi-view Diffusion for 3D Generation

Add code
Bookmark button
Alert button
Aug 31, 2023
Yichun Shi, Peng Wang, Jianglong Ye, Mai Long, Kejie Li, Xiao Yang

Viaarxiv icon

Michelangelo: Conditional 3D Shape Generation based on Shape-Image-Text Aligned Latent Representation

Add code
Bookmark button
Alert button
Jun 29, 2023
Zibo Zhao, Wen Liu, Xin Chen, Xianfang Zeng, Rui Wang, Pei Cheng, Bin Fu, Tao Chen, Gang Yu, Shenghua Gao

Figure 1 for Michelangelo: Conditional 3D Shape Generation based on Shape-Image-Text Aligned Latent Representation
Figure 2 for Michelangelo: Conditional 3D Shape Generation based on Shape-Image-Text Aligned Latent Representation
Figure 3 for Michelangelo: Conditional 3D Shape Generation based on Shape-Image-Text Aligned Latent Representation
Figure 4 for Michelangelo: Conditional 3D Shape Generation based on Shape-Image-Text Aligned Latent Representation
Viaarxiv icon

Large-scale gradient-based training of Mixtures of Factor Analyzers

Add code
Bookmark button
Alert button
Aug 26, 2023
Alexander Gepperth

Viaarxiv icon

Orientation-Independent Chinese Text Recognition in Scene Images

Add code
Bookmark button
Alert button
Sep 03, 2023
Haiyang Yu, Xiaocong Wang, Bin Li, Xiangyang Xue

Figure 1 for Orientation-Independent Chinese Text Recognition in Scene Images
Figure 2 for Orientation-Independent Chinese Text Recognition in Scene Images
Figure 3 for Orientation-Independent Chinese Text Recognition in Scene Images
Figure 4 for Orientation-Independent Chinese Text Recognition in Scene Images
Viaarxiv icon

BDC-Adapter: Brownian Distance Covariance for Better Vision-Language Reasoning

Sep 03, 2023
Yi Zhang, Ce Zhang, Zihan Liao, Yushun Tang, Zhihai He

Figure 1 for BDC-Adapter: Brownian Distance Covariance for Better Vision-Language Reasoning
Figure 2 for BDC-Adapter: Brownian Distance Covariance for Better Vision-Language Reasoning
Figure 3 for BDC-Adapter: Brownian Distance Covariance for Better Vision-Language Reasoning
Figure 4 for BDC-Adapter: Brownian Distance Covariance for Better Vision-Language Reasoning
Viaarxiv icon

CoTDet: Affordance Knowledge Prompting for Task Driven Object Detection

Add code
Bookmark button
Alert button
Sep 03, 2023
Jiajin Tang, Ge Zheng, Jingyi Yu, Sibei Yang

Figure 1 for CoTDet: Affordance Knowledge Prompting for Task Driven Object Detection
Figure 2 for CoTDet: Affordance Knowledge Prompting for Task Driven Object Detection
Figure 3 for CoTDet: Affordance Knowledge Prompting for Task Driven Object Detection
Figure 4 for CoTDet: Affordance Knowledge Prompting for Task Driven Object Detection
Viaarxiv icon

ArSDM: Colonoscopy Images Synthesis with Adaptive Refinement Semantic Diffusion Models

Add code
Bookmark button
Alert button
Sep 03, 2023
Yuhao Du, Yuncheng Jiang, Shuangyi Tan, Xusheng Wu, Qi Dou, Zhen Li, Guanbin Li, Xiang Wan

Figure 1 for ArSDM: Colonoscopy Images Synthesis with Adaptive Refinement Semantic Diffusion Models
Figure 2 for ArSDM: Colonoscopy Images Synthesis with Adaptive Refinement Semantic Diffusion Models
Figure 3 for ArSDM: Colonoscopy Images Synthesis with Adaptive Refinement Semantic Diffusion Models
Figure 4 for ArSDM: Colonoscopy Images Synthesis with Adaptive Refinement Semantic Diffusion Models
Viaarxiv icon

LoGoPrompt: Synthetic Text Images Can Be Good Visual Prompts for Vision-Language Models

Add code
Bookmark button
Alert button
Sep 03, 2023
Cheng Shi, Sibei Yang

Figure 1 for LoGoPrompt: Synthetic Text Images Can Be Good Visual Prompts for Vision-Language Models
Figure 2 for LoGoPrompt: Synthetic Text Images Can Be Good Visual Prompts for Vision-Language Models
Figure 3 for LoGoPrompt: Synthetic Text Images Can Be Good Visual Prompts for Vision-Language Models
Figure 4 for LoGoPrompt: Synthetic Text Images Can Be Good Visual Prompts for Vision-Language Models
Viaarxiv icon