"Image": models, code, and papers

UniBind: LLM-Augmented Unified and Balanced Representation Space to Bind Them All

Mar 19, 2024
Yuanhuiyi Lyu, Xu Zheng, Jiazhou Zhou, Lin Wang

Via arXiv

Image-Based Dietary Assessment: A Healthy Eating Plate Estimation System

Mar 02, 2024
Assylzhan Izbassar, Pakizar Shamoi

Via arXiv

Predicting Generalization of AI Colonoscopy Models to Unseen Data

Mar 22, 2024
Joel Shor, Carson McNeil, Yotam Intrator, Joseph R Ledsam, Hiro-o Yamano, Daisuke Tsurumaru, Hiroki Kayama, Atsushi Hamabe, Koji Ando, Mitsuhiko Ota, Haruei Ogino, Hiroshi Nakase, Kaho Kobayashi, Masaaki Miyo, Eiji Oki, Ichiro Takemasa, Ehud Rivlin, Roman Goldenberg

Via arXiv

A Dual-domain Regularization Method for Ring Artifact Removal of X-ray CT

Mar 15, 2024
Hongyang Zhu, Xin Lu, Yanwei Qin, Xinran Yu, Tianjiao Sun, Yunsong Zhao

Via arXiv

PLACE: Adaptive Layout-Semantic Fusion for Semantic Image Synthesis

Mar 04, 2024
Zhengyao Lv, Yuxiang Wei, Wangmeng Zuo, Kwan-Yee K. Wong

Via arXiv

LoRA-Composer: Leveraging Low-Rank Adaptation for Multi-Concept Customization in Training-Free Diffusion Models

Mar 18, 2024
Yang Yang, Wen Wang, Liang Peng, Chaotian Song, Yao Chen, Hengjia Li, Xiaolong Yang, Qinglin Lu, Deng Cai, Boxi Wu, Wei Liu

Via arXiv

Modality-Agnostic fMRI Decoding of Vision and Language

Mar 18, 2024
Mitja Nikolaus, Milad Mozafari, Nicholas Asher, Leila Reddy, Rufin VanRullen

Via arXiv

Learning User Embeddings from Human Gaze for Personalised Saliency Prediction

Mar 20, 2024
Florian Strohm, Mihai Bâce, Andreas Bulling

Via arXiv

Retina Vision Transformer (RetinaViT): Introducing Scaled Patches into Vision Transformers

Mar 20, 2024
Yuyang Shu, Michael E. Bain

Via arXiv

Boosting Flow-based Generative Super-Resolution Models via Learned Prior

Mar 16, 2024
Li-Yuan Tsao, Yi-Chen Lo, Chia-Che Chang, Hao-Wei Chen, Roy Tseng, Chien Feng, Chun-Yi Lee

Via arXiv