"Image": models, code, and papers

The Right Losses for the Right Gains: Improving the Semantic Consistency of Deep Text-to-Image Generation with Distribution-Sensitive Losses

Dec 18, 2023
Mahmoud Ahmed, Omer Moussa, Ismail Shaheen, Mohamed Abdelfattah, Amr Abdalla, Marwan Eid, Hesham Eraqi, Mohamed Moustafa

Via arXiv

Aligned with LLM: a new multi-modal training paradigm for encoding fMRI activity in visual cortex

Jan 08, 2024
Shuxiao Ma, Linyuan Wang, Senbao Hou, Bin Yan

Via arXiv

A Compact and Semantic Latent Space for Disentangled and Controllable Image Editing

Dec 13, 2023
Gwilherm Lesné, Yann Gousseau, Saïd Ladjal, Alasdair Newson

Via arXiv

Customize-It-3D: High-Quality 3D Creation from A Single Image Using Subject-Specific Knowledge Prior

Dec 15, 2023
Nan Huang, Ting Zhang, Yuhui Yuan, Dong Chen, Shanghang Zhang

Via arXiv

MCANet: Medical Image Segmentation with Multi-Scale Cross-Axis Attention

Dec 20, 2023
Hao Shao, Quansheng Zeng, Qibin Hou, Jufeng Yang

Via arXiv

Bring Metric Functions into Diffusion Models

Jan 04, 2024
Jie An, Zhengyuan Yang, Jianfeng Wang, Linjie Li, Zicheng Liu, Lijuan Wang, Jiebo Luo

Via arXiv

SmartEdit: Exploring Complex Instruction-based Image Editing with Multimodal Large Language Models

Dec 11, 2023
Yuzhou Huang, Liangbin Xie, Xintao Wang, Ziyang Yuan, Xiaodong Cun, Yixiao Ge, Jiantao Zhou, Chao Dong, Rui Huang, Ruimao Zhang, Ying Shan

Via arXiv

FineControlNet: Fine-level Text Control for Image Generation with Spatially Aligned Text Control Injection

Dec 14, 2023
Hongsuk Choi, Isaac Kasahara, Selim Engin, Moritz Graule, Nikhil Chavan-Dafle, Volkan Isler

Via arXiv

Knowledge-Aware Artifact Image Synthesis with LLM-Enhanced Prompting and Multi-Source Supervision

Dec 13, 2023
Shengguang Wu, Zhenglun Chen, Qi Su

Via arXiv

BA-SAM: Scalable Bias-Mode Attention Mask for Segment Anything Model

Jan 04, 2024
Yiran Song, Qianyu Zhou, Xiangtai Li, Deng-Ping Fan, Xuequan Lu, Lizhuang Ma

Via arXiv