Alert button

"Image": models, code, and papers
Alert button

DiffColor: Toward High Fidelity Text-Guided Image Colorization with Diffusion Models

Aug 03, 2023
Jianxin Lin, Peng Xiao, Yijun Wang, Rongju Zhang, Xiangxiang Zeng

Figure 1 for DiffColor: Toward High Fidelity Text-Guided Image Colorization with Diffusion Models
Figure 2 for DiffColor: Toward High Fidelity Text-Guided Image Colorization with Diffusion Models
Figure 3 for DiffColor: Toward High Fidelity Text-Guided Image Colorization with Diffusion Models
Figure 4 for DiffColor: Toward High Fidelity Text-Guided Image Colorization with Diffusion Models
Viaarxiv icon

Grounded Image Text Matching with Mismatched Relation Reasoning

Aug 02, 2023
Yu Wu, Yana Wei, Haozhe Wang, Yongfei Liu, Sibei Yang, Xuming He

Figure 1 for Grounded Image Text Matching with Mismatched Relation Reasoning
Figure 2 for Grounded Image Text Matching with Mismatched Relation Reasoning
Figure 3 for Grounded Image Text Matching with Mismatched Relation Reasoning
Figure 4 for Grounded Image Text Matching with Mismatched Relation Reasoning
Viaarxiv icon

Event-Driven Imaging in Turbid Media: A Confluence of Optoelectronics and Neuromorphic Computation

Sep 13, 2023
Ning Zhang, Timothy Shea, Arto Nurmikko

Figure 1 for Event-Driven Imaging in Turbid Media: A Confluence of Optoelectronics and Neuromorphic Computation
Figure 2 for Event-Driven Imaging in Turbid Media: A Confluence of Optoelectronics and Neuromorphic Computation
Figure 3 for Event-Driven Imaging in Turbid Media: A Confluence of Optoelectronics and Neuromorphic Computation
Figure 4 for Event-Driven Imaging in Turbid Media: A Confluence of Optoelectronics and Neuromorphic Computation
Viaarxiv icon

Generalized Schrödinger Bridge Matching

Add code
Bookmark button
Alert button
Oct 03, 2023
Guan-Horng Liu, Yaron Lipman, Maximilian Nickel, Brian Karrer, Evangelos A. Theodorou, Ricky T. Q. Chen

Viaarxiv icon

ScaleNet: An Unsupervised Representation Learning Method for Limited Information

Oct 03, 2023
Huili Huang, M. Mahdi Roozbahani

Viaarxiv icon

HallE-Switch: Rethinking and Controlling Object Existence Hallucinations in Large Vision Language Models for Detailed Caption

Oct 03, 2023
Bohan Zhai, Shijia Yang, Xiangchen Zhao, Chenfeng Xu, Sheng Shen, Dongdi Zhao, Kurt Keutzer, Manling Li, Tan Yan, Xiangjun Fan

Figure 1 for HallE-Switch: Rethinking and Controlling Object Existence Hallucinations in Large Vision Language Models for Detailed Caption
Figure 2 for HallE-Switch: Rethinking and Controlling Object Existence Hallucinations in Large Vision Language Models for Detailed Caption
Figure 3 for HallE-Switch: Rethinking and Controlling Object Existence Hallucinations in Large Vision Language Models for Detailed Caption
Figure 4 for HallE-Switch: Rethinking and Controlling Object Existence Hallucinations in Large Vision Language Models for Detailed Caption
Viaarxiv icon

An End-to-end Food Portion Estimation Framework Based on Shape Reconstruction from Monocular Image

Aug 03, 2023
Zeman Shao, Gautham Vinod, Jiangpeng He, Fengqing Zhu

Figure 1 for An End-to-end Food Portion Estimation Framework Based on Shape Reconstruction from Monocular Image
Figure 2 for An End-to-end Food Portion Estimation Framework Based on Shape Reconstruction from Monocular Image
Figure 3 for An End-to-end Food Portion Estimation Framework Based on Shape Reconstruction from Monocular Image
Figure 4 for An End-to-end Food Portion Estimation Framework Based on Shape Reconstruction from Monocular Image
Viaarxiv icon

Weakly Supervised Semantic Segmentation by Knowledge Graph Inference

Add code
Bookmark button
Alert button
Sep 25, 2023
Jia Zhang, Bo Peng, Xi Wu

Figure 1 for Weakly Supervised Semantic Segmentation by Knowledge Graph Inference
Figure 2 for Weakly Supervised Semantic Segmentation by Knowledge Graph Inference
Figure 3 for Weakly Supervised Semantic Segmentation by Knowledge Graph Inference
Figure 4 for Weakly Supervised Semantic Segmentation by Knowledge Graph Inference
Viaarxiv icon

IBCL: Zero-shot Model Generation for Task Trade-offs in Continual Learning

Oct 05, 2023
Pengyuan Lu, Michele Caprio, Eric Eaton, Insup Lee

Viaarxiv icon

Ctrl-Room: Controllable Text-to-3D Room Meshes Generation with Layout Constraints

Oct 05, 2023
Chuan Fang, Xiaotao Hu, Kunming Luo, Ping Tan

Viaarxiv icon