"Image": models, code, and papers

Instance-aware Exploration-Verification-Exploitation for Instance ImageGoal Navigation

Feb 25, 2024
Xiaohan Lei, Min Wang, Wengang Zhou, Li Li, Houqiang Li

Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs

Feb 06, 2024
Ling Yang, Zhaochen Yu, Chenlin Meng, Minkai Xu, Stefano Ermon, Bin Cui

Training-Free Consistent Text-to-Image Generation

Feb 05, 2024
Yoad Tewel, Omri Kaduri, Rinon Gal, Yoni Kasten, Lior Wolf, Gal Chechik, Yuval Atzmon

Enhancing Embodied Object Detection through Language-Image Pre-training and Implicit Object Memory

Feb 06, 2024
Nicolas Harvey Chapman, Feras Dayoub, Will Browne, Chris Lehnert

Trustworthy SR: Resolving Ambiguity in Image Super-resolution via Diffusion Models and Human Feedback

Feb 12, 2024
Cansu Korkmaz, Ege Cirakman, A. Murat Tekalp, Zafer Dogan

A SAM-guided Two-stream Lightweight Model for Anomaly Detection

Feb 29, 2024
Chenghao Li, Lei Qi, Xin Geng

Graph Convolutional Neural Networks for Automated Echocardiography View Recognition: A Holistic Approach

Feb 29, 2024
Sarina Thomas, Cristiana Tiago, Børge Solli Andreassen, Svein-Arne Aase, Jurica Sprem, Erik Steen, Anne Solberg, Guy Ben-Yosef

FusionVision: A comprehensive approach of 3D object reconstruction and segmentation from RGB-D cameras using YOLO and fast segment anything

Feb 29, 2024
Safouane El Ghazouali, Youssef Mhirit, Ali Oukhrid, Umberto Michelucci, Hichem Nouira

Anatomically-Controllable Medical Image Generation with Segmentation-Guided Diffusion Models

Feb 07, 2024
Nicholas Konz, Yuwen Chen, Haoyu Dong, Maciej A. Mazurowski

Tables as Images? Exploring the Strengths and Limitations of LLMs on Multimodal Representations of Tabular Data

Feb 23, 2024
Naihao Deng, Zhenjie Sun, Ruiqi He, Aman Sikka, Yulong Chen, Lin Ma, Yue Zhang, Rada Mihalcea
