Alert button

"Image": models, code, and papers
Alert button

Bilateral Propagation Network for Depth Completion

Add code
Bookmark button
Alert button
Mar 17, 2024
Jie Tang, Fei-Peng Tian, Boshi An, Jian Li, Ping Tan

Figure 1 for Bilateral Propagation Network for Depth Completion
Figure 2 for Bilateral Propagation Network for Depth Completion
Figure 3 for Bilateral Propagation Network for Depth Completion
Figure 4 for Bilateral Propagation Network for Depth Completion
Viaarxiv icon

ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models

Add code
Bookmark button
Alert button
Mar 17, 2024
Siyuan Huang, Iaroslav Ponomarenko, Zhengkai Jiang, Xiaoqi Li, Xiaobin Hu, Peng Gao, Hongsheng Li, Hao Dong

Figure 1 for ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models
Figure 2 for ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models
Figure 3 for ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models
Figure 4 for ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models
Viaarxiv icon

Self-Supervised Video Desmoking for Laparoscopic Surgery

Add code
Bookmark button
Alert button
Mar 17, 2024
Renlong Wu, Zhilu Zhang, Shuohao Zhang, Longfei Gou, Haobin Chen, Lei Zhang, Hao Chen, Wangmeng Zuo

Figure 1 for Self-Supervised Video Desmoking for Laparoscopic Surgery
Figure 2 for Self-Supervised Video Desmoking for Laparoscopic Surgery
Figure 3 for Self-Supervised Video Desmoking for Laparoscopic Surgery
Figure 4 for Self-Supervised Video Desmoking for Laparoscopic Surgery
Viaarxiv icon

Mixture-of-Prompt-Experts for Multi-modal Semantic Understanding

Mar 17, 2024
Zichen Wu, HsiuYuan Huang, Fanyi Qu, Yunfang Wu

Figure 1 for Mixture-of-Prompt-Experts for Multi-modal Semantic Understanding
Figure 2 for Mixture-of-Prompt-Experts for Multi-modal Semantic Understanding
Figure 3 for Mixture-of-Prompt-Experts for Multi-modal Semantic Understanding
Figure 4 for Mixture-of-Prompt-Experts for Multi-modal Semantic Understanding
Viaarxiv icon

RSAM-Seg: A SAM-based Approach with Prior Knowledge Integration for Remote Sensing Image Semantic Segmentation

Add code
Bookmark button
Alert button
Feb 29, 2024
Jie Zhang, Xubing Yang, Rui Jiang, Wei Shao, Li Zhang

Viaarxiv icon

Block and Detail: Scaffolding Sketch-to-Image Generation

Feb 28, 2024
Vishnu Sarukkai, Lu Yuan, Mia Tang, Maneesh Agrawala, Kayvon Fatahalian

Viaarxiv icon

Local positional graphs and attentive local features for a data and runtime-efficient hierarchical place recognition pipeline

Mar 15, 2024
Fangming Yuan, Stefan Schubert, Peter Protzel, Peer Neubert

Figure 1 for Local positional graphs and attentive local features for a data and runtime-efficient hierarchical place recognition pipeline
Figure 2 for Local positional graphs and attentive local features for a data and runtime-efficient hierarchical place recognition pipeline
Figure 3 for Local positional graphs and attentive local features for a data and runtime-efficient hierarchical place recognition pipeline
Figure 4 for Local positional graphs and attentive local features for a data and runtime-efficient hierarchical place recognition pipeline
Viaarxiv icon

Deep Learning for Multi-Level Detection and Localization of Myocardial Scars Based on Regional Strain Validated on Virtual Patients

Mar 15, 2024
Müjde Akdeniz, Claudia Alessandra Manetti, Tijmen Koopsen, Hani Nozari Mirar, Sten Roar Snare, Svein Arne Aase, Joost Lumens, Jurica Šprem, Kristin Sarah McLeod

Figure 1 for Deep Learning for Multi-Level Detection and Localization of Myocardial Scars Based on Regional Strain Validated on Virtual Patients
Figure 2 for Deep Learning for Multi-Level Detection and Localization of Myocardial Scars Based on Regional Strain Validated on Virtual Patients
Figure 3 for Deep Learning for Multi-Level Detection and Localization of Myocardial Scars Based on Regional Strain Validated on Virtual Patients
Figure 4 for Deep Learning for Multi-Level Detection and Localization of Myocardial Scars Based on Regional Strain Validated on Virtual Patients
Viaarxiv icon

Predicting Generalization of AI Colonoscopy Models to Unseen Data

Mar 18, 2024
Joel Shor, Carson McNeil, Yotam Intrator, Joseph R Ledsam, Hiro-o Yamano, Daisuke Tsurumaru, Hiroki Kayama, Atsushi Hamabe, Koji Ando, Mitsuhiko Ota, Haruei Ogino, Hiroshi Nakase, Kaho Kobayashi, Masaaki Miyo, Eiji Oki, Ichiro Takemasa, Ehud Rivlin, Roman Goldenberg

Figure 1 for Predicting Generalization of AI Colonoscopy Models to Unseen Data
Figure 2 for Predicting Generalization of AI Colonoscopy Models to Unseen Data
Figure 3 for Predicting Generalization of AI Colonoscopy Models to Unseen Data
Figure 4 for Predicting Generalization of AI Colonoscopy Models to Unseen Data
Viaarxiv icon

CCC++: Optimized Color Classified Colorization with Segment Anything Model (SAM) Empowered Object Selective Color Harmonization

Mar 18, 2024
Mrityunjoy Gain, Avi Deb Raha, Rameswar Debnath

Figure 1 for CCC++: Optimized Color Classified Colorization with Segment Anything Model (SAM) Empowered Object Selective Color Harmonization
Figure 2 for CCC++: Optimized Color Classified Colorization with Segment Anything Model (SAM) Empowered Object Selective Color Harmonization
Figure 3 for CCC++: Optimized Color Classified Colorization with Segment Anything Model (SAM) Empowered Object Selective Color Harmonization
Figure 4 for CCC++: Optimized Color Classified Colorization with Segment Anything Model (SAM) Empowered Object Selective Color Harmonization
Viaarxiv icon