Alert button

"Image": models, code, and papers
Alert button

Task-Aware Low-Rank Adaptation of Segment Anything Model

Mar 16, 2024
Xuehao Wang, Feiyang Ye, Yu Zhang

Figure 1 for Task-Aware Low-Rank Adaptation of Segment Anything Model
Figure 2 for Task-Aware Low-Rank Adaptation of Segment Anything Model
Figure 3 for Task-Aware Low-Rank Adaptation of Segment Anything Model
Figure 4 for Task-Aware Low-Rank Adaptation of Segment Anything Model
Viaarxiv icon

Graph Regularized NMF with L20-norm for Unsupervised Feature Learning

Mar 16, 2024
Zhen Wang, Wenwen Min

Figure 1 for Graph Regularized NMF with L20-norm for Unsupervised Feature Learning
Figure 2 for Graph Regularized NMF with L20-norm for Unsupervised Feature Learning
Figure 3 for Graph Regularized NMF with L20-norm for Unsupervised Feature Learning
Figure 4 for Graph Regularized NMF with L20-norm for Unsupervised Feature Learning
Viaarxiv icon

Generalized Relevance Learning Grassmann Quantization

Mar 14, 2024
M. Mohammadi, M. Babai, M. H. F. Wilkinson

Figure 1 for Generalized Relevance Learning Grassmann Quantization
Figure 2 for Generalized Relevance Learning Grassmann Quantization
Figure 3 for Generalized Relevance Learning Grassmann Quantization
Figure 4 for Generalized Relevance Learning Grassmann Quantization
Viaarxiv icon

VisionGPT-3D: A Generalized Multimodal Agent for Enhanced 3D Vision Understanding

Mar 14, 2024
Chris Kelly, Luhui Hu, Jiayin Hu, Yu Tian, Deshun Yang, Bang Yang, Cindy Yang, Zihao Li, Zaoshan Huang, Yuexian Zou

Figure 1 for VisionGPT-3D: A Generalized Multimodal Agent for Enhanced 3D Vision Understanding
Figure 2 for VisionGPT-3D: A Generalized Multimodal Agent for Enhanced 3D Vision Understanding
Figure 3 for VisionGPT-3D: A Generalized Multimodal Agent for Enhanced 3D Vision Understanding
Figure 4 for VisionGPT-3D: A Generalized Multimodal Agent for Enhanced 3D Vision Understanding
Viaarxiv icon

Griffon v2: Advancing Multimodal Perception with High-Resolution Scaling and Visual-Language Co-Referring

Add code
Bookmark button
Alert button
Mar 14, 2024
Yufei Zhan, Yousong Zhu, Hongyin Zhao, Fan Yang, Ming Tang, Jinqiao Wang

Figure 1 for Griffon v2: Advancing Multimodal Perception with High-Resolution Scaling and Visual-Language Co-Referring
Figure 2 for Griffon v2: Advancing Multimodal Perception with High-Resolution Scaling and Visual-Language Co-Referring
Figure 3 for Griffon v2: Advancing Multimodal Perception with High-Resolution Scaling and Visual-Language Co-Referring
Figure 4 for Griffon v2: Advancing Multimodal Perception with High-Resolution Scaling and Visual-Language Co-Referring
Viaarxiv icon

VDNA-PR: Using General Dataset Representations for Robust Sequential Visual Place Recognition

Mar 14, 2024
Benjamin Ramtoula, Daniele De Martini, Matthew Gadd, Paul Newman

Figure 1 for VDNA-PR: Using General Dataset Representations for Robust Sequential Visual Place Recognition
Figure 2 for VDNA-PR: Using General Dataset Representations for Robust Sequential Visual Place Recognition
Figure 3 for VDNA-PR: Using General Dataset Representations for Robust Sequential Visual Place Recognition
Figure 4 for VDNA-PR: Using General Dataset Representations for Robust Sequential Visual Place Recognition
Viaarxiv icon

RSAM-Seg: A SAM-based Approach with Prior Knowledge Integration for Remote Sensing Image Semantic Segmentation

Add code
Bookmark button
Alert button
Feb 29, 2024
Jie Zhang, Xubing Yang, Rui Jiang, Wei Shao, Li Zhang

Viaarxiv icon

Exploiting Structural Consistency of Chest Anatomy for Unsupervised Anomaly Detection in Radiography Images

Add code
Bookmark button
Alert button
Mar 13, 2024
Tiange Xiang, Yixiao Zhang, Yongyi Lu, Alan Yuille, Chaoyi Zhang, Weidong Cai, Zongwei Zhou

Figure 1 for Exploiting Structural Consistency of Chest Anatomy for Unsupervised Anomaly Detection in Radiography Images
Figure 2 for Exploiting Structural Consistency of Chest Anatomy for Unsupervised Anomaly Detection in Radiography Images
Figure 3 for Exploiting Structural Consistency of Chest Anatomy for Unsupervised Anomaly Detection in Radiography Images
Figure 4 for Exploiting Structural Consistency of Chest Anatomy for Unsupervised Anomaly Detection in Radiography Images
Viaarxiv icon

DuPL: Dual Student with Trustworthy Progressive Learning for Robust Weakly Supervised Semantic Segmentation

Add code
Bookmark button
Alert button
Mar 17, 2024
Yuanchen Wu, Xichen Ye, Kequan Yang, Jide Li, Xiaoqiang Li

Figure 1 for DuPL: Dual Student with Trustworthy Progressive Learning for Robust Weakly Supervised Semantic Segmentation
Figure 2 for DuPL: Dual Student with Trustworthy Progressive Learning for Robust Weakly Supervised Semantic Segmentation
Figure 3 for DuPL: Dual Student with Trustworthy Progressive Learning for Robust Weakly Supervised Semantic Segmentation
Figure 4 for DuPL: Dual Student with Trustworthy Progressive Learning for Robust Weakly Supervised Semantic Segmentation
Viaarxiv icon

Training A Small Emotional Vision Language Model for Visual Art Comprehension

Add code
Bookmark button
Alert button
Mar 17, 2024
Jing Zhang, Liang Zheng, Dan Guo, Meng Wang

Figure 1 for Training A Small Emotional Vision Language Model for Visual Art Comprehension
Figure 2 for Training A Small Emotional Vision Language Model for Visual Art Comprehension
Figure 3 for Training A Small Emotional Vision Language Model for Visual Art Comprehension
Figure 4 for Training A Small Emotional Vision Language Model for Visual Art Comprehension
Viaarxiv icon