Alert button

"Information": models, code, and papers
Alert button

GeoDTR+: Toward generic cross-view geolocalization via geometric disentanglement

Aug 18, 2023
Xiaohan Zhang, Xingyu Li, Waqas Sultani, Chen Chen, Safwan Wshah

Figure 1 for GeoDTR+: Toward generic cross-view geolocalization via geometric disentanglement
Figure 2 for GeoDTR+: Toward generic cross-view geolocalization via geometric disentanglement
Figure 3 for GeoDTR+: Toward generic cross-view geolocalization via geometric disentanglement
Figure 4 for GeoDTR+: Toward generic cross-view geolocalization via geometric disentanglement
Viaarxiv icon

ALIP: Adaptive Language-Image Pre-training with Synthetic Caption

Add code
Bookmark button
Alert button
Aug 18, 2023
Kaicheng Yang, Jiankang Deng, Xiang An, Jiawei Li, Ziyong Feng, Jia Guo, Jing Yang, Tongliang Liu

Figure 1 for ALIP: Adaptive Language-Image Pre-training with Synthetic Caption
Figure 2 for ALIP: Adaptive Language-Image Pre-training with Synthetic Caption
Figure 3 for ALIP: Adaptive Language-Image Pre-training with Synthetic Caption
Figure 4 for ALIP: Adaptive Language-Image Pre-training with Synthetic Caption
Viaarxiv icon

Multi-scale Target-Aware Framework for Constrained Image Splicing Detection and Localization

Aug 18, 2023
Yuxuan Tan, Yuanman Li, Limin Zeng, Jiaxiong Ye, Wei wang, Xia Li

Figure 1 for Multi-scale Target-Aware Framework for Constrained Image Splicing Detection and Localization
Figure 2 for Multi-scale Target-Aware Framework for Constrained Image Splicing Detection and Localization
Figure 3 for Multi-scale Target-Aware Framework for Constrained Image Splicing Detection and Localization
Figure 4 for Multi-scale Target-Aware Framework for Constrained Image Splicing Detection and Localization
Viaarxiv icon

PUMGPT: A Large Vision-Language Model for Product Understanding

Aug 18, 2023
Shuhui Wu, Zengming Tang, Zongyi Guo, Weiwei Zhang, Baoliang Cui, Haihong Tang, Weiming Lu

Figure 1 for PUMGPT: A Large Vision-Language Model for Product Understanding
Figure 2 for PUMGPT: A Large Vision-Language Model for Product Understanding
Figure 3 for PUMGPT: A Large Vision-Language Model for Product Understanding
Figure 4 for PUMGPT: A Large Vision-Language Model for Product Understanding
Viaarxiv icon

Bridged-GNN: Knowledge Bridge Learning for Effective Knowledge Transfer

Add code
Bookmark button
Alert button
Aug 18, 2023
Wendong Bi, Xueqi Cheng, Bingbing Xu, Xiaoqian Sun, Li Xu, Huawei Shen

Viaarxiv icon

Physics-Informed Boundary Integral Networks (PIBI-Nets): A Data-Driven Approach for Solving Partial Differential Equations

Aug 18, 2023
Monika Nagy-Huber, Volker Roth

Figure 1 for Physics-Informed Boundary Integral Networks (PIBI-Nets): A Data-Driven Approach for Solving Partial Differential Equations
Figure 2 for Physics-Informed Boundary Integral Networks (PIBI-Nets): A Data-Driven Approach for Solving Partial Differential Equations
Figure 3 for Physics-Informed Boundary Integral Networks (PIBI-Nets): A Data-Driven Approach for Solving Partial Differential Equations
Figure 4 for Physics-Informed Boundary Integral Networks (PIBI-Nets): A Data-Driven Approach for Solving Partial Differential Equations
Viaarxiv icon

T-UNet: Triplet UNet for Change Detection in High-Resolution Remote Sensing Images

Add code
Bookmark button
Alert button
Aug 04, 2023
Huan Zhong, Chen Wu

Figure 1 for T-UNet: Triplet UNet for Change Detection in High-Resolution Remote Sensing Images
Figure 2 for T-UNet: Triplet UNet for Change Detection in High-Resolution Remote Sensing Images
Figure 3 for T-UNet: Triplet UNet for Change Detection in High-Resolution Remote Sensing Images
Figure 4 for T-UNet: Triplet UNet for Change Detection in High-Resolution Remote Sensing Images
Viaarxiv icon

Beyond First Impressions: Integrating Joint Multi-modal Cues for Comprehensive 3D Representation

Add code
Bookmark button
Alert button
Aug 06, 2023
Haowei Wang, Jiji Tang, Jiayi Ji, Xiaoshuai Sun, Rongsheng Zhang, Yiwei Ma, Minda Zhao, Lincheng Li, zeng zhao, Tangjie Lv, Rongrong Ji

Figure 1 for Beyond First Impressions: Integrating Joint Multi-modal Cues for Comprehensive 3D Representation
Figure 2 for Beyond First Impressions: Integrating Joint Multi-modal Cues for Comprehensive 3D Representation
Figure 3 for Beyond First Impressions: Integrating Joint Multi-modal Cues for Comprehensive 3D Representation
Figure 4 for Beyond First Impressions: Integrating Joint Multi-modal Cues for Comprehensive 3D Representation
Viaarxiv icon

Continual Vision-Language Representation Learning with Off-Diagonal Information

Add code
Bookmark button
Alert button
May 17, 2023
Zixuan Ni, Longhui Wei, Siliang Tang, Yueting Zhuang, Qi Tian

Figure 1 for Continual Vision-Language Representation Learning with Off-Diagonal Information
Figure 2 for Continual Vision-Language Representation Learning with Off-Diagonal Information
Figure 3 for Continual Vision-Language Representation Learning with Off-Diagonal Information
Figure 4 for Continual Vision-Language Representation Learning with Off-Diagonal Information
Viaarxiv icon

AST-MHSA : Code Summarization using Multi-Head Self-Attention

Aug 10, 2023
Yeshwanth Nagaraj, Ujjwal Gupta

Figure 1 for AST-MHSA : Code Summarization using Multi-Head Self-Attention
Viaarxiv icon