Alert button

"Image": models, code, and papers
Alert button

Trajectory Consistency Distillation

Add code
Bookmark button
Alert button
Feb 29, 2024
Jianbin Zheng, Minghui Hu, Zhongyi Fan, Chaoyue Wang, Changxing Ding, Dacheng Tao, Tat-Jen Cham

Viaarxiv icon

ProtoP-OD: Explainable Object Detection with Prototypical Parts

Add code
Bookmark button
Alert button
Feb 29, 2024
Pavlos Rath-Manakidis, Frederik Strothmann, Tobias Glasmachers, Laurenz Wiskott

Viaarxiv icon

OpenMEDLab: An Open-source Platform for Multi-modality Foundation Models in Medicine

Add code
Bookmark button
Alert button
Mar 04, 2024
Xiaosong Wang, Xiaofan Zhang, Guotai Wang, Junjun He, Zhongyu Li, Wentao Zhu, Yi Guo, Qi Dou, Xiaoxiao Li, Dequan Wang, Liang Hong, Qicheng Lao, Tong Ruan, Yukun Zhou, Yixue Li, Jie Zhao, Kang Li, Xin Sun, Lifeng Zhu, Shaoting Zhang

Viaarxiv icon

Theoretical Insights for Diffusion Guidance: A Case Study for Gaussian Mixture Models

Mar 03, 2024
Yuchen Wu, Minshuo Chen, Zihao Li, Mengdi Wang, Yuting Wei

Figure 1 for Theoretical Insights for Diffusion Guidance: A Case Study for Gaussian Mixture Models
Figure 2 for Theoretical Insights for Diffusion Guidance: A Case Study for Gaussian Mixture Models
Figure 3 for Theoretical Insights for Diffusion Guidance: A Case Study for Gaussian Mixture Models
Figure 4 for Theoretical Insights for Diffusion Guidance: A Case Study for Gaussian Mixture Models
Viaarxiv icon

Depth Estimation Algorithm Based on Transformer-Encoder and Feature Fusion

Mar 03, 2024
Linhan Xia, Junbang Liu, Tong Wu

Figure 1 for Depth Estimation Algorithm Based on Transformer-Encoder and Feature Fusion
Figure 2 for Depth Estimation Algorithm Based on Transformer-Encoder and Feature Fusion
Figure 3 for Depth Estimation Algorithm Based on Transformer-Encoder and Feature Fusion
Figure 4 for Depth Estimation Algorithm Based on Transformer-Encoder and Feature Fusion
Viaarxiv icon

Demonstrating and Reducing Shortcuts in Vision-Language Representation Learning

Add code
Bookmark button
Alert button
Feb 27, 2024
Maurits Bleeker, Mariya Hendriksen, Andrew Yates, Maarten de Rijke

Viaarxiv icon

LLMBind: A Unified Modality-Task Integration Framework

Add code
Bookmark button
Alert button
Feb 26, 2024
Bin Zhu, Peng Jin, Munan Ning, Bin Lin, Jinfa Huang, Qi Song, Junwu Zhang, Zhenyu Tang, Mingjun Pan, Xing Zhou, Li Yuan

Viaarxiv icon

Referee Can Play: An Alternative Approach to Conditional Generation via Model Inversion

Feb 26, 2024
Xuantong Liu, Tianyang Hu, Wenjia Wang, Kenji Kawaguchi, Yuan Yao

Viaarxiv icon

KnowPhish: Large Language Models Meet Multimodal Knowledge Graphs for Enhancing Reference-Based Phishing Detection

Mar 04, 2024
Yuexin Li, Chengyu Huang, Shumin Deng, Mei Lin Lock, Tri Cao, Nay Oo, Bryan Hooi, Hoon Wei Lim

Figure 1 for KnowPhish: Large Language Models Meet Multimodal Knowledge Graphs for Enhancing Reference-Based Phishing Detection
Figure 2 for KnowPhish: Large Language Models Meet Multimodal Knowledge Graphs for Enhancing Reference-Based Phishing Detection
Figure 3 for KnowPhish: Large Language Models Meet Multimodal Knowledge Graphs for Enhancing Reference-Based Phishing Detection
Figure 4 for KnowPhish: Large Language Models Meet Multimodal Knowledge Graphs for Enhancing Reference-Based Phishing Detection
Viaarxiv icon

Map-aided annotation for pole base detection

Mar 04, 2024
Benjamin Missaoui, Maxime Noizet, Philippe Xu

Figure 1 for Map-aided annotation for pole base detection
Figure 2 for Map-aided annotation for pole base detection
Figure 3 for Map-aided annotation for pole base detection
Figure 4 for Map-aided annotation for pole base detection
Viaarxiv icon