"Image": models, code, and papers

A Feature Matching Method Based on Multi-Level Refinement Strategy

Feb 25, 2024
Shaojie Zhang, Yinghui Wang, Jiaxing Ma, Wei Li, Jinlong Yang, Tao Yan, Yukai Wang, Liangyi Huang, Mingfeng Wang, Ibragim R. Atadjanov

BIKED++: A Multimodal Dataset of 1.4 Million Bicycle Image and Parametric CAD Designs

Feb 09, 2024
Lyle Regenwetter, Yazan Abu Obaideh, Amin Heyrani Nobari, Faez Ahmed

An Interpretable Evaluation of Entropy-based Novelty of Generative Models

Feb 27, 2024
Jingwei Zhang, Cheuk Ting Li, Farzan Farnia

Scaling Supervised Local Learning with Augmented Auxiliary Networks

Feb 27, 2024
Chenxiang Ma, Jibin Wu, Chenyang Si, Kay Chen Tan

MedContext: Learning Contextual Cues for Efficient Volumetric Medical Segmentation

Feb 27, 2024
Hanan Gani, Muzammal Naseer, Fahad Khan, Salman Khan

ConVQG: Contrastive Visual Question Generation with Multimodal Guidance

Feb 20, 2024
Li Mi, Syrielle Montariol, Javiera Castillo-Navarro, Xianjie Dai, Antoine Bosselut, Devis Tuia

A Comprehensive Survey of Convolutions in Deep Learning: Applications, Challenges, and Future Trends

Feb 23, 2024
Abolfazl Younesi, Mohsen Ansari, MohammadAmin Fazli, Alireza Ejlali, Muhammad Shafique, Jörg Henkel

Knowledge Graphs Meet Multi-Modal Learning: A Comprehensive Survey

Feb 26, 2024
Zhuo Chen, Yichi Zhang, Yin Fang, Yuxia Geng, Lingbing Guo, Xiang Chen, Qian Li, Wen Zhang, Jiaoyan Chen, Yushan Zhu, Jiaqi Li, Xiaoze Liu, Jeff Z. Pan, Ningyu Zhang, Huajun Chen

Mysterious Projections: Multimodal LLMs Gain Domain-Specific Visual Capabilities Without Richer Cross-Modal Projections

Feb 26, 2024
Gaurav Verma, Minje Choi, Kartik Sharma, Jamelle Watson-Daniels, Sejoon Oh, Srijan Kumar

Cross-Modal Contextualized Diffusion Models for Text-Guided Visual Generation and Editing

Feb 26, 2024
Ling Yang, Zhilong Zhang, Zhaochen Yu, Jingwei Liu, Minkai Xu, Stefano Ermon, Bin Cui
