"Image": models, code, and papers

CMMMU: A Chinese Massive Multi-discipline Multimodal Understanding Benchmark

Jan 22, 2024
Ge Zhang, Xinrun Du, Bei Chen, Yiming Liang, Tongxu Luo, Tianyu Zheng, Kang Zhu, Yuyang Cheng, Chunpu Xu, Shuyue Guo, Haoran Zhang, Xingwei Qu, Junjie Wang, Ruibin Yuan, Yizhi Li, Zekun Wang, Yudong Liu, Yu-Hsuan Tsai, Fengji Zhang, Chenghua Lin, Wenhao Huang, Wenhu Chen, Jie Fu

Haptic-Assisted Collaborative Robot Framework for Improved Situational Awareness in Skull Base Surgery

Jan 22, 2024
Hisashi Ishida, Manish Sahu, Adnan Munawar, Nimesh Nagururu, Deepa Galaiya, Peter Kazanzides, Francis X. Creighton, Russell H. Taylor

Distribution-aware Interactive Attention Network and Large-scale Cloud Recognition Benchmark on FY-4A Satellite Image

Jan 06, 2024
Jiaqing Zhang, Jie Lei, Weiying Xie, Kai Jiang, Mingxiang Cao, Yunsong Li

ContextMix: A context-aware data augmentation method for industrial visual inspection systems

Jan 18, 2024
Hyungmin Kim, Donghun Kim, Pyunghwan Ahn, Sungho Suh, Hansang Cho, Junmo Kim

CPCL: Cross-Modal Prototypical Contrastive Learning for Weakly Supervised Text-based Person Re-Identification

Jan 18, 2024
Yanwei Zheng, Xinpeng Zhao, Chuanlin Lan, Xiaowei Zhang, Bowen Huang, Jibin Yang, Dongxiao Yu

BreastRegNet: A Deep Learning Framework for Registration of Breast Faxitron and Histopathology Images

Jan 18, 2024
Negar Golestani, Aihui Wang, Gregory R Bean, Mirabela Rusu

CIS-UNet: Multi-Class Segmentation of the Aorta in Computed Tomography Angiography via Context-Aware Shifted Window Self-Attention

Jan 23, 2024
Muhammad Imran, Jonathan R Krebs, Veera Rajasekhar Reddy Gopu, Brian Fazzone, Vishal Balaji Sivaraman, Amarjeet Kumar, Chelsea Viscardi, Robert Evans Heithaus, Benjamin Shickel, Yuyin Zhou, Michol A Cooper, Wei Shao

Deep Learning-based Intraoperative MRI Reconstruction

Jan 23, 2024
Jon André Ottesen, Tryggve Storas, Svein Are Sirirud Vatnehol, Grethe Løvland, Einar O. Vik-Mo, Till Schellhorn, Karoline Skogen, Christopher Larsson, Atle Bjørnerud, Inge Rasmus Groote-Eindbaas, Matthan W. A. Caan

Interpreting Equivariant Representations

Jan 23, 2024
Andreas Abildtrup Hansen, Anna Calissano, Aasa Feragen

The Neglected Tails of Vision-Language Models

Jan 23, 2024
Shubham Parashar, Zhiqiu Lin, Tian Liu, Xiangjue Dong, Yanan Li, Deva Ramanan, James Caverlee, Shu Kong
