Alert button
Picture for Kai Zhang

Kai Zhang

Alert button

PathMMU: A Massive Multimodal Expert-Level Benchmark for Understanding and Reasoning in Pathology

Add code
Bookmark button
Alert button
Jan 29, 2024
Yuxuan Sun, Hao Wu, Chenglu Zhu, Sunyi Zheng, Qizi Chen, Kai Zhang, Yunlong Zhang, Xiaoxiao Lan, Mengyue Zheng, Jingxiong Li, Xinheng Lyu, Tao Lin, Lin Yang

Viaarxiv icon

LKFormer: Large Kernel Transformer for Infrared Image Super-Resolution

Add code
Bookmark button
Alert button
Jan 24, 2024
Feiwei Qin, Kang Yan, Changmiao Wang, Ruiquan Ge, Yong Peng, Kai Zhang

Viaarxiv icon

Objects With Lighting: A Real-World Dataset for Evaluating Reconstruction and Rendering for Object Relighting

Add code
Bookmark button
Alert button
Jan 17, 2024
Benjamin Ummenhofer, Sanskar Agrawal, Rene Sepulveda, Yixing Lao, Kai Zhang, Tianhang Cheng, Stephan Richter, Shenlong Wang, German Ros

Viaarxiv icon

CrossDiff: Exploring Self-Supervised Representation of Pansharpening via Cross-Predictive Diffusion Model

Add code
Bookmark button
Alert button
Jan 13, 2024
Yinghui Xing, Litao Qu, Shizhou Zhang, Kai Zhang, Yanning Zhang

Viaarxiv icon

GPT-4V(ision) is a Human-Aligned Evaluator for Text-to-3D Generation

Add code
Bookmark button
Alert button
Jan 09, 2024
Tong Wu, Guandao Yang, Zhibing Li, Kai Zhang, Ziwei Liu, Leonidas Guibas, Dahua Lin, Gordon Wetzstein

Viaarxiv icon

UMIE: Unified Multimodal Information Extraction with Instruction Tuning

Add code
Bookmark button
Alert button
Jan 05, 2024
Lin Sun, Kai Zhang, Qingyuan Li, Renze Lou

Viaarxiv icon

MUFFIN: Curating Multi-Faceted Instructions for Improving Instruction-Following

Add code
Bookmark button
Alert button
Dec 05, 2023
Renze Lou, Kai Zhang, Jian Xie, Yuxuan Sun, Janice Ahn, Hanzi Xu, Yu Su, Wenpeng Yin

Figure 1 for MUFFIN: Curating Multi-Faceted Instructions for Improving Instruction-Following
Figure 2 for MUFFIN: Curating Multi-Faceted Instructions for Improving Instruction-Following
Figure 3 for MUFFIN: Curating Multi-Faceted Instructions for Improving Instruction-Following
Figure 4 for MUFFIN: Curating Multi-Faceted Instructions for Improving Instruction-Following
Viaarxiv icon

Robust Computer Vision in an Ever-Changing World: A Survey of Techniques for Tackling Distribution Shifts

Add code
Bookmark button
Alert button
Dec 03, 2023
Eashan Adhikarla, Kai Zhang, Jun Yu, Lichao Sun, John Nicholson, Brian D. Davison

Figure 1 for Robust Computer Vision in an Ever-Changing World: A Survey of Techniques for Tackling Distribution Shifts
Figure 2 for Robust Computer Vision in an Ever-Changing World: A Survey of Techniques for Tackling Distribution Shifts
Figure 3 for Robust Computer Vision in an Ever-Changing World: A Survey of Techniques for Tackling Distribution Shifts
Figure 4 for Robust Computer Vision in an Ever-Changing World: A Survey of Techniques for Tackling Distribution Shifts
Viaarxiv icon

Multi-scale Iterative Refinement towards Robust and Versatile Molecular Docking

Add code
Bookmark button
Alert button
Nov 30, 2023
Jiaxian Yan, Zaixi Zhang, Kai Zhang, Qi Liu

Figure 1 for Multi-scale Iterative Refinement towards Robust and Versatile Molecular Docking
Figure 2 for Multi-scale Iterative Refinement towards Robust and Versatile Molecular Docking
Figure 3 for Multi-scale Iterative Refinement towards Robust and Versatile Molecular Docking
Figure 4 for Multi-scale Iterative Refinement towards Robust and Versatile Molecular Docking
Viaarxiv icon

MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI

Add code
Bookmark button
Alert button
Nov 27, 2023
Xiang Yue, Yuansheng Ni, Kai Zhang, Tianyu Zheng, Ruoqi Liu, Ge Zhang, Samuel Stevens, Dongfu Jiang, Weiming Ren, Yuxuan Sun, Cong Wei, Botao Yu, Ruibin Yuan, Renliang Sun, Ming Yin, Boyuan Zheng, Zhenzhu Yang, Yibo Liu, Wenhao Huang, Huan Sun, Yu Su, Wenhu Chen

Viaarxiv icon