Zihao Li

Quantifying Multilingual Performance of Large Language Models Across Languages

Apr 17, 2024
Zihao Li, Yucheng Shi, Zirui Liu, Fan Yang, Ninghao Liu, Mengnan Du

Multi-level Graph Subspace Contrastive Learning for Hyperspectral Image Clustering

Apr 08, 2024
Jingxin Wang, Renxiang Guan, Kainan Gao, Zihao Li, Hao Li, Xianju Li, Chang Tang

S2RC-GCN: A Spatial-Spectral Reliable Contrastive Graph Convolutional Network for Complex Land Cover Classification Using Hyperspectral Images

Apr 01, 2024
Renxiang Guan, Zihao Li, Chujia Song, Guo Yu, Xianju Li, Ruyi Feng

Heterogeneous Contrastive Learning for Foundation Models and Beyond

Mar 30, 2024
Lecheng Zheng, Baoyu Jing, Zihao Li, Hanghang Tong, Jingrui He

Interpretable Machine Learning for Weather and Climate Prediction: A Survey

Mar 24, 2024
Ruyi Yang, Jingyu Hu, Zihao Li, Jianli Mu, Tingzhao Yu, Jiangjiang Xia, Xuhong Li, Aritra Dasgupta, Haoyi Xiong

VisionGPT-3D: A Generalized Multimodal Agent for Enhanced 3D Vision Understanding

Mar 22, 2024
Chris Kelly, Luhui Hu, Jiayin Hu, Yu Tian, Deshun Yang, Bang Yang, Cindy Yang, Zihao Li, Zaoshan Huang, Yuexian Zou

Diffusion Model for Data-Driven Black-Box Optimization

Mar 20, 2024
Zihao Li, Hui Yuan, Kaixuan Huang, Chengzhuo Ni, Yinyu Ye, Minshuo Chen, Mengdi Wang

VisionGPT: Vision-Language Understanding Agent Using Generalized Multimodal Framework

Mar 14, 2024
Chris Kelly, Luhui Hu, Bang Yang, Yu Tian, Deshun Yang, Cindy Yang, Zaoshan Huang, Zihao Li, Jiayin Hu, Yuexian Zou

WorldGPT: A Sora-Inspired Video AI Agent as Rich World Models from Text and Image Inputs

Mar 10, 2024
Deshun Yang, Luhui Hu, Yu Tian, Zihao Li, Chris Kelly, Bang Yang, Cindy Yang, Yuexian Zou
