Picture for Yichi Zhang

Yichi Zhang

AI Lab, Netease

MR-MLLM: Mutual Reinforcement of Multimodal Comprehension and Vision Perception

Add code
Jun 22, 2024
Figure 1 for MR-MLLM: Mutual Reinforcement of Multimodal Comprehension and Vision Perception
Figure 2 for MR-MLLM: Mutual Reinforcement of Multimodal Comprehension and Vision Perception
Figure 3 for MR-MLLM: Mutual Reinforcement of Multimodal Comprehension and Vision Perception
Figure 4 for MR-MLLM: Mutual Reinforcement of Multimodal Comprehension and Vision Perception
Viaarxiv icon

On Efficient Neural Network Architectures for Image Compression

Add code
Jun 14, 2024
Figure 1 for On Efficient Neural Network Architectures for Image Compression
Figure 2 for On Efficient Neural Network Architectures for Image Compression
Figure 3 for On Efficient Neural Network Architectures for Image Compression
Figure 4 for On Efficient Neural Network Architectures for Image Compression
Viaarxiv icon

Benchmarking Trustworthiness of Multimodal Large Language Models: A Comprehensive Study

Add code
Jun 11, 2024
Figure 1 for Benchmarking Trustworthiness of Multimodal Large Language Models: A Comprehensive Study
Figure 2 for Benchmarking Trustworthiness of Multimodal Large Language Models: A Comprehensive Study
Figure 3 for Benchmarking Trustworthiness of Multimodal Large Language Models: A Comprehensive Study
Figure 4 for Benchmarking Trustworthiness of Multimodal Large Language Models: A Comprehensive Study
Viaarxiv icon

PyramidKV: Dynamic KV Cache Compression based on Pyramidal Information Funneling

Add code
Jun 04, 2024
Figure 1 for PyramidKV: Dynamic KV Cache Compression based on Pyramidal Information Funneling
Figure 2 for PyramidKV: Dynamic KV Cache Compression based on Pyramidal Information Funneling
Figure 3 for PyramidKV: Dynamic KV Cache Compression based on Pyramidal Information Funneling
Figure 4 for PyramidKV: Dynamic KV Cache Compression based on Pyramidal Information Funneling
Viaarxiv icon

Eliciting Informative Text Evaluations with Large Language Models

Add code
May 28, 2024
Figure 1 for Eliciting Informative Text Evaluations with Large Language Models
Figure 2 for Eliciting Informative Text Evaluations with Large Language Models
Figure 3 for Eliciting Informative Text Evaluations with Large Language Models
Figure 4 for Eliciting Informative Text Evaluations with Large Language Models
Viaarxiv icon

Mixture of Modality Knowledge Experts for Robust Multi-modal Knowledge Graph Completion

Add code
May 27, 2024
Viaarxiv icon

Multi-domain Knowledge Graph Collaborative Pre-training and Prompt Tuning for Diverse Downstream Tasks

Add code
May 21, 2024
Figure 1 for Multi-domain Knowledge Graph Collaborative Pre-training and Prompt Tuning for Diverse Downstream Tasks
Figure 2 for Multi-domain Knowledge Graph Collaborative Pre-training and Prompt Tuning for Diverse Downstream Tasks
Figure 3 for Multi-domain Knowledge Graph Collaborative Pre-training and Prompt Tuning for Diverse Downstream Tasks
Figure 4 for Multi-domain Knowledge Graph Collaborative Pre-training and Prompt Tuning for Diverse Downstream Tasks
Viaarxiv icon

Surveying Attitudinal Alignment Between Large Language Models Vs. Humans Towards 17 Sustainable Development Goals

Add code
Apr 22, 2024
Figure 1 for Surveying Attitudinal Alignment Between Large Language Models Vs. Humans Towards 17 Sustainable Development Goals
Figure 2 for Surveying Attitudinal Alignment Between Large Language Models Vs. Humans Towards 17 Sustainable Development Goals
Viaarxiv icon

Beyond Pixel-Wise Supervision for Medical Image Segmentation: From Traditional Models to Foundation Models

Add code
Apr 20, 2024
Figure 1 for Beyond Pixel-Wise Supervision for Medical Image Segmentation: From Traditional Models to Foundation Models
Figure 2 for Beyond Pixel-Wise Supervision for Medical Image Segmentation: From Traditional Models to Foundation Models
Figure 3 for Beyond Pixel-Wise Supervision for Medical Image Segmentation: From Traditional Models to Foundation Models
Figure 4 for Beyond Pixel-Wise Supervision for Medical Image Segmentation: From Traditional Models to Foundation Models
Viaarxiv icon

Exploring the Transferability of Visual Prompting for Multimodal Large Language Models

Add code
Apr 17, 2024
Figure 1 for Exploring the Transferability of Visual Prompting for Multimodal Large Language Models
Figure 2 for Exploring the Transferability of Visual Prompting for Multimodal Large Language Models
Figure 3 for Exploring the Transferability of Visual Prompting for Multimodal Large Language Models
Figure 4 for Exploring the Transferability of Visual Prompting for Multimodal Large Language Models
Viaarxiv icon