Picture for Yichi Zhang

Yichi Zhang

AI Lab, Netease

Unleashing the Potential of SAM2 for Biomedical Images and Videos: A Survey

Add code
Aug 23, 2024
Figure 1 for Unleashing the Potential of SAM2 for Biomedical Images and Videos: A Survey
Viaarxiv icon

Prompt Your Brain: Scaffold Prompt Tuning for Efficient Adaptation of fMRI Pre-trained Model

Add code
Aug 20, 2024
Viaarxiv icon

Timeliness-Fidelity Tradeoff in 3D Scene Representations

Add code
Jul 23, 2024
Viaarxiv icon

MAVIS: Mathematical Visual Instruction Tuning

Add code
Jul 11, 2024
Figure 1 for MAVIS: Mathematical Visual Instruction Tuning
Figure 2 for MAVIS: Mathematical Visual Instruction Tuning
Figure 3 for MAVIS: Mathematical Visual Instruction Tuning
Figure 4 for MAVIS: Mathematical Visual Instruction Tuning
Viaarxiv icon

MMEvalPro: Calibrating Multimodal Benchmarks Towards Trustworthy and Efficient Evaluation

Add code
Jun 29, 2024
Figure 1 for MMEvalPro: Calibrating Multimodal Benchmarks Towards Trustworthy and Efficient Evaluation
Figure 2 for MMEvalPro: Calibrating Multimodal Benchmarks Towards Trustworthy and Efficient Evaluation
Figure 3 for MMEvalPro: Calibrating Multimodal Benchmarks Towards Trustworthy and Efficient Evaluation
Figure 4 for MMEvalPro: Calibrating Multimodal Benchmarks Towards Trustworthy and Efficient Evaluation
Viaarxiv icon

Accelerating Clinical Evidence Synthesis with Large Language Models

Add code
Jun 25, 2024
Viaarxiv icon

MR-MLLM: Mutual Reinforcement of Multimodal Comprehension and Vision Perception

Add code
Jun 22, 2024
Viaarxiv icon

On Efficient Neural Network Architectures for Image Compression

Add code
Jun 14, 2024
Viaarxiv icon

Benchmarking Trustworthiness of Multimodal Large Language Models: A Comprehensive Study

Add code
Jun 11, 2024
Figure 1 for Benchmarking Trustworthiness of Multimodal Large Language Models: A Comprehensive Study
Figure 2 for Benchmarking Trustworthiness of Multimodal Large Language Models: A Comprehensive Study
Figure 3 for Benchmarking Trustworthiness of Multimodal Large Language Models: A Comprehensive Study
Figure 4 for Benchmarking Trustworthiness of Multimodal Large Language Models: A Comprehensive Study
Viaarxiv icon

PyramidKV: Dynamic KV Cache Compression based on Pyramidal Information Funneling

Add code
Jun 04, 2024
Figure 1 for PyramidKV: Dynamic KV Cache Compression based on Pyramidal Information Funneling
Figure 2 for PyramidKV: Dynamic KV Cache Compression based on Pyramidal Information Funneling
Figure 3 for PyramidKV: Dynamic KV Cache Compression based on Pyramidal Information Funneling
Figure 4 for PyramidKV: Dynamic KV Cache Compression based on Pyramidal Information Funneling
Viaarxiv icon