Min Zhang

Jake

Knowledge Editing with Dynamic Knowledge Graphs for Multi-hop Question Answering

Dec 18, 2024

Benchmarking and Improving Large Vision-Language Models for Fundamental Visual Graph Understanding and Reasoning

Dec 18, 2024

LinguaLIFT: An Effective Two-stage Instruction Tuning Framework for Low-Resource Language Tasks

Dec 17, 2024

Make Imagination Clearer! Stable Diffusion-based Visual Imagination for Multimodal Machine Translation

Dec 17, 2024

Efficient Speech Command Recognition Leveraging Spiking Neural Network and Curriculum Learning-based Knowledge Distillation

Dec 17, 2024

LLM-based Discriminative Reasoning for Knowledge Graph Question Answering

Dec 17, 2024

DISC: Plug-and-Play Decoding Intervention with Similarity of Characters for Chinese Spelling Check

Dec 17, 2024

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Dec 12, 2024

ZigZagkv: Dynamic KV Cache Compression for Long-context Modeling based on Layer Uncertainty

Dec 12, 2024

Look Before You Leap: Enhancing Attention and Vigilance Regarding Harmful Content with GuidelineLLM

Dec 10, 2024