Picture for Mi Zhang

Mi Zhang

SVD-LLM V2: Optimizing Singular Value Truncation for Large Language Model Compression

Add code
Mar 16, 2025
Figure 1 for SVD-LLM V2: Optimizing Singular Value Truncation for Large Language Model Compression
Figure 2 for SVD-LLM V2: Optimizing Singular Value Truncation for Large Language Model Compression
Figure 3 for SVD-LLM V2: Optimizing Singular Value Truncation for Large Language Model Compression
Figure 4 for SVD-LLM V2: Optimizing Singular Value Truncation for Large Language Model Compression
Viaarxiv icon

Revisiting Backdoor Attacks on Time Series Classification in the Frequency Domain

Add code
Mar 12, 2025
Viaarxiv icon

MEDA: Dynamic KV Cache Allocation for Efficient Multimodal Long-Context Inference

Add code
Feb 24, 2025
Figure 1 for MEDA: Dynamic KV Cache Allocation for Efficient Multimodal Long-Context Inference
Figure 2 for MEDA: Dynamic KV Cache Allocation for Efficient Multimodal Long-Context Inference
Figure 3 for MEDA: Dynamic KV Cache Allocation for Efficient Multimodal Long-Context Inference
Figure 4 for MEDA: Dynamic KV Cache Allocation for Efficient Multimodal Long-Context Inference
Viaarxiv icon

Mirage in the Eyes: Hallucination Attack on Multi-modal Large Language Models with Only Attention Sink

Add code
Jan 25, 2025
Figure 1 for Mirage in the Eyes: Hallucination Attack on Multi-modal Large Language Models with Only Attention Sink
Figure 2 for Mirage in the Eyes: Hallucination Attack on Multi-modal Large Language Models with Only Attention Sink
Figure 3 for Mirage in the Eyes: Hallucination Attack on Multi-modal Large Language Models with Only Attention Sink
Figure 4 for Mirage in the Eyes: Hallucination Attack on Multi-modal Large Language Models with Only Attention Sink
Viaarxiv icon

Safe Text-to-Image Generation: Simply Sanitize the Prompt Embedding

Add code
Nov 15, 2024
Figure 1 for Safe Text-to-Image Generation: Simply Sanitize the Prompt Embedding
Figure 2 for Safe Text-to-Image Generation: Simply Sanitize the Prompt Embedding
Figure 3 for Safe Text-to-Image Generation: Simply Sanitize the Prompt Embedding
Figure 4 for Safe Text-to-Image Generation: Simply Sanitize the Prompt Embedding
Viaarxiv icon

Autoregressive Models in Vision: A Survey

Add code
Nov 08, 2024
Figure 1 for Autoregressive Models in Vision: A Survey
Figure 2 for Autoregressive Models in Vision: A Survey
Figure 3 for Autoregressive Models in Vision: A Survey
Figure 4 for Autoregressive Models in Vision: A Survey
Viaarxiv icon

Artificial Intelligence of Things: A Survey

Add code
Oct 25, 2024
Figure 1 for Artificial Intelligence of Things: A Survey
Figure 2 for Artificial Intelligence of Things: A Survey
Figure 3 for Artificial Intelligence of Things: A Survey
Figure 4 for Artificial Intelligence of Things: A Survey
Viaarxiv icon

Famba-V: Fast Vision Mamba with Cross-Layer Token Fusion

Add code
Sep 15, 2024
Figure 1 for Famba-V: Fast Vision Mamba with Cross-Layer Token Fusion
Figure 2 for Famba-V: Fast Vision Mamba with Cross-Layer Token Fusion
Figure 3 for Famba-V: Fast Vision Mamba with Cross-Layer Token Fusion
Figure 4 for Famba-V: Fast Vision Mamba with Cross-Layer Token Fusion
Viaarxiv icon

D2O:Dynamic Discriminative Operations for Efficient Generative Inference of Large Language Models

Add code
Jun 18, 2024
Figure 1 for D2O:Dynamic Discriminative Operations for Efficient Generative Inference of Large Language Models
Figure 2 for D2O:Dynamic Discriminative Operations for Efficient Generative Inference of Large Language Models
Figure 3 for D2O:Dynamic Discriminative Operations for Efficient Generative Inference of Large Language Models
Figure 4 for D2O:Dynamic Discriminative Operations for Efficient Generative Inference of Large Language Models
Viaarxiv icon

Creating a Lens of Chinese Culture: A Multimodal Dataset for Chinese Pun Rebus Art Understanding

Add code
Jun 14, 2024
Viaarxiv icon