Picture for Kejia Zhang

Kejia Zhang

When Tokens Talk Too Much: A Survey of Multimodal Long-Context Token Compression across Images, Videos, and Audios

Add code
Jul 27, 2025
Viaarxiv icon

Poison as Cure: Visual Noise for Mitigating Object Hallucinations in LVMs

Add code
Jan 31, 2025
Figure 1 for Poison as Cure: Visual Noise for Mitigating Object Hallucinations in LVMs
Figure 2 for Poison as Cure: Visual Noise for Mitigating Object Hallucinations in LVMs
Figure 3 for Poison as Cure: Visual Noise for Mitigating Object Hallucinations in LVMs
Figure 4 for Poison as Cure: Visual Noise for Mitigating Object Hallucinations in LVMs
Viaarxiv icon

ZhoBLiMP: a Systematic Assessment of Language Models with Linguistic Minimal Pairs in Chinese

Add code
Nov 09, 2024
Figure 1 for ZhoBLiMP: a Systematic Assessment of Language Models with Linguistic Minimal Pairs in Chinese
Figure 2 for ZhoBLiMP: a Systematic Assessment of Language Models with Linguistic Minimal Pairs in Chinese
Figure 3 for ZhoBLiMP: a Systematic Assessment of Language Models with Linguistic Minimal Pairs in Chinese
Figure 4 for ZhoBLiMP: a Systematic Assessment of Language Models with Linguistic Minimal Pairs in Chinese
Viaarxiv icon

Towards Adversarial Robustness via Debiased High-Confidence Logit Alignment

Add code
Aug 12, 2024
Figure 1 for Towards Adversarial Robustness via Debiased High-Confidence Logit Alignment
Figure 2 for Towards Adversarial Robustness via Debiased High-Confidence Logit Alignment
Figure 3 for Towards Adversarial Robustness via Debiased High-Confidence Logit Alignment
Figure 4 for Towards Adversarial Robustness via Debiased High-Confidence Logit Alignment
Viaarxiv icon

A Reference-free Metric for Language-Queried Audio Source Separation using Contrastive Language-Audio Pretraining

Add code
Jul 06, 2024
Figure 1 for A Reference-free Metric for Language-Queried Audio Source Separation using Contrastive Language-Audio Pretraining
Figure 2 for A Reference-free Metric for Language-Queried Audio Source Separation using Contrastive Language-Audio Pretraining
Figure 3 for A Reference-free Metric for Language-Queried Audio Source Separation using Contrastive Language-Audio Pretraining
Figure 4 for A Reference-free Metric for Language-Queried Audio Source Separation using Contrastive Language-Audio Pretraining
Viaarxiv icon

Mitigating Low-Frequency Bias: Feature Recalibration and Frequency Attention Regularization for Adversarial Robustness

Add code
Jul 04, 2024
Viaarxiv icon

Harmonizing Feature Maps: A Graph Convolutional Approach for Enhancing Adversarial Robustness

Add code
Jun 17, 2024
Figure 1 for Harmonizing Feature Maps: A Graph Convolutional Approach for Enhancing Adversarial Robustness
Figure 2 for Harmonizing Feature Maps: A Graph Convolutional Approach for Enhancing Adversarial Robustness
Figure 3 for Harmonizing Feature Maps: A Graph Convolutional Approach for Enhancing Adversarial Robustness
Figure 4 for Harmonizing Feature Maps: A Graph Convolutional Approach for Enhancing Adversarial Robustness
Viaarxiv icon

CTS: A Consistency-Based Medical Image Segmentation Model

Add code
May 15, 2024
Viaarxiv icon

MambaDFuse: A Mamba-based Dual-phase Model for Multi-modality Image Fusion

Add code
Apr 12, 2024
Figure 1 for MambaDFuse: A Mamba-based Dual-phase Model for Multi-modality Image Fusion
Figure 2 for MambaDFuse: A Mamba-based Dual-phase Model for Multi-modality Image Fusion
Figure 3 for MambaDFuse: A Mamba-based Dual-phase Model for Multi-modality Image Fusion
Figure 4 for MambaDFuse: A Mamba-based Dual-phase Model for Multi-modality Image Fusion
Viaarxiv icon

Synth-AC: Enhancing Audio Captioning with Synthetic Supervision

Add code
Sep 18, 2023
Viaarxiv icon