Lei Kang

An Effective Data Augmentation Method by Asking Questions about Scene Text Images

Mar 03, 2026
Via arXiv

Diffusion-Based Low-Light Image Enhancement with Color and Luminance Priors

Feb 27, 2026
Via arXiv

AVIR: Adaptive Visual In-Document Retrieval for Efficient Multi-Page Document Question Answering

Jan 17, 2026
Via arXiv

LLM-Driven Medical Document Analysis: Enhancing Trustworthy Pathology and Differential Diagnosis

Jun 24, 2025
Via arXiv

xLSTM-ECG: Multi-label ECG Classification via Feature Fusion with xLSTM

Apr 14, 2025
Via arXiv

Mixture of Group Experts for Learning Invariant Representations

Apr 12, 2025
Via arXiv

Preserving Privacy Without Compromising Accuracy: Machine Unlearning for Handwritten Text Recognition

Apr 11, 2025
Via arXiv

NeurIPS 2023 Competition: Privacy Preserving Federated Learning Document VQA

Nov 06, 2024
Via arXiv

GRIF-DM: Generation of Rich Impression Fonts using Diffusion Models

Aug 14, 2024
Via arXiv

Multi-Page Document Visual Question Answering using Self-Attention Scoring Mechanism

Apr 29, 2024
Via arXiv