Picture for Heuiseok Lim

Heuiseok Lim

MultiDocFusion: Hierarchical and Multimodal Chunking Pipeline for Enhanced RAG on Long Industrial Documents

Add code
Apr 14, 2026
Viaarxiv icon

Evidential Transformation Network: Turning Pretrained Models into Evidential Models for Post-hoc Uncertainty Estimation

Add code
Apr 09, 2026
Viaarxiv icon

Revise: A Framework for Revising OCRed text in Practical Information Systems with Data Contamination Strategy

Add code
Apr 09, 2026
Viaarxiv icon

Towards Privacy-Preserving Large Language Model: Text-free Inference Through Alignment and Adaptation

Add code
Apr 08, 2026
Viaarxiv icon

CLEAR: Cross-Lingual Enhancement in Alignment via Reverse-training

Add code
Apr 07, 2026
Viaarxiv icon

Improving Semantic Proximity in Information Retrieval through Cross-Lingual Alignment

Add code
Apr 07, 2026
Viaarxiv icon

Beyond Hard Negatives: The Importance of Score Distribution in Knowledge Distillation for Dense Retrieval

Add code
Apr 06, 2026
Viaarxiv icon

LANGSAE EDITING: Improving Multilingual Information Retrieval via Post-hoc Language Identity Removal

Add code
Jan 08, 2026
Viaarxiv icon

The Impact of Negated Text on Hallucination with Large Language Models

Add code
Oct 23, 2025
Viaarxiv icon

Metric Calculating Benchmark: Code-Verifiable Complicate Instruction Following Benchmark for Large Language Models

Add code
Oct 09, 2025
Figure 1 for Metric Calculating Benchmark: Code-Verifiable Complicate Instruction Following Benchmark for Large Language Models
Figure 2 for Metric Calculating Benchmark: Code-Verifiable Complicate Instruction Following Benchmark for Large Language Models
Figure 3 for Metric Calculating Benchmark: Code-Verifiable Complicate Instruction Following Benchmark for Large Language Models
Figure 4 for Metric Calculating Benchmark: Code-Verifiable Complicate Instruction Following Benchmark for Large Language Models
Viaarxiv icon