Cross Lingual Document Classification


Low-Resource Language Processing: An OCR-Driven Summarization and Translation Pipeline

Add code
May 16, 2025
Viaarxiv icon

Towards Scalable and Cross-Lingual Specialist Language Models for Oncology

Add code
Mar 11, 2025
Viaarxiv icon

SRFUND: A Multi-Granularity Hierarchical Structure Reconstruction Benchmark in Form Understanding

Add code
Jun 13, 2024
Viaarxiv icon

What Drives Performance in Multilingual Language Models?

Add code
Apr 29, 2024
Viaarxiv icon

A Multi-Modal Multilingual Benchmark for Document Image Classification

Add code
Oct 25, 2023
Figure 1 for A Multi-Modal Multilingual Benchmark for Document Image Classification
Figure 2 for A Multi-Modal Multilingual Benchmark for Document Image Classification
Figure 3 for A Multi-Modal Multilingual Benchmark for Document Image Classification
Figure 4 for A Multi-Modal Multilingual Benchmark for Document Image Classification
Viaarxiv icon

L3Cube-IndicNews: News-based Short Text and Long Document Classification Datasets in Indic Languages

Add code
Jan 04, 2024
Figure 1 for L3Cube-IndicNews: News-based Short Text and Long Document Classification Datasets in Indic Languages
Figure 2 for L3Cube-IndicNews: News-based Short Text and Long Document Classification Datasets in Indic Languages
Figure 3 for L3Cube-IndicNews: News-based Short Text and Long Document Classification Datasets in Indic Languages
Figure 4 for L3Cube-IndicNews: News-based Short Text and Long Document Classification Datasets in Indic Languages
Viaarxiv icon

AMuRD: Annotated Multilingual Receipts Dataset for Cross-lingual Key Information Extraction and Classification

Add code
Sep 18, 2023
Viaarxiv icon

A General-Purpose Multilingual Document Encoder

Add code
May 11, 2023
Viaarxiv icon

Multimodal Document Analytics for Banking Process Automation

Add code
Jul 21, 2023
Viaarxiv icon

Knowledge Graph Embeddings for Multi-Lingual Structured Representations of Radiology Reports

Add code
Sep 14, 2023
Viaarxiv icon