Text Classification


Text classification is the process of categorizing text documents into predefined categories or labels.

Cardiac-CLIP: A Vision-Language Foundation Model for 3D Cardiac CT Images

Add code
Jul 29, 2025
Figure 1 for Cardiac-CLIP: A Vision-Language Foundation Model for 3D Cardiac CT Images
Figure 2 for Cardiac-CLIP: A Vision-Language Foundation Model for 3D Cardiac CT Images
Figure 3 for Cardiac-CLIP: A Vision-Language Foundation Model for 3D Cardiac CT Images
Figure 4 for Cardiac-CLIP: A Vision-Language Foundation Model for 3D Cardiac CT Images
Viaarxiv icon

Uncertainty-driven Embedding Convolution

Add code
Jul 28, 2025
Viaarxiv icon

MGFFD-VLM: Multi-Granularity Prompt Learning for Face Forgery Detection with VLM

Add code
Jul 16, 2025
Figure 1 for MGFFD-VLM: Multi-Granularity Prompt Learning for Face Forgery Detection with VLM
Figure 2 for MGFFD-VLM: Multi-Granularity Prompt Learning for Face Forgery Detection with VLM
Figure 3 for MGFFD-VLM: Multi-Granularity Prompt Learning for Face Forgery Detection with VLM
Figure 4 for MGFFD-VLM: Multi-Granularity Prompt Learning for Face Forgery Detection with VLM
Viaarxiv icon

HQ-CLIP: Leveraging Large Vision-Language Models to Create High-Quality Image-Text Datasets and CLIP Models

Add code
Jul 30, 2025
Viaarxiv icon

RingMo-Agent: A Unified Remote Sensing Foundation Model for Multi-Platform and Multi-Modal Reasoning

Add code
Jul 28, 2025
Figure 1 for RingMo-Agent: A Unified Remote Sensing Foundation Model for Multi-Platform and Multi-Modal Reasoning
Figure 2 for RingMo-Agent: A Unified Remote Sensing Foundation Model for Multi-Platform and Multi-Modal Reasoning
Figure 3 for RingMo-Agent: A Unified Remote Sensing Foundation Model for Multi-Platform and Multi-Modal Reasoning
Figure 4 for RingMo-Agent: A Unified Remote Sensing Foundation Model for Multi-Platform and Multi-Modal Reasoning
Viaarxiv icon

Leveraging Pre-Trained Visual Models for AI-Generated Video Detection

Add code
Jul 17, 2025
Figure 1 for Leveraging Pre-Trained Visual Models for AI-Generated Video Detection
Figure 2 for Leveraging Pre-Trained Visual Models for AI-Generated Video Detection
Figure 3 for Leveraging Pre-Trained Visual Models for AI-Generated Video Detection
Figure 4 for Leveraging Pre-Trained Visual Models for AI-Generated Video Detection
Viaarxiv icon

TransLPRNet: Lite Vision-Language Network for Single/Dual-line Chinese License Plate Recognition

Add code
Jul 23, 2025
Viaarxiv icon

How Effectively Can BERT Models Interpret Context and Detect Bengali Communal Violent Text?

Add code
Jun 24, 2025
Viaarxiv icon

FRaN-X: FRaming and Narratives-eXplorer

Add code
Jul 09, 2025
Viaarxiv icon

Inverse Scene Text Removal

Add code
Jun 26, 2025
Viaarxiv icon