Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:ProtoSiTex: Learning Semi-Interpretable Prototypes for Multi-label Text Classification

Oct 14, 2025

Utsav Kumar Nareti, Suraj Kumar, Soumya Pandey, Soumi Chattopadhyay, Chandranath Adak

Figure 1 for ProtoSiTex: Learning Semi-Interpretable Prototypes for Multi-label Text Classification

Figure 2 for ProtoSiTex: Learning Semi-Interpretable Prototypes for Multi-label Text Classification

Figure 3 for ProtoSiTex: Learning Semi-Interpretable Prototypes for Multi-label Text Classification

Figure 4 for ProtoSiTex: Learning Semi-Interpretable Prototypes for Multi-label Text Classification

Share this with someone who'll enjoy it:

Abstract:The surge in user-generated reviews has amplified the need for interpretable models that can provide fine-grained insights. Existing prototype-based models offer intuitive explanations but typically operate at coarse granularity (sentence or document level) and fail to address the multi-label nature of real-world text classification. We propose ProtoSiTex, a semi-interpretable framework designed for fine-grained multi-label text classification. ProtoSiTex employs a dual-phase alternating training strategy: an unsupervised prototype discovery phase that learns semantically coherent and diverse prototypes, and a supervised classification phase that maps these prototypes to class labels. A hierarchical loss function enforces consistency across sub-sentence, sentence, and document levels, enhancing interpretability and alignment. Unlike prior approaches, ProtoSiTex captures overlapping and conflicting semantics using adaptive prototypes and multi-head attention. We also introduce a benchmark dataset of hotel reviews annotated at the sub-sentence level with multiple labels. Experiments on this dataset and two public benchmarks (binary and multi-class) show that ProtoSiTex achieves state-of-the-art performance while delivering faithful, human-aligned explanations, establishing it as a robust solution for semi-interpretable multi-label text classification.

View paper on

Share this with someone who'll enjoy it:

Title:ProtoSiTex: Learning Semi-Interpretable Prototypes for Multi-label Text Classification

Paper and Code