Topic Modeling


Topic modeling is a type of statistical modeling for discovering the abstract topics that occur in a collection of documents.
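
To make the definition concrete, here is a minimal sketch of one widely used topic model, latent Dirichlet allocation (LDA), fitted with collapsed Gibbs sampling. The listing does not name a particular method, so the choice of LDA, the function name `fit_lda`, the toy corpus, and the hyperparameter values (`alpha`, `beta`, `iterations`) are all assumptions made for illustration.

```python
import random
from collections import Counter


def fit_lda(docs, num_topics=2, iterations=200, alpha=0.1, beta=0.01, seed=0):
    """Illustrative collapsed Gibbs sampler for LDA over tokenized documents.

    Hypothetical helper: hyperparameter defaults are assumptions, not tuned values.
    """
    rng = random.Random(seed)
    vocab_size = len({word for doc in docs for word in doc})

    # Count tables maintained by the sampler.
    doc_topic = [[0] * num_topics for _ in docs]         # tokens in doc d assigned to topic k
    topic_word = [Counter() for _ in range(num_topics)]  # times each word is assigned to topic k
    topic_total = [0] * num_topics                       # total tokens assigned to topic k

    # Start from a random topic assignment for every token.
    assignments = []
    for d, doc in enumerate(docs):
        doc_assign = []
        for word in doc:
            k = rng.randrange(num_topics)
            doc_assign.append(k)
            doc_topic[d][k] += 1
            topic_word[k][word] += 1
            topic_total[k] += 1
        assignments.append(doc_assign)

    for _ in range(iterations):
        for d, doc in enumerate(docs):
            for i, word in enumerate(doc):
                # Remove the token's current assignment from the counts.
                k = assignments[d][i]
                doc_topic[d][k] -= 1
                topic_word[k][word] -= 1
                topic_total[k] -= 1

                # Unnormalized conditional probability of each topic for this token.
                weights = [
                    (doc_topic[d][t] + alpha)
                    * (topic_word[t][word] + beta)
                    / (topic_total[t] + beta * vocab_size)
                    for t in range(num_topics)
                ]
                k = rng.choices(range(num_topics), weights=weights)[0]

                # Record the newly sampled assignment.
                assignments[d][i] = k
                doc_topic[d][k] += 1
                topic_word[k][word] += 1
                topic_total[k] += 1

    return doc_topic, topic_word


# Toy corpus (made up for this example): two documents about language models,
# two about image restoration.
docs = [
    "language model pretraining data multilingual".split(),
    "multilingual language data mixture pretraining".split(),
    "image restoration convolutional filter".split(),
    "convolutional filter image denoising".split(),
]

doc_topic, topic_word = fit_lda(docs)
for k, counts in enumerate(topic_word):
    print(k, [word for word, _ in counts.most_common(3)])
```

Each row of `doc_topic` gives the number of tokens in a document assigned to each topic, and each `topic_word` counter gives the words most associated with a topic. On a corpus this small the result is only indicative; real use would rely on a maintained library and a much larger collection.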

BhashaKritika: Building Synthetic Pretraining Data at Scale for Indic Languages

Nov 16, 2025

RAGFort: Dual-Path Defense Against Proprietary Knowledge Base Extraction in Retrieval-Augmented Generation

Nov 13, 2025

MedPT: A Massive Medical Question Answering Dataset for Brazilian-Portuguese Speakers

Nov 14, 2025

State of the Art in Text Classification for South Slavic Languages: Fine-Tuning or Prompting?

Nov 11, 2025

Whisper Leak: a side-channel attack on Large Language Models

Nov 05, 2025

Sharing the Learned Knowledge-base to Estimate Convolutional Filter Parameters for Continual Image Restoration

Nov 07, 2025

Culture Cartography: Mapping the Landscape of Cultural Knowledge

Oct 31, 2025

One Battle After Another: Probing LLMs' Limits on Multi-Turn Instruction Following with a Benchmark Evolving Framework

Nov 05, 2025

Revisiting Multilingual Data Mixtures in Language Model Pretraining

Oct 29, 2025

BiCA: Effective Biomedical Dense Retrieval with Citation-Aware Hard Negatives

Nov 11, 2025