Topic Modeling


Topic modeling is a type of statistical modeling for discovering the abstract topics that occur in a collection of documents.

Latent space analysis and generalization to out-of-distribution data

Add code
Nov 19, 2025
Viaarxiv icon

AA-Omniscience: Evaluating Cross-Domain Knowledge Reliability in Large Language Models

Add code
Nov 17, 2025
Figure 1 for AA-Omniscience: Evaluating Cross-Domain Knowledge Reliability in Large Language Models
Figure 2 for AA-Omniscience: Evaluating Cross-Domain Knowledge Reliability in Large Language Models
Figure 3 for AA-Omniscience: Evaluating Cross-Domain Knowledge Reliability in Large Language Models
Figure 4 for AA-Omniscience: Evaluating Cross-Domain Knowledge Reliability in Large Language Models
Viaarxiv icon

Opinion Dynamics Models for Sentiment Evolution in Weibo Blogs

Add code
Nov 19, 2025
Viaarxiv icon

HSKBenchmark: Modeling and Benchmarking Chinese Second Language Acquisition in Large Language Models through Curriculum Tuning

Add code
Nov 19, 2025
Figure 1 for HSKBenchmark: Modeling and Benchmarking Chinese Second Language Acquisition in Large Language Models through Curriculum Tuning
Figure 2 for HSKBenchmark: Modeling and Benchmarking Chinese Second Language Acquisition in Large Language Models through Curriculum Tuning
Figure 3 for HSKBenchmark: Modeling and Benchmarking Chinese Second Language Acquisition in Large Language Models through Curriculum Tuning
Figure 4 for HSKBenchmark: Modeling and Benchmarking Chinese Second Language Acquisition in Large Language Models through Curriculum Tuning
Viaarxiv icon

SAMIRO: Spatial Attention Mutual Information Regularization with a Pre-trained Model as Oracle for Lane Detection

Add code
Nov 13, 2025
Figure 1 for SAMIRO: Spatial Attention Mutual Information Regularization with a Pre-trained Model as Oracle for Lane Detection
Figure 2 for SAMIRO: Spatial Attention Mutual Information Regularization with a Pre-trained Model as Oracle for Lane Detection
Figure 3 for SAMIRO: Spatial Attention Mutual Information Regularization with a Pre-trained Model as Oracle for Lane Detection
Figure 4 for SAMIRO: Spatial Attention Mutual Information Regularization with a Pre-trained Model as Oracle for Lane Detection
Viaarxiv icon

Context is Enough: Empirical Validation of $\textit{Sequentiality}$ on Essays

Add code
Nov 12, 2025
Viaarxiv icon

Work-in-Progress: Function-as-Subtask API Replacing Publish/Subscribe for OS-Native DAG Scheduling

Add code
Nov 11, 2025
Viaarxiv icon

Sub-exponential Growth of New Words and Names Online: A Piecewise Power-Law Model

Add code
Nov 11, 2025
Viaarxiv icon

RAGFort: Dual-Path Defense Against Proprietary Knowledge Base Extraction in Retrieval-Augmented Generation

Add code
Nov 13, 2025
Figure 1 for RAGFort: Dual-Path Defense Against Proprietary Knowledge Base Extraction in Retrieval-Augmented Generation
Figure 2 for RAGFort: Dual-Path Defense Against Proprietary Knowledge Base Extraction in Retrieval-Augmented Generation
Figure 3 for RAGFort: Dual-Path Defense Against Proprietary Knowledge Base Extraction in Retrieval-Augmented Generation
Figure 4 for RAGFort: Dual-Path Defense Against Proprietary Knowledge Base Extraction in Retrieval-Augmented Generation
Viaarxiv icon

BhashaKritika: Building Synthetic Pretraining Data at Scale for Indic Languages

Add code
Nov 16, 2025
Viaarxiv icon