Topic Modeling


Topic modeling is a type of statistical modeling for discovering the abstract topics that occur in a collection of documents.

Kunnafonidilaw ka Cadeau: an ASR dataset of present-day Bambara

Add code
Dec 22, 2025
Figure 1 for Kunnafonidilaw ka Cadeau: an ASR dataset of present-day Bambara
Figure 2 for Kunnafonidilaw ka Cadeau: an ASR dataset of present-day Bambara
Figure 3 for Kunnafonidilaw ka Cadeau: an ASR dataset of present-day Bambara
Figure 4 for Kunnafonidilaw ka Cadeau: an ASR dataset of present-day Bambara
Viaarxiv icon

Evaluating the Capability of Video Question Generation for Expert Knowledge Elicitation

Add code
Dec 17, 2025
Viaarxiv icon

Simulstream: Open-Source Toolkit for Evaluation and Demonstration of Streaming Speech-to-Text Translation Systems

Add code
Dec 19, 2025
Figure 1 for Simulstream: Open-Source Toolkit for Evaluation and Demonstration of Streaming Speech-to-Text Translation Systems
Figure 2 for Simulstream: Open-Source Toolkit for Evaluation and Demonstration of Streaming Speech-to-Text Translation Systems
Figure 3 for Simulstream: Open-Source Toolkit for Evaluation and Demonstration of Streaming Speech-to-Text Translation Systems
Figure 4 for Simulstream: Open-Source Toolkit for Evaluation and Demonstration of Streaming Speech-to-Text Translation Systems
Viaarxiv icon

Large-Language Memorization During the Classification of United States Supreme Court Cases

Add code
Dec 15, 2025
Viaarxiv icon

PediatricAnxietyBench: Evaluating Large Language Model Safety Under Parental Anxiety and Pressure in Pediatric Consultations

Add code
Dec 17, 2025
Figure 1 for PediatricAnxietyBench: Evaluating Large Language Model Safety Under Parental Anxiety and Pressure in Pediatric Consultations
Figure 2 for PediatricAnxietyBench: Evaluating Large Language Model Safety Under Parental Anxiety and Pressure in Pediatric Consultations
Figure 3 for PediatricAnxietyBench: Evaluating Large Language Model Safety Under Parental Anxiety and Pressure in Pediatric Consultations
Figure 4 for PediatricAnxietyBench: Evaluating Large Language Model Safety Under Parental Anxiety and Pressure in Pediatric Consultations
Viaarxiv icon

Semantic Tree Inference on Text Corpa using a Nested Density Approach together with Large Language Model Embeddings

Add code
Dec 29, 2025
Viaarxiv icon

A stylometric analysis of speaker attribution from speech transcripts

Add code
Dec 18, 2025
Viaarxiv icon

M$^3$KG-RAG: Multi-hop Multimodal Knowledge Graph-enhanced Retrieval-Augmented Generation

Add code
Dec 24, 2025
Viaarxiv icon

Fluent Alignment with Disfluent Judges: Post-training for Lower-resource Languages

Add code
Dec 09, 2025
Figure 1 for Fluent Alignment with Disfluent Judges: Post-training for Lower-resource Languages
Figure 2 for Fluent Alignment with Disfluent Judges: Post-training for Lower-resource Languages
Figure 3 for Fluent Alignment with Disfluent Judges: Post-training for Lower-resource Languages
Figure 4 for Fluent Alignment with Disfluent Judges: Post-training for Lower-resource Languages
Viaarxiv icon

Impacts of Racial Bias in Historical Training Data for News AI

Add code
Dec 18, 2025
Figure 1 for Impacts of Racial Bias in Historical Training Data for News AI
Figure 2 for Impacts of Racial Bias in Historical Training Data for News AI
Figure 3 for Impacts of Racial Bias in Historical Training Data for News AI
Viaarxiv icon