Topic


MaterialFigBENCH: benchmark dataset with figures for evaluating college-level materials science problem-solving abilities of multimodal large language models

Mar 12, 2026

BTZSC: A Benchmark for Zero-Shot Text Classification Across Cross-Encoders, Embedding Models, Rerankers and LLMs

Mar 12, 2026

Disentangling Similarity and Relatedness in Topic Models

Mar 11, 2026

Large language models can disambiguate opioid slang on social media

Mar 11, 2026

Beyond Relevance: On the Relationship Between Retrieval and RAG Information Coverage

Mar 11, 2026

How do AI agents talk about science and research? An exploration of scientific discussions on Moltbook using BERTopic

Mar 11, 2026

Modeling Stage-wise Evolution of User Interests for News Recommendation

Mar 11, 2026

Social Knowledge for Cross-Domain User Preference Modeling

Mar 10, 2026

Thinking to Recall: How Reasoning Unlocks Parametric Knowledge in LLMs

Mar 10, 2026

The Reasoning Trap -- Logical Reasoning as a Mechanistic Pathway to Situational Awareness

Mar 10, 2026