Topic


ChemPro: A Progressive Chemistry Benchmark for Large Language Models

Add code
Feb 03, 2026
Viaarxiv icon

EverMemBench: Benchmarking Long-Term Interactive Memory in Large Language Models

Add code
Feb 03, 2026
Viaarxiv icon

Cognitively Diverse Multiple-Choice Question Generation: A Hybrid Multi-Agent Framework with Large Language Models

Add code
Feb 03, 2026
Viaarxiv icon

Rethinking Benign Relearning: Syntax as the Hidden Driver of Unlearning Failures

Add code
Feb 03, 2026
Viaarxiv icon

Contrastive Concept-Tree Search for LLM-Assisted Algorithm Discovery

Add code
Feb 03, 2026
Viaarxiv icon

WildGraphBench: Benchmarking GraphRAG with Wild-Source Corpora

Add code
Feb 03, 2026
Viaarxiv icon

What LLMs Think When You Don't Tell Them What to Think About?

Add code
Feb 02, 2026
Viaarxiv icon

Large Language Model and Formal Concept Analysis: a comparative study for Topic Modeling

Add code
Feb 02, 2026
Viaarxiv icon

DrawSim-PD: Simulating Student Science Drawings to Support NGSS-Aligned Teacher Diagnostic Reasoning

Add code
Feb 02, 2026
Viaarxiv icon

From Utterance to Vividity: Training Expressive Subtitle Translation LLM via Adaptive Local Preference Optimization

Add code
Feb 01, 2026
Viaarxiv icon