Topic


Self-Preference Bias in Rubric-Based Evaluation of Large Language Models

Add code
Apr 08, 2026
Viaarxiv icon

Reasoning-Based Refinement of Unsupervised Text Clusters with LLMs

Add code
Apr 08, 2026
Viaarxiv icon

ChunQiuTR: Time-Keyed Temporal Retrieval in Classical Chinese Annals

Add code
Apr 08, 2026
Viaarxiv icon

Mixed-Initiative Context: Structuring and Managing Context for Human-AI Collaboration

Add code
Apr 08, 2026
Viaarxiv icon

Beyond Paper-to-Paper: Structured Profiling and Rubric Scoring for Paper-Reviewer Matching

Add code
Apr 07, 2026
Viaarxiv icon

Can We Trust a Black-box LLM? LLM Untrustworthy Boundary Detection via Bias-Diffusion and Multi-Agent Reinforcement Learning

Add code
Apr 07, 2026
Viaarxiv icon

AI and Collective Decisions: Strengthening Legitimacy and Losers' Consent

Add code
Apr 07, 2026
Viaarxiv icon

Context-Agent: Dynamic Discourse Trees for Non-Linear Dialogue

Add code
Apr 07, 2026
Viaarxiv icon

CAKE: Cloud Architecture Knowledge Evaluation of Large Language Models

Add code
Apr 07, 2026
Viaarxiv icon

The LLM Effect on IR Benchmarks: A Meta-Analysis of Effectiveness, Baselines, and Contamination

Add code
Apr 07, 2026
Viaarxiv icon