Topic


Correcting Selection Bias in Sparse User Feedback for Large Language Model Quality Estimation: A Multi-Agent Hierarchical Bayesian Approach

May 12, 2026
Via arXiv

Robust Biomedical Publication Type and Study Design Classification with Knowledge-Guided Perturbations

May 12, 2026
Via arXiv

G$^2$TR: Generation-Guided Visual Token Reduction for Separate-Encoder Unified Multimodal Models

May 12, 2026
Via arXiv

On the Size Complexity and Decidability of First-Order Progression

May 12, 2026
Via arXiv

Reward Hacking in Rubric-Based Reinforcement Learning

May 12, 2026
Via arXiv

Output Composability of QLoRA PEFT Modules for Plug-and-Play Attribute-Controlled Text Generation

May 12, 2026
Via arXiv

Enhancing Target-Guided Proactive Dialogue Systems via Conversational Scenario Modeling and Intent-Keyword Bridging

May 12, 2026
Via arXiv

A Research Agenda on Agents and Software Engineering: Outcomes from the Rio A2SE Seminar

May 12, 2026
Via arXiv

ASTRA-QA: A Benchmark for Abstract Question Answering over Documents

May 11, 2026
Via arXiv

AgentGR: Semantic-aware Agentic Group Decision-Making Simulator for Group Recommendation

May 11, 2026
Via arXiv