Relational Reasoning


EBPO: Empirical Bayes Shrinkage for Stabilizing Group-Relative Policy Optimization

Add code
Feb 05, 2026
Viaarxiv icon

LongR: Unleashing Long-Context Reasoning via Reinforcement Learning with Dense Utility Rewards

Add code
Feb 05, 2026
Viaarxiv icon

Back to Basics: Revisiting Exploration in Reinforcement Learning for LLM Reasoning via Generative Probabilities

Add code
Feb 05, 2026
Viaarxiv icon

THOR: Inductive Link Prediction over Hyper-Relational Knowledge Graphs

Add code
Feb 05, 2026
Viaarxiv icon

Graph-based Agent Memory: Taxonomy, Techniques, and Applications

Add code
Feb 05, 2026
Viaarxiv icon

SDR-CIR: Semantic Debias Retrieval Framework for Training-Free Zero-Shot Composed Image Retrieval

Add code
Feb 05, 2026
Viaarxiv icon

Unveiling Implicit Advantage Symmetry: Why GRPO Struggles with Exploration and Difficulty Adaptation

Add code
Feb 05, 2026
Viaarxiv icon

$f$-GRPO and Beyond: Divergence-Based Reinforcement Learning Algorithms for General LLM Alignment

Add code
Feb 05, 2026
Viaarxiv icon

Cost-Efficient RAG for Entity Matching with LLMs: A Blocking-based Exploration

Add code
Feb 05, 2026
Viaarxiv icon

xList-Hate: A Checklist-Based Framework for Interpretable and Generalizable Hate Speech Detection

Add code
Feb 05, 2026
Viaarxiv icon