Kaiqiang Ke

Context-Picker: Dynamic context selection using multi-stage reinforcement learning

Dec 16, 2025

H$^2$R: Hierarchical Hindsight Reflection for Multi-Task LLM Agents

Sep 16, 2025

GCHR: Goal-Conditioned Hindsight Regularization for Sample-Efficient Reinforcement Learning

Aug 08, 2025