Picture for Bowen Song

Bowen Song

OPRD: On-Policy Representation Distillation

Add code
Jun 04, 2026
Viaarxiv icon

Smart Picks in the Dark: Towards Efficient RLVR for Reasoning via Tracing Metacognitive Pivots

Add code
Jun 03, 2026
Viaarxiv icon

GeoMin: Data-Efficient Semi-Supervised RLVR via Geometric Distribution Modeling

Add code
Jun 03, 2026
Viaarxiv icon

GAPD: Gold-Action Policy Distillation for Agentic Reinforcement Learning in Knowledge Base Question Answering

Add code
May 28, 2026
Viaarxiv icon

Can LLMs Learn to Reason Robustly under Noisy Supervision?

Add code
Apr 05, 2026
Viaarxiv icon

Multimodal Adaptive Retrieval Augmented Generation through Internal Representation Learning

Add code
Feb 28, 2026
Viaarxiv icon

Predict the Retrieval! Test time adaptation for Retrieval Augmented Generation

Add code
Jan 16, 2026
Viaarxiv icon

TraPO: A Semi-Supervised Reinforcement Learning Framework for Boosting LLM Reasoning

Add code
Dec 15, 2025
Figure 1 for TraPO: A Semi-Supervised Reinforcement Learning Framework for Boosting LLM Reasoning
Figure 2 for TraPO: A Semi-Supervised Reinforcement Learning Framework for Boosting LLM Reasoning
Figure 3 for TraPO: A Semi-Supervised Reinforcement Learning Framework for Boosting LLM Reasoning
Figure 4 for TraPO: A Semi-Supervised Reinforcement Learning Framework for Boosting LLM Reasoning
Viaarxiv icon

KBQA-R1: Reinforcing Large Language Models for Knowledge Base Question Answering

Add code
Dec 10, 2025
Viaarxiv icon

SMART: Relation-Aware Learning of Geometric Representations for Knowledge Graphs

Add code
Jul 17, 2025
Viaarxiv icon