Picture for Bolin Ding

Bolin Ding

Remember Me, Refine Me: A Dynamic Procedural Memory Framework for Experience-Driven Agent Evolution

Add code
Dec 11, 2025
Viaarxiv icon

d-TreeRPO: Towards More Reliable Policy Optimization for Diffusion Language Models

Add code
Dec 10, 2025
Viaarxiv icon

AgentEvolver: Towards Efficient Self-Evolving Agent System

Add code
Nov 13, 2025
Viaarxiv icon

BOTS: A Unified Framework for Bayesian Online Task Selection in LLM Reinforcement Finetuning

Add code
Oct 30, 2025
Viaarxiv icon

Omni-SafetyBench: A Benchmark for Safety Evaluation of Audio-Visual Large Language Models

Add code
Aug 10, 2025
Figure 1 for Omni-SafetyBench: A Benchmark for Safety Evaluation of Audio-Visual Large Language Models
Figure 2 for Omni-SafetyBench: A Benchmark for Safety Evaluation of Audio-Visual Large Language Models
Figure 3 for Omni-SafetyBench: A Benchmark for Safety Evaluation of Audio-Visual Large Language Models
Figure 4 for Omni-SafetyBench: A Benchmark for Safety Evaluation of Audio-Visual Large Language Models
Viaarxiv icon

Provoking Multi-modal Few-Shot LVLM via Exploration-Exploitation In-Context Learning

Add code
Jun 11, 2025
Viaarxiv icon

Respecting Temporal-Causal Consistency: Entity-Event Knowledge Graphs for Retrieval-Augmented Generation

Add code
Jun 06, 2025
Figure 1 for Respecting Temporal-Causal Consistency: Entity-Event Knowledge Graphs for Retrieval-Augmented Generation
Figure 2 for Respecting Temporal-Causal Consistency: Entity-Event Knowledge Graphs for Retrieval-Augmented Generation
Figure 3 for Respecting Temporal-Causal Consistency: Entity-Event Knowledge Graphs for Retrieval-Augmented Generation
Figure 4 for Respecting Temporal-Causal Consistency: Entity-Event Knowledge Graphs for Retrieval-Augmented Generation
Viaarxiv icon

Incentivizing Strong Reasoning from Weak Supervision

Add code
May 28, 2025
Figure 1 for Incentivizing Strong Reasoning from Weak Supervision
Figure 2 for Incentivizing Strong Reasoning from Weak Supervision
Figure 3 for Incentivizing Strong Reasoning from Weak Supervision
Figure 4 for Incentivizing Strong Reasoning from Weak Supervision
Viaarxiv icon

Incentivizing Reasoning from Weak Supervision

Add code
May 26, 2025
Figure 1 for Incentivizing Reasoning from Weak Supervision
Figure 2 for Incentivizing Reasoning from Weak Supervision
Figure 3 for Incentivizing Reasoning from Weak Supervision
Figure 4 for Incentivizing Reasoning from Weak Supervision
Viaarxiv icon

Trinity-RFT: A General-Purpose and Unified Framework for Reinforcement Fine-Tuning of Large Language Models

Add code
May 23, 2025
Viaarxiv icon