Picture for Chenfu Bao

Chenfu Bao

Token-Level Policy Optimization: Linking Group-Level Rewards to Token-Level Aggregation via Markov Likelihood

Add code
Oct 10, 2025
Viaarxiv icon

HAMLET-FFD: Hierarchical Adaptive Multi-modal Learning Embeddings Transformation for Face Forgery Detection

Add code
Jul 28, 2025
Viaarxiv icon

Knowledgeable-r1: Policy Optimization for Knowledge Exploration in Retrieval-Augmented Generation

Add code
Jun 05, 2025
Figure 1 for Knowledgeable-r1: Policy Optimization for Knowledge Exploration in Retrieval-Augmented Generation
Figure 2 for Knowledgeable-r1: Policy Optimization for Knowledge Exploration in Retrieval-Augmented Generation
Figure 3 for Knowledgeable-r1: Policy Optimization for Knowledge Exploration in Retrieval-Augmented Generation
Figure 4 for Knowledgeable-r1: Policy Optimization for Knowledge Exploration in Retrieval-Augmented Generation
Viaarxiv icon

A Framework for Cost-Effective and Self-Adaptive LLM Shaking and Recovery Mechanism

Add code
Mar 12, 2024
Viaarxiv icon