Picture for Hao Peng

Hao Peng

Beihang University

Generalization of RLVR Using Causal Reasoning as a Testbed

Add code
Dec 23, 2025
Viaarxiv icon

Adaptation of Agentic AI

Add code
Dec 22, 2025
Viaarxiv icon

MixKVQ: Query-Aware Mixed-Precision KV Cache Quantization for Long-Context Reasoning

Add code
Dec 22, 2025
Viaarxiv icon

RAGFort: Dual-Path Defense Against Proprietary Knowledge Base Extraction in Retrieval-Augmented Generation

Add code
Nov 13, 2025
Viaarxiv icon

Do Not Merge My Model! Safeguarding Open-Source LLMs Against Unauthorized Model Merging

Add code
Nov 13, 2025
Viaarxiv icon

RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments

Add code
Nov 10, 2025
Viaarxiv icon

Executable Counterfactuals: Improving LLMs' Causal Reasoning Through Code

Add code
Oct 02, 2025
Viaarxiv icon

Probing the Critical Point (CritPt) of AI Reasoning: a Frontier Physics Research Benchmark

Add code
Oct 01, 2025
Figure 1 for Probing the Critical Point (CritPt) of AI Reasoning: a Frontier Physics Research Benchmark
Figure 2 for Probing the Critical Point (CritPt) of AI Reasoning: a Frontier Physics Research Benchmark
Figure 3 for Probing the Critical Point (CritPt) of AI Reasoning: a Frontier Physics Research Benchmark
Figure 4 for Probing the Critical Point (CritPt) of AI Reasoning: a Frontier Physics Research Benchmark
Viaarxiv icon

Structural Information-based Hierarchical Diffusion for Offline Reinforcement Learning

Add code
Sep 26, 2025
Figure 1 for Structural Information-based Hierarchical Diffusion for Offline Reinforcement Learning
Figure 2 for Structural Information-based Hierarchical Diffusion for Offline Reinforcement Learning
Figure 3 for Structural Information-based Hierarchical Diffusion for Offline Reinforcement Learning
Figure 4 for Structural Information-based Hierarchical Diffusion for Offline Reinforcement Learning
Viaarxiv icon

Leveraging Support Vector Regression for Outcome Prediction in Personalized Ultra-fractionated Stereotactic Adaptive Radiotherapy

Add code
Sep 09, 2025
Viaarxiv icon