Picture for Andrew C Yao

Andrew C Yao

On the Design of KL-Regularized Policy Gradient Algorithms for LLM Reasoning

Add code
May 23, 2025
Viaarxiv icon

Hierarchical Attention Generates Better Proofs

Add code
Apr 27, 2025
Viaarxiv icon

Efficient Neural Theorem Proving via Fine-grained Proof Structure Analysis

Add code
Jan 30, 2025
Viaarxiv icon