Hao Peng

Beihang University

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

May 28, 2025
Via arXiv

AGENTIF: Benchmarking Instruction Following of Large Language Models in Agentic Scenarios

May 22, 2025
Via arXiv

The Unreasonable Effectiveness of Entropy Minimization in LLM Reasoning

May 21, 2025
Via arXiv

Unsupervised Graph Clustering with Deep Structural Entropy

May 20, 2025
Via arXiv

mCLM: A Function-Infused and Synthesis-Friendly Modular Chemical Language Model

May 18, 2025
Via arXiv

Reinforcement Learning Finetunes Small Subnetworks in Large Language Models

May 16, 2025
Via arXiv

Prioritizing Image-Related Tokens Enhances Vision-Language Pre-Training

May 13, 2025
Via arXiv

T-T: Table Transformer for Tagging-based Aspect Sentiment Triplet Extraction

May 08, 2025
Via arXiv

Scalable Multi-Stage Influence Function for Large Language Models via Eigenvalue-Corrected Kronecker-Factored Parameterization

May 08, 2025
Via arXiv

Adaptive and Robust DBSCAN with Multi-agent Reinforcement Learning

May 07, 2025
Via arXiv