Picture for Jin Peng Zhou

Jin Peng Zhou

Cognitive Structure Generation: From Educational Priors to Policy Optimization

Add code
Aug 18, 2025
Viaarxiv icon

Efficient Controllable Diffusion via Optimal Classifier Guidance

Add code
May 27, 2025
Viaarxiv icon

Value-Guided Search for Efficient Chain-of-Thought Reasoning

Add code
May 23, 2025
Viaarxiv icon

Pre-training Large Memory Language Models with Internal and External Knowledge

Add code
May 21, 2025
Viaarxiv icon

INPROVF: Leveraging Large Language Models to Repair High-level Robot Controllers from Assumption Violations

Add code
Mar 17, 2025
Viaarxiv icon

$Q\sharp$: Provably Optimal Distributional RL for LLM Post-Training

Add code
Feb 27, 2025
Figure 1 for $Q\sharp$: Provably Optimal Distributional RL for LLM Post-Training
Figure 2 for $Q\sharp$: Provably Optimal Distributional RL for LLM Post-Training
Figure 3 for $Q\sharp$: Provably Optimal Distributional RL for LLM Post-Training
Figure 4 for $Q\sharp$: Provably Optimal Distributional RL for LLM Post-Training
Viaarxiv icon

Rethinking LLM Unlearning Objectives: A Gradient Perspective and Go Beyond

Add code
Feb 26, 2025
Viaarxiv icon

Enhancing Cognitive Diagnosis by Modeling Learner Cognitive Structure State

Add code
Dec 27, 2024
Viaarxiv icon

Towards More Robust Retrieval-Augmented Generation: Evaluating RAG Under Adversarial Poisoning Attacks

Add code
Dec 21, 2024
Viaarxiv icon

Gemma 2: Improving Open Language Models at a Practical Size

Add code
Aug 02, 2024
Figure 1 for Gemma 2: Improving Open Language Models at a Practical Size
Figure 2 for Gemma 2: Improving Open Language Models at a Practical Size
Figure 3 for Gemma 2: Improving Open Language Models at a Practical Size
Figure 4 for Gemma 2: Improving Open Language Models at a Practical Size
Viaarxiv icon