Picture for Jin Peng Zhou

Jin Peng Zhou

Cognitive Structure Generation: From Educational Priors to Policy Optimization

Add code
Aug 18, 2025
Viaarxiv icon

Efficient Controllable Diffusion via Optimal Classifier Guidance

Add code
May 27, 2025
Viaarxiv icon

Value-Guided Search for Efficient Chain-of-Thought Reasoning

Add code
May 23, 2025
Figure 1 for Value-Guided Search for Efficient Chain-of-Thought Reasoning
Figure 2 for Value-Guided Search for Efficient Chain-of-Thought Reasoning
Figure 3 for Value-Guided Search for Efficient Chain-of-Thought Reasoning
Figure 4 for Value-Guided Search for Efficient Chain-of-Thought Reasoning
Viaarxiv icon

Pre-training Large Memory Language Models with Internal and External Knowledge

Add code
May 21, 2025
Viaarxiv icon

INPROVF: Leveraging Large Language Models to Repair High-level Robot Controllers from Assumption Violations

Add code
Mar 17, 2025
Figure 1 for INPROVF: Leveraging Large Language Models to Repair High-level Robot Controllers from Assumption Violations
Figure 2 for INPROVF: Leveraging Large Language Models to Repair High-level Robot Controllers from Assumption Violations
Figure 3 for INPROVF: Leveraging Large Language Models to Repair High-level Robot Controllers from Assumption Violations
Figure 4 for INPROVF: Leveraging Large Language Models to Repair High-level Robot Controllers from Assumption Violations
Viaarxiv icon

$Q\sharp$: Provably Optimal Distributional RL for LLM Post-Training

Add code
Feb 27, 2025
Figure 1 for $Q\sharp$: Provably Optimal Distributional RL for LLM Post-Training
Figure 2 for $Q\sharp$: Provably Optimal Distributional RL for LLM Post-Training
Figure 3 for $Q\sharp$: Provably Optimal Distributional RL for LLM Post-Training
Figure 4 for $Q\sharp$: Provably Optimal Distributional RL for LLM Post-Training
Viaarxiv icon

Rethinking LLM Unlearning Objectives: A Gradient Perspective and Go Beyond

Add code
Feb 26, 2025
Figure 1 for Rethinking LLM Unlearning Objectives: A Gradient Perspective and Go Beyond
Figure 2 for Rethinking LLM Unlearning Objectives: A Gradient Perspective and Go Beyond
Figure 3 for Rethinking LLM Unlearning Objectives: A Gradient Perspective and Go Beyond
Figure 4 for Rethinking LLM Unlearning Objectives: A Gradient Perspective and Go Beyond
Viaarxiv icon

Enhancing Cognitive Diagnosis by Modeling Learner Cognitive Structure State

Add code
Dec 27, 2024
Viaarxiv icon

Towards More Robust Retrieval-Augmented Generation: Evaluating RAG Under Adversarial Poisoning Attacks

Add code
Dec 21, 2024
Viaarxiv icon

Gemma 2: Improving Open Language Models at a Practical Size

Add code
Aug 02, 2024
Figure 1 for Gemma 2: Improving Open Language Models at a Practical Size
Figure 2 for Gemma 2: Improving Open Language Models at a Practical Size
Figure 3 for Gemma 2: Improving Open Language Models at a Practical Size
Figure 4 for Gemma 2: Improving Open Language Models at a Practical Size
Viaarxiv icon