Picture for Pengfei Cao

Pengfei Cao

Learning How to Remember: A Meta-Cognitive Management Method for Structured and Transferable Agent Memory

Add code
Jan 12, 2026
Viaarxiv icon

Bias-Restrained Prefix Representation Finetuning for Mathematical Reasoning

Add code
Nov 13, 2025
Figure 1 for Bias-Restrained Prefix Representation Finetuning for Mathematical Reasoning
Figure 2 for Bias-Restrained Prefix Representation Finetuning for Mathematical Reasoning
Figure 3 for Bias-Restrained Prefix Representation Finetuning for Mathematical Reasoning
Figure 4 for Bias-Restrained Prefix Representation Finetuning for Mathematical Reasoning
Viaarxiv icon

Know-MRI: A Knowledge Mechanisms Revealer&Interpreter for Large Language Models

Add code
Jun 10, 2025
Viaarxiv icon

MMR-V: What's Left Unsaid? A Benchmark for Multimodal Deep Reasoning in Videos

Add code
Jun 04, 2025
Viaarxiv icon

Large Language Models for Planning: A Comprehensive and Systematic Survey

Add code
May 26, 2025
Figure 1 for Large Language Models for Planning: A Comprehensive and Systematic Survey
Figure 2 for Large Language Models for Planning: A Comprehensive and Systematic Survey
Figure 3 for Large Language Models for Planning: A Comprehensive and Systematic Survey
Figure 4 for Large Language Models for Planning: A Comprehensive and Systematic Survey
Viaarxiv icon

Revealing the Deceptiveness of Knowledge Editing: A Mechanistic Analysis of Superficial Editing

Add code
May 19, 2025
Viaarxiv icon

Rewarding Curse: Analyze and Mitigate Reward Modeling Issues for LLM Reasoning

Add code
Mar 07, 2025
Viaarxiv icon

The Knowledge Microscope: Features as Better Analytical Lenses than Neurons

Add code
Feb 18, 2025
Viaarxiv icon

RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment

Add code
Dec 18, 2024
Figure 1 for RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment
Figure 2 for RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment
Figure 3 for RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment
Figure 4 for RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment
Viaarxiv icon

One Mind, Many Tongues: A Deep Dive into Language-Agnostic Knowledge Neurons in Large Language Models

Add code
Nov 26, 2024
Figure 1 for One Mind, Many Tongues: A Deep Dive into Language-Agnostic Knowledge Neurons in Large Language Models
Figure 2 for One Mind, Many Tongues: A Deep Dive into Language-Agnostic Knowledge Neurons in Large Language Models
Figure 3 for One Mind, Many Tongues: A Deep Dive into Language-Agnostic Knowledge Neurons in Large Language Models
Figure 4 for One Mind, Many Tongues: A Deep Dive into Language-Agnostic Knowledge Neurons in Large Language Models
Viaarxiv icon