Picture for Pengfei Cao

Pengfei Cao

MMR-V: What's Left Unsaid? A Benchmark for Multimodal Deep Reasoning in Videos

Add code
Jun 04, 2025
Viaarxiv icon

Large Language Models for Planning: A Comprehensive and Systematic Survey

Add code
May 26, 2025
Viaarxiv icon

Revealing the Deceptiveness of Knowledge Editing: A Mechanistic Analysis of Superficial Editing

Add code
May 19, 2025
Viaarxiv icon

Rewarding Curse: Analyze and Mitigate Reward Modeling Issues for LLM Reasoning

Add code
Mar 07, 2025
Viaarxiv icon

The Knowledge Microscope: Features as Better Analytical Lenses than Neurons

Add code
Feb 18, 2025
Viaarxiv icon

RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment

Add code
Dec 18, 2024
Figure 1 for RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment
Figure 2 for RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment
Figure 3 for RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment
Figure 4 for RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment
Viaarxiv icon

One Mind, Many Tongues: A Deep Dive into Language-Agnostic Knowledge Neurons in Large Language Models

Add code
Nov 26, 2024
Figure 1 for One Mind, Many Tongues: A Deep Dive into Language-Agnostic Knowledge Neurons in Large Language Models
Figure 2 for One Mind, Many Tongues: A Deep Dive into Language-Agnostic Knowledge Neurons in Large Language Models
Figure 3 for One Mind, Many Tongues: A Deep Dive into Language-Agnostic Knowledge Neurons in Large Language Models
Figure 4 for One Mind, Many Tongues: A Deep Dive into Language-Agnostic Knowledge Neurons in Large Language Models
Viaarxiv icon

DTELS: Towards Dynamic Granularity of Timeline Summarization

Add code
Nov 14, 2024
Figure 1 for DTELS: Towards Dynamic Granularity of Timeline Summarization
Figure 2 for DTELS: Towards Dynamic Granularity of Timeline Summarization
Figure 3 for DTELS: Towards Dynamic Granularity of Timeline Summarization
Figure 4 for DTELS: Towards Dynamic Granularity of Timeline Summarization
Viaarxiv icon

A Troublemaker with Contagious Jailbreak Makes Chaos in Honest Towns

Add code
Oct 21, 2024
Viaarxiv icon

LINKED: Eliciting, Filtering and Integrating Knowledge in Large Language Model for Commonsense Reasoning

Add code
Oct 12, 2024
Viaarxiv icon