Pengfei Zhou

CrossLinear: Plug-and-Play Cross-Correlation Embedding for Time Series Forecasting with Exogenous Variables

May 29, 2025
Via arXiv

REPA Works Until It Doesn't: Early-Stopped, Holistic Alignment Supercharges Diffusion Training

May 22, 2025
Via arXiv

DD-Ranking: Rethinking the Evaluation of Dataset Distillation

May 19, 2025
Via arXiv

Human-Aligned Bench: Fine-Grained Assessment of Reasoning Ability in MLLMs vs. Humans

May 16, 2025
Via arXiv

EnerVerse-AC: Envisioning Embodied Environments with Action Condition

May 14, 2025
Via arXiv

EWMBench: Evaluating Scene, Motion, and Semantic Quality in Embodied World Models

May 14, 2025
Via arXiv

MDK12-Bench: A Multi-Discipline Benchmark for Evaluating Reasoning in Multimodal Large Language Models

Apr 08, 2025
Via arXiv

MPBench: A Comprehensive Multimodal Reasoning Benchmark for Process Errors Identification

Mar 16, 2025
Via arXiv

PEBench: A Fictitious Dataset to Benchmark Machine Unlearning for Multimodal Large Language Models

Mar 16, 2025
Via arXiv

ProJudge: A Multi-Modal Multi-Discipline Benchmark and Instruction-Tuning Dataset for MLLM-based Process Judges

Mar 09, 2025
Via arXiv