Picture for Pengfei Zhou

Pengfei Zhou

TMD-Bench: A Multi-Level Evaluation Paradigm for Music-Dance Co-Generation

Add code
May 03, 2026
Viaarxiv icon

ClawMark: A Living-World Benchmark for Multi-Turn, Multi-Day, Multimodal Coworker Agents

Add code
Apr 26, 2026
Viaarxiv icon

Not All Frames Are Equal: Complexity-Aware Masked Motion Generation via Motion Spectral Descriptors

Add code
Mar 31, 2026
Viaarxiv icon

SentGraph: Hierarchical Sentence Graph for Multi-hop Retrieval-Augmented Question Answering

Add code
Jan 06, 2026
Viaarxiv icon

Act2Goal: From World Model To General Goal-conditioned Policy

Add code
Dec 29, 2025
Viaarxiv icon

Uni-Neur2Img: Unified Neural Signal-Guided Image Generation, Editing, and Stylization via Diffusion Transformers

Add code
Dec 21, 2025
Viaarxiv icon

RAPID^3: Tri-Level Reinforced Acceleration Policies for Diffusion Transformer

Add code
Sep 26, 2025
Viaarxiv icon

MDK12-Bench: A Comprehensive Evaluation of Multimodal Large Language Models on Multidisciplinary Exams

Add code
Aug 09, 2025
Viaarxiv icon

Genie Envisioner: A Unified World Foundation Platform for Robotic Manipulation

Add code
Aug 07, 2025
Figure 1 for Genie Envisioner: A Unified World Foundation Platform for Robotic Manipulation
Figure 2 for Genie Envisioner: A Unified World Foundation Platform for Robotic Manipulation
Figure 3 for Genie Envisioner: A Unified World Foundation Platform for Robotic Manipulation
Figure 4 for Genie Envisioner: A Unified World Foundation Platform for Robotic Manipulation
Viaarxiv icon

CrossLinear: Plug-and-Play Cross-Correlation Embedding for Time Series Forecasting with Exogenous Variables

Add code
May 29, 2025
Viaarxiv icon