Picture for Wenshuo Peng

Wenshuo Peng

WildWorld: A Large-Scale Dataset for Dynamic World Modeling with Actions and Explicit State toward Generative ARPG

Add code
Mar 24, 2026
Viaarxiv icon

PyVision-RL: Forging Open Agentic Vision Models via RL

Add code
Feb 24, 2026
Viaarxiv icon

SVBench: Evaluation of Video Generation Models on Social Reasoning

Add code
Dec 25, 2025
Viaarxiv icon

Yume: An Interactive World Generation Model

Add code
Jul 23, 2025
Figure 1 for Yume: An Interactive World Generation Model
Figure 2 for Yume: An Interactive World Generation Model
Figure 3 for Yume: An Interactive World Generation Model
Figure 4 for Yume: An Interactive World Generation Model
Viaarxiv icon

T3M: Text Guided 3D Human Motion Synthesis from Speech

Add code
Aug 23, 2024
Figure 1 for T3M: Text Guided 3D Human Motion Synthesis from Speech
Figure 2 for T3M: Text Guided 3D Human Motion Synthesis from Speech
Figure 3 for T3M: Text Guided 3D Human Motion Synthesis from Speech
Figure 4 for T3M: Text Guided 3D Human Motion Synthesis from Speech
Viaarxiv icon

Data Adaptive Traceback for Vision-Language Foundation Models in Image Classification

Add code
Jul 11, 2024
Figure 1 for Data Adaptive Traceback for Vision-Language Foundation Models in Image Classification
Figure 2 for Data Adaptive Traceback for Vision-Language Foundation Models in Image Classification
Figure 3 for Data Adaptive Traceback for Vision-Language Foundation Models in Image Classification
Figure 4 for Data Adaptive Traceback for Vision-Language Foundation Models in Image Classification
Viaarxiv icon