Fan Wang

WorldVLA: Towards Autoregressive Action World Model

Jun 26, 2025
Via arXiv

DualFast: Dual-Speedup Framework for Fast Sampling of Diffusion Models

Jun 16, 2025
Via arXiv

PlayerOne: Egocentric World Simulator

Jun 11, 2025
Via arXiv

AnimateAnyMesh: A Feed-Forward 4D Foundation Model for Text-Driven Universal Mesh Animation

Jun 11, 2025
Via arXiv

FPSAttention: Training-Aware FP8 and Sparsity Co-Design for Fast Video Diffusion

Jun 06, 2025
Via arXiv

EOC-Bench: Can MLLMs Identify, Recall, and Forecast Objects in an Egocentric World?

Jun 05, 2025
Via arXiv

Conceptual Framework Toward Embodied Collective Adaptive Intelligence

May 29, 2025
Via arXiv

Benchmarking Multimodal Mathematical Reasoning with Explicit Visual Dependency

Apr 29, 2025
Via arXiv

Flow Along the K-Amplitude for Generative Modeling

Apr 27, 2025
Via arXiv

3DV-TON: Textured 3D-Guided Consistent Video Try-on via Diffusion Models

Apr 24, 2025
Via arXiv