Fan Wang

FPSAttention: Training-Aware FP8 and Sparsity Co-Design for Fast Video Diffusion

Jun 06, 2025

EOC-Bench: Can MLLMs Identify, Recall, and Forecast Objects in an Egocentric World?

Jun 05, 2025

Conceptual Framework Toward Embodied Collective Adaptive Intelligence

May 29, 2025

Benchmarking Multimodal Mathematical Reasoning with Explicit Visual Dependency

Apr 29, 2025

Flow Along the K-Amplitude for Generative Modeling

Apr 27, 2025

3DV-TON: Textured 3D-Guided Consistent Video Try-on via Diffusion Models

Apr 24, 2025

Uni3C: Unifying Precisely 3D-Enhanced Camera and Human Motion Controls for Video Generation

Apr 21, 2025

RealisDance-DiT: Simple yet Strong Baseline towards Controllable Character Animation in the Wild

Apr 21, 2025

DyDiT++: Dynamic Diffusion Transformers for Efficient Visual Generation

Apr 09, 2025

Joint Similarity Item Exploration and Overlapped User Guidance for Multi-Modal Cross-Domain Recommendation

Feb 22, 2025