Zhaoxin Fan

Long-VLA: Unleashing Long-Horizon Capability of Vision Language Action Model for Robot Manipulation

Aug 28, 2025
Via arXiv

Can Structured Templates Facilitate LLMs in Tackling Harder Tasks? : An Exploration of Scaling Laws by Difficulty

Aug 26, 2025
Via arXiv

HieroAction: Hierarchically Guided VLM for Fine-Grained Action Analysis

Aug 23, 2025
Via arXiv

Mem4D: Decoupling Static and Dynamic Memory for Dynamic Scene Reconstruction

Aug 12, 2025
Via arXiv

Undress to Redress: A Training-Free Framework for Virtual Try-On

Aug 11, 2025
Via arXiv

Pose-RFT: Enhancing MLLMs for 3D Pose Generation via Hybrid Action Reinforcement Fine-Tuning

Aug 11, 2025
Via arXiv

SyncTalk++: High-Fidelity and Efficient Synchronized Talking Heads Synthesis Using Gaussian Splatting

Jun 17, 2025
Via arXiv

RoboPARA: Dual-Arm Robot Planning with Parallel Allocation and Recomposition Across Tasks

Jun 07, 2025
Via arXiv

DualTalk: Dual-Speaker Interaction for 3D Talking Head Conversations

May 26, 2025
Via arXiv

HF-VTON: High-Fidelity Virtual Try-On via Consistent Geometric and Semantic Alignment

May 26, 2025
Via arXiv