Picture for Hao Zhong

Hao Zhong

LLaDA2.1: Speeding Up Text Diffusion via Token Editing

Add code
Feb 09, 2026
Viaarxiv icon

SpatCode: Rotary-based Unified Encoding Framework for Efficient Spatiotemporal Vector Retrieval

Add code
Jan 14, 2026
Viaarxiv icon

Preserving Source Video Realism: High-Fidelity Face Swapping for Cinematic Quality

Add code
Dec 08, 2025
Viaarxiv icon

Active-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO

Add code
May 27, 2025
Viaarxiv icon

Omni-R1: Reinforcement Learning for Omnimodal Reasoning via Two-System Collaboration

Add code
May 26, 2025
Viaarxiv icon