Picture for Jie Zhou

Jie Zhou

StyleDecoupler: Generalizable Artistic Style Disentanglement

Add code
Jan 25, 2026
Viaarxiv icon

FGTBT: Frequency-Guided Task-Balancing Transformer for Unified Facial Landmark Detection

Add code
Jan 19, 2026
Viaarxiv icon

Supervision-by-Hallucination-and-Transfer: A Weakly-Supervised Approach for Robust and Precise Facial Landmark Detection

Add code
Jan 19, 2026
Viaarxiv icon

Dual-Stream Collaborative Transformer for Image Captioning

Add code
Jan 19, 2026
Viaarxiv icon

Moaw: Unleashing Motion Awareness for Video Diffusion Models

Add code
Jan 19, 2026
Viaarxiv icon

STEP3-VL-10B Technical Report

Add code
Jan 15, 2026
Viaarxiv icon

Mimic Human Cognition, Master Multi-Image Reasoning: A Meta-Action Framework for Enhanced Visual Understanding

Add code
Jan 12, 2026
Viaarxiv icon

Evaluating Accounting Reasoning Capabilities of Large Language Models

Add code
Jan 10, 2026
Viaarxiv icon

ArrowGEV: Grounding Events in Video via Learning the Arrow of Time

Add code
Jan 10, 2026
Viaarxiv icon

PsychEval: A Multi-Session and Multi-Therapy Benchmark for High-Realism AI Psychological Counselor

Add code
Jan 08, 2026
Viaarxiv icon