Picture for Kai Liu

Kai Liu

refer to the report for detailed contributions

JavisDiT++: Unified Modeling and Optimization for Joint Audio-Video Generation

Add code
Feb 22, 2026
Viaarxiv icon

DM0: An Embodied-Native Vision-Language-Action Model towards Physical AI

Add code
Feb 16, 2026
Viaarxiv icon

LDA-1B: Scaling Latent Dynamics Action Model via Universal Embodied Data Ingestion

Add code
Feb 12, 2026
Viaarxiv icon

OMEGA-Avatar: One-shot Modeling of 360° Gaussian Avatars

Add code
Feb 12, 2026
Viaarxiv icon

AUHead: Realistic Emotional Talking Head Generation via Action Units Control

Add code
Feb 10, 2026
Viaarxiv icon

PlanViz: Evaluating Planning-Oriented Image Generation and Editing for Computer-Use Tasks

Add code
Feb 06, 2026
Viaarxiv icon

ERNIE 5.0 Technical Report

Add code
Feb 04, 2026
Viaarxiv icon

D-CORE: Incentivizing Task Decomposition in Large Reasoning Models for Complex Tool Use

Add code
Feb 02, 2026
Viaarxiv icon

JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation

Add code
Dec 28, 2025
Viaarxiv icon

Fose: Fusion of One-Step Diffusion and End-to-End Network for Pansharpening

Add code
Dec 19, 2025
Viaarxiv icon