Picture for Kunpeng Li

Kunpeng Li

Think in Strokes, Not Pixels: Process-Driven Image Generation via Interleaved Reasoning

Add code
Apr 07, 2026
Viaarxiv icon

Plasma GraphRAG: Physics-Grounded Parameter Selection for Gyrokinetic Simulations

Add code
Apr 07, 2026
Viaarxiv icon

Non-Markov Multi-Round Conversational Image Generation with History-Conditioned MLLMs

Add code
Jan 28, 2026
Viaarxiv icon

The Llama 4 Herd: Architecture, Training, Evaluation, and Deployment Notes

Add code
Jan 15, 2026
Viaarxiv icon

REFA: Real-time Egocentric Facial Animations for Virtual Reality

Add code
Jan 07, 2026
Viaarxiv icon

PhyGDPO: Physics-Aware Groupwise Direct Preference Optimization for Physically Consistent Text-to-Video Generation

Add code
Dec 31, 2025
Viaarxiv icon

Token-Shuffle: Towards High-Resolution Image Generation with Autoregressive Models

Add code
Apr 24, 2025
Viaarxiv icon

Transfer between Modalities with MetaQueries

Add code
Apr 08, 2025
Figure 1 for Transfer between Modalities with MetaQueries
Figure 2 for Transfer between Modalities with MetaQueries
Figure 3 for Transfer between Modalities with MetaQueries
Figure 4 for Transfer between Modalities with MetaQueries
Viaarxiv icon

MoCha: Towards Movie-Grade Talking Character Synthesis

Add code
Mar 30, 2025
Figure 1 for MoCha: Towards Movie-Grade Talking Character Synthesis
Figure 2 for MoCha: Towards Movie-Grade Talking Character Synthesis
Figure 3 for MoCha: Towards Movie-Grade Talking Character Synthesis
Figure 4 for MoCha: Towards Movie-Grade Talking Character Synthesis
Viaarxiv icon

An Egocentric Vision-Language Model based Portable Real-time Smart Assistant

Add code
Mar 06, 2025
Viaarxiv icon