Zhe Kong

LongCat-Video-Avatar 1.5 Technical Report

May 26, 2026

MotionMERGE: A Multi-granular Framework for Human Motion Editing, Reasoning, Generation, and Explanation

May 18, 2026

InfiniteTalk: Audio-driven Video Generation for Sparse-Frame Video Dubbing

Aug 19, 2025

DAM-VSR: Disentanglement of Appearance and Motion for Video Super-Resolution

Jul 01, 2025

Let Them Talk: Audio-Driven Multi-Person Conversational Video Generation

May 28, 2025

MG-MotionLLM: A Unified Framework for Motion Comprehension and Generation across Multiple Granularities

Apr 03, 2025

StereoCrafter: Diffusion-based Generation of Long and High-fidelity Stereoscopic 3D from Monocular Videos

Sep 11, 2024

OMG: Occlusion-friendly Personalized Multi-concept Generation in Diffusion Models

Mar 16, 2024

Dual Teacher Knowledge Distillation with Domain Alignment for Face Anti-spoofing

Jan 02, 2024

Fingerprint Presentation Attack Detection by Channel-wise Feature Denoising

Nov 15, 2021