Picture for Kai Han

Kai Han

and Other Contributors

VersatileFFN: Achieving Parameter Efficiency in LLMs via Adaptive Wide-and-Deep Reuse

Add code
Dec 16, 2025
Viaarxiv icon

JoVA: Unified Multimodal Learning for Joint Video-Audio Generation

Add code
Dec 15, 2025
Viaarxiv icon

Positional Preservation Embedding for Multimodal Large Language Models

Add code
Oct 27, 2025
Viaarxiv icon

Revealing the Power of Post-Training for Small Language Models via Knowledge Distillation

Add code
Sep 30, 2025
Viaarxiv icon

Category Discovery: An Open-World Perspective

Add code
Sep 26, 2025
Figure 1 for Category Discovery: An Open-World Perspective
Figure 2 for Category Discovery: An Open-World Perspective
Figure 3 for Category Discovery: An Open-World Perspective
Figure 4 for Category Discovery: An Open-World Perspective
Viaarxiv icon

Inpaint4Drag: Repurposing Inpainting Models for Drag-Based Image Editing via Bidirectional Warping

Add code
Sep 04, 2025
Figure 1 for Inpaint4Drag: Repurposing Inpainting Models for Drag-Based Image Editing via Bidirectional Warping
Figure 2 for Inpaint4Drag: Repurposing Inpainting Models for Drag-Based Image Editing via Bidirectional Warping
Figure 3 for Inpaint4Drag: Repurposing Inpainting Models for Drag-Based Image Editing via Bidirectional Warping
Figure 4 for Inpaint4Drag: Repurposing Inpainting Models for Drag-Based Image Editing via Bidirectional Warping
Viaarxiv icon

Splat4D: Diffusion-Enhanced 4D Gaussian Splatting for Temporally and Spatially Consistent Content Creation

Add code
Aug 11, 2025
Viaarxiv icon

When Deepfake Detection Meets Graph Neural Network:a Unified and Lightweight Learning Framework

Add code
Aug 07, 2025
Viaarxiv icon

OmniEval: A Benchmark for Evaluating Omni-modal Models with Visual, Auditory, and Textual Inputs

Add code
Jun 26, 2025
Viaarxiv icon

EAQuant: Enhancing Post-Training Quantization for MoE Models via Expert-Aware Optimization

Add code
Jun 16, 2025
Viaarxiv icon