Picture for Kai Zhang

Kai Zhang

Victor

Toward Generalizable Deblurring: Leveraging Massive Blur Priors with Linear Attention for Real-World Scenarios

Add code
Jan 10, 2026
Viaarxiv icon

EDCO: Dynamic Curriculum Orchestration for Domain-specific Large Language Model Fine-tuning

Add code
Jan 07, 2026
Viaarxiv icon

Layer-Order Inversion: Rethinking Latent Multi-Hop Reasoning in Large Language Models

Add code
Jan 07, 2026
Viaarxiv icon

Self-Evaluation Unlocks Any-Step Text-to-Image Generation

Add code
Dec 26, 2025
Viaarxiv icon

Both Semantics and Reconstruction Matter: Making Representation Encoders Ready for Text-to-Image Generation and Editing

Add code
Dec 19, 2025
Viaarxiv icon

GuangMing-Explorer: A Four-Legged Robot Platform for Autonomous Exploration in General Environments

Add code
Dec 17, 2025
Viaarxiv icon

Foundation Models in Biomedical Imaging: Turning Hype into Reality

Add code
Dec 17, 2025
Figure 1 for Foundation Models in Biomedical Imaging: Turning Hype into Reality
Figure 2 for Foundation Models in Biomedical Imaging: Turning Hype into Reality
Figure 3 for Foundation Models in Biomedical Imaging: Turning Hype into Reality
Figure 4 for Foundation Models in Biomedical Imaging: Turning Hype into Reality
Viaarxiv icon

L-STEC: Learned Video Compression with Long-term Spatio-Temporal Enhanced Context

Add code
Dec 14, 2025
Viaarxiv icon

E-RayZer: Self-supervised 3D Reconstruction as Spatial Visual Pre-training

Add code
Dec 11, 2025
Figure 1 for E-RayZer: Self-supervised 3D Reconstruction as Spatial Visual Pre-training
Figure 2 for E-RayZer: Self-supervised 3D Reconstruction as Spatial Visual Pre-training
Figure 3 for E-RayZer: Self-supervised 3D Reconstruction as Spatial Visual Pre-training
Figure 4 for E-RayZer: Self-supervised 3D Reconstruction as Spatial Visual Pre-training
Viaarxiv icon

From Noise to Latent: Generating Gaussian Latents for INR-Based Image Compression

Add code
Nov 11, 2025
Viaarxiv icon