Picture for Kai Ye

Kai Ye

The Less You Depend, The More You Learn: Synthesizing Novel Views from Sparse, Unposed Images without Any 3D Knowledge

Add code
Jun 11, 2025
Viaarxiv icon

Doc-CoB: Enhancing Multi-Modal Document Understanding with Visual Chain-of-Boxes Reasoning

Add code
May 24, 2025
Viaarxiv icon

More Clear, More Flexible, More Precise: A Comprehensive Oriented Object Detection benchmark for UAV

Add code
Apr 28, 2025
Viaarxiv icon

S$^2$Teacher: Step-by-step Teacher for Sparsely Annotated Oriented Object Detection

Add code
Apr 15, 2025
Viaarxiv icon

Cross-Frequency Implicit Neural Representation with Self-Evolving Parameters

Add code
Apr 15, 2025
Viaarxiv icon

SafeSpeech: Robust and Universal Voice Protection Against Malicious Speech Synthesis

Add code
Apr 14, 2025
Viaarxiv icon

Zeus: Zero-shot LLM Instruction for Union Segmentation in Multimodal Medical Imaging

Add code
Apr 09, 2025
Viaarxiv icon

Enhancing Large-scale UAV Route Planing with Global and Local Features via Reinforcement Graph Fusion

Add code
Dec 20, 2024
Viaarxiv icon

Grasp What You Want: Embodied Dexterous Grasping System Driven by Your Voice

Add code
Dec 14, 2024
Viaarxiv icon

GeoSplatting: Towards Geometry Guided Gaussian Splatting for Physically-based Inverse Rendering

Add code
Oct 31, 2024
Viaarxiv icon